- Rule Induction of CNP Explanations (RICE) - A method and tools to try and create human-readable explanations of back box systems, such as Machine Learning models. You can read about the initial work on this in our pre-print on arXiv, “Why did you do that?”: Explaining black box models with Inductive Synthesis and check out the demo code in the RICE GitHub repository.
- ISA API - Part of the ISA Metadata Framework, I was lead developer of the ISA API authoring most of the code of its first versions. You can find out more on the ISA API GitHub repository.
- textmining3 - A new Python library I have been working on to lower the barriers to teaching text analytics. It is based on the
textminingPython library by Christian Peccei, Director of Analytics at 23andMe.
textmining3started out as Python 3 port of the original pacakge. You can find out more in the
- mico - My doctoral work was on developing peer-to-peer services for collaborative group work, building on Java ME and the JXTA protocols. It is quite outdated now (Java has moved on a lot and the JXTA project has since been abandoned), but you can check out the source code in the mico GitHub respository.
- Programming for Data Science - I developed and taught a course delivered through the Oxford University Department for Continuing Education as my contribution to continuous learning to help those wishing to learn data science. Each iteration of the course was pitched to the general public and delivered using state-of-the-art data science and collaboration tools over 10 weeks. I am currently working on converting Programming for Data Science into an online format. You can read more about the course on the OUDCE course catalog listing.