Credibility Coalition - Economics of Misinformation Working Group
I became a member of the Credibility Coalition in early 2020. At the beginning of 2021, I became chair of the Economics of Misinformation working group. Most recently, we have been examining how programmatic advertising causes brands to unintentionally pay millions of dollars to spreaders of misinformation.
Kensho Derived Wikimedia Dataset
A Kaggle-hosted dataset that serves as a sandbox for doing natural language processing with Wikimedia data. It includes plain text and link offsets from Wikipedia, along with the Wikidata knowledge graph.
qwikidata Python Package
A Python package that allows you to represent Wikidata items, properties, and lexemes as classes.
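To give a flavor of what "representing a Wikidata item as a class" means, here is a minimal standalone sketch. It does not use qwikidata and is not its API: the class `ItemSketch`, its methods, and the hand-trimmed `sample` dictionary are all hypothetical names made up for illustration. The only assumption is that the input follows the general shape of Wikidata's entity JSON (`id`, `labels`, `claims`).

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class ItemSketch:
    """Hypothetical thin wrapper around one Wikidata-style entity dict."""

    entity_dict: Dict[str, Any]

    @property
    def entity_id(self) -> str:
        return self.entity_dict["id"]

    def get_label(self, lang: str = "en") -> str:
        # Labels are keyed by language code; return "" when a language is missing.
        return self.entity_dict.get("labels", {}).get(lang, {}).get("value", "")

    def get_item_targets(self, property_id: str) -> List[str]:
        # Collect the item IDs that statements for this property point at.
        targets = []
        for statement in self.entity_dict.get("claims", {}).get(property_id, []):
            snak = statement["mainsnak"]
            if snak.get("snaktype") != "value":
                continue  # skip "novalue" / "somevalue" snaks
            datavalue = snak["datavalue"]
            if datavalue["type"] == "wikibase-entityid":
                targets.append(datavalue["value"]["id"])
        return targets


# Hand-trimmed example in the shape of Wikidata entity JSON (illustrative only).
sample = {
    "id": "Q42",
    "type": "item",
    "labels": {"en": {"language": "en", "value": "Douglas Adams"}},
    "claims": {
        "P31": [
            {
                "mainsnak": {
                    "snaktype": "value",
                    "property": "P31",
                    "datavalue": {
                        "type": "wikibase-entityid",
                        "value": {"entity-type": "item", "id": "Q5"},
                    },
                }
            }
        ]
    },
}

item = ItemSketch(sample)
print(item.entity_id, item.get_label(), item.get_item_targets("P31"))
```

Wrapping the raw dictionary in a class keeps the nested JSON lookups in one place, so calling code can ask for a label or a property's targets without knowing the layout.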
hilbertcurve Python Package
A Python package for calculating Hilbert curves in arbitrary dimensions with arbitrarily large integers. It supports multiprocessing for converting a large number of points to distances along the curve, or vice versa.
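For readers unfamiliar with the point/distance conversion, the following is a small standalone sketch of the well-known iterative 2D Hilbert curve mapping. It is not the package's implementation or API, covers only two dimensions, and the names `point_to_distance`, `distance_to_point`, and `_rotate` are made up for this illustration. It assumes `side` (the grid width) is a power of two.

```python
from typing import Tuple


def _rotate(n: int, x: int, y: int, rx: int, ry: int) -> Tuple[int, int]:
    """Rotate/flip a quadrant so the sub-curve is oriented correctly."""
    if ry == 0:
        if rx == 1:
            x = n - 1 - x
            y = n - 1 - y
        x, y = y, x
    return x, y


def point_to_distance(side: int, x: int, y: int) -> int:
    """Map a grid point (x, y) to its distance along the 2D Hilbert curve."""
    d = 0
    s = side // 2
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        d += s * s * ((3 * rx) ^ ry)
        x, y = _rotate(side, x, y, rx, ry)
        s //= 2
    return d


def distance_to_point(side: int, d: int) -> Tuple[int, int]:
    """Map a distance along the 2D Hilbert curve back to a grid point."""
    x = y = 0
    t = d
    s = 1
    while s < side:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        x, y = _rotate(s, x, y, rx, ry)
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


# Walk the curve on a 2x2 grid, then check a round trip on an 8x8 grid.
print([distance_to_point(2, d) for d in range(4)])  # [(0, 0), (0, 1), (1, 1), (1, 0)]
print(point_to_distance(8, *distance_to_point(8, 37)))  # 37
```

Because Python integers are unbounded, the same arithmetic works for very large grids without overflow, which is one reason a pure-Python approach suits this kind of calculation.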
Steve Tingley-Hock of the American Voter Project (the non-profit behind the Ohio Voter Project) has been collecting voter registration data for years. In collaboration with him, I've produced two Kaggle datasets and associated example notebooks.
You can learn more about Steve's work at,
A research project to build a browser extension that will help people reach Wikipedia articles relevant to stories they are viewing online. A perpetual work in progress.
Mentor for Harvard IACS Capstone Courses
Work done in collaboration with students in the Institute for Applied Computational Science Capstone Research course. The students typically write blog posts summarizing their work.
Mentor for Cornell Financial Engineering Projects