Starred repositories
Sample code for deep learning & neural networks
Investment Research for Everyone, Everywhere.
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Convert news articles, blog posts (and more) into audio podcast episodes using natural-sounding AI text-to-speech models
An extremely fast Python linter and code formatter, written in Rust.
A bridge between Lichess bots and chess engines
Curated list of datasets and tools for post-training.
Sunfish: a Python Chess Engine in 111 lines of code
A chess library for Python, with move generation and validation, PGN parsing and writing, Polyglot opening book reading, Gaviota tablebase probing, Syzygy tablebase probing, and UCI/XBoard engine c…
WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Clickhouse, Redpanda, Kong, Telegram, Datadog and Cloudflare Workers.
List of libraries, tools and APIs for web scraping and data processing.
Build a RAG dataset for your domain in just a few lines of code, using your XML sitemap
Chatmail Rust Core library, used by Android/iOS/desktop apps, bindings and bots 📧
🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
BibBot is a browser extension that removes the paywall on German online news sites using your library account's access to press databases.
Top2Vec learns jointly embedded topic, document and word vectors.
Basis of FragDenStaat.de's „Koalitionstracker“
Master the fundamentals of machine learning, deep learning, and mathematical optimization by building key concepts and models from scratch using Python.
A benchmark for evaluating robustness of automatic genre identification models to test their usability for the automatic enrichment of large text collections with genre information.
Software for humanities scholars using quantitative or computational methods.
Anonymous Github is a proxy server to support anonymous browsing of GitHub repositories for open-science code and data.
Tool to bulk follow accounts related to Open Science on Mastodon. Runs at https://germanrepro.github.io/Mastodon-OpenScience/ Based on the DIY webapp to bulk follow sociological accounts on Mastodon b…