Skip to content
View vpj's full-sized avatar
😜
😜

Organizations

@labmlai

Block or report vpj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python 588 29 Updated Mar 26, 2025

Fully open reproduction of DeepSeek-R1

Python 23,970 2,190 Updated Apr 16, 2025

πŸ™ Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 55,077 5,413 Updated Apr 5, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,510 245 Updated Apr 7, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,353 319 Updated Nov 13, 2024

Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044

Python 32 5 Updated Oct 3, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 784 40 Updated Mar 3, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 430 49 Updated Sep 27, 2024

LLM101n: Let's build a Storyteller

33,179 1,811 Updated Aug 1, 2024
Jupyter Notebook 1,618 348 Updated Apr 16, 2025

LLM Analytics

TypeScript 654 28 Updated Oct 19, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,140 1,118 Updated Apr 16, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 24,201 1,518 Updated Apr 16, 2025

πŸ”Ž Monitor deep learning model training and hardware usage from your mobile phone πŸ“±

Python 2,147 140 Updated Apr 10, 2025

Curate better data for LLMs

Python 1,023 99 Updated Mar 19, 2024

Code for Quiet-STaR

Python 730 89 Updated Aug 21, 2024

Grok open release

Python 50,243 8,348 Updated Aug 30, 2024

A multi-programming language benchmark for LLMs

Python 240 43 Updated Jan 23, 2025

MLX: An array framework for Apple silicon

C++ 20,201 1,175 Updated Apr 16, 2025

DeepSeek LLM: Let there be answers

Makefile 6,303 974 Updated Feb 4, 2024

A quick guide (especially) for trending instruction finetuning datasets

3,006 195 Updated Nov 28, 2023

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 174,515 45,566 Updated Apr 16, 2025

πŸ“ CodeEdit App for macOS – Elevate your code editing experience. Open source, free forever.

Swift 21,616 1,060 Updated Apr 16, 2025

A terminal for a more modern age

TypeScript 63,069 3,563 Updated Apr 15, 2025

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,712 140 Updated Aug 4, 2024

Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)

Python 363 67 Updated Apr 11, 2024

The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER). We have shared a pre-trained 9B parameter model.

Jupyter Notebook 122 4 Updated Apr 29, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Python 40,776 5,206 Updated Oct 10, 2024

πŸ§‘β€πŸ« 60+ Implementations/tutorials of deep learning papers with side-by-side notes πŸ“; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 60,031 6,071 Updated Aug 24, 2024

Fast and memory-efficient exact attention

Python 16,911 1,607 Updated Apr 13, 2025
Next