Skip to content
View YZ-Cai's full-sized avatar

Highlights

  • Pro

Block or report YZ-Cai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Python 2,680 280 Updated Feb 12, 2025
Python 346 17 Updated Apr 9, 2025

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]

Python 588 57 Updated Mar 10, 2024

Test-time compute in information retrieval

Python 22 3 Updated Apr 8, 2025

Document Ranking with Large Language Models.

Python 144 17 Updated Mar 20, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,176 551 Updated Apr 11, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,540 840 Updated Apr 4, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,705 285 Updated Mar 10, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,388 703 Updated Apr 7, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,418 821 Updated Mar 1, 2025

Official Repo for Open-Reasoner-Zero

Python 1,832 90 Updated Apr 8, 2025

Fully local web research and report writing assistant

Python 6,945 682 Updated Mar 24, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 3,891 366 Updated Apr 11, 2025
Jupyter Notebook 2,491 346 Updated Feb 3, 2025

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 5,336 664 Updated Feb 23, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 15,416 1,590 Updated Mar 24, 2025

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 20,836 2,708 Updated Apr 9, 2025

Official code of the paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"

Python 106 14 Updated Dec 17, 2024

A collection of AWESOME things about Graph-Related LLMs.

2,098 149 Updated Apr 8, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 14,736 2,030 Updated Apr 10, 2025

Benchmark designed to evaluate the performance and cost-effectiveness of vector databases.

Python 686 192 Updated Apr 9, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 30,295 2,414 Updated Apr 10, 2025

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

HTML 13,798 46,662 Updated Apr 3, 2025

vsag is a vector indexing library used for similarity search.

C++ 246 30 Updated Apr 10, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 48,557 4,560 Updated Apr 11, 2025

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,216 794 Updated Apr 3, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 54,429 6,459 Updated Mar 31, 2025

The world’s fastest framework for building websites.

Go 79,386 7,727 Updated Apr 10, 2025

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,307 275 Updated Apr 9, 2025
Next