Select from over 15,059 AI tools and 9,214 AI news posts.
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm
Blazingly fast LLM inference. Contribute to EricLBuehler/mistral.rs development by creating an account on GitHub. [...]
Match Score: 33.94
LLM inference in C/C++. Contribute to ggerganov/llama.cpp development by creating an account on GitHub. [...]
GitHub is where people build software. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects. [...]
Match Score: 29.93
Match Score: 29.90
GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects. [...]
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. - turboderp/exllama [...]
Match Score: 29.02
A Gradio web UI for Large Language Models with support for multiple inference backends. - oobabooga/text-generation-webui [...]
Match Score: 27.97
Memory Lane is an innovative platform that harnesses the power of AI to preserve and revisit cherished memories. By uploading photos, videos, and stories, users can create a digital time capsule that [...]
Match Score: 27.19
Use hosted open-source models and achieve faster, cheaper and more accurate inference results than with proprietary APIs. [...]
Match Score: 24.08