Select from over 22,900 AI tools and 17,900 AI news posts.
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm
Blazingly fast LLM inference. - EricLBuehler/mistral.rs [...]
Match Score: 32.09
LLM inference in C/C++. - ggerganov/llama.cpp [...]
GitHub is where people build software. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects. [...]
Match Score: 28.04
GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects. [...]
Match Score: 27.97
"Discover why the GitHub Pages site at https://kosuket-dev.github.io is not found. Learn about potential issues and solutions to resolve missing GitHub Pages sites." [...]
A Gradio web UI for Large Language Models with support for multiple inference backends. - oobabooga/text-generation-webui [...]
Match Score: 26.51
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. - turboderp/exllama [...]
Match Score: 26.12