Choose from more than 15,059 AI tools and 9,214 AI news posts.
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm
Blazingly fast LLM inference. - EricLBuehler/mistral.rs [...]
Match Score: 33.53
LLM inference in C/C++. - ggerganov/llama.cpp [...]
GitHub is where people build software. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects. [...]
Match Score: 28.86
A Gradio web UI for Large Language Models with support for multiple inference backends. - oobabooga/text-generation-webui [...]
Match Score: 27.77
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. - turboderp/exllama [...]
Match Score: 27.69