Select from over 10,280 AI tools and 3,598 AI news posts.
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm
Blazingly fast LLM inference. - EricLBuehler/mistral.rs [...]
Match Score: 32.89
LLM inference in C/C++. - ggerganov/llama.cpp [...]
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. - turboderp/exllama [...]
Match Score: 30.27
GitHub is where people build software. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects. [...]
Match Score: 29.31
Match Score: 29.27
A Gradio web UI for Large Language Models with support for multiple inference backends. - oobabooga/text-generation-webui [...]
Match Score: 27.04
Use hosted open-source models and achieve faster, cheaper and more accurate inference results than with proprietary APIs. [...]
Match Score: 23.13
ChatGPT-powered AI that helps you understand any code repository on GitHub to boost your project [...]
Match Score: 20.70
Powerful model serving and orchestration for your AI & ML projects, without the hassle of managing Kubernetes & cloud infrastructure. [...]
Match Score: 20.31