Discover ANY AI to make more online for less.

select between over 22,900 AI Tool and 17,900 AI News Posts.


Microsoft introduces Phi-4-mini-flash-reasoning with up to 10x higher token throughput
Microsoft introduces Phi-4-mini-flash-reasoning with up to 10x higher token throughput

Microsoft has introduced Phi-4-mini-flash-reasoning, a lightweight AI model built for scenarios with tight computing, memory, or latency limits. Designed for edge devices and mobile apps, the model aims to deliver strong reasoning abilities without demanding hardware.
The article Microsoft introduces Phi-4-mini-flash-reasoning with up to 10x higher token throughput appeared first on THE DECODER.

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat
Microsoft built Phi-4-reasoning-vision-15B to know when to think — and wh

<p><a href="https://www.microsoft.com/en-us">Microsoft</a> on Tuesday released <a href="https://www.microsoft.com/en-us/research/blog/phi-4-reasoning-vision-and-the [...]

Match Score: 650.66

venturebeat
Phi-4 proves that a 'data-first' SFT methodology is the new diffe

<p>AI engineers often chase performance by scaling up LLM parameters and data, but the trend toward smaller, more efficient, and better-focused models has accelerated. </p><p>The &l [...]

Match Score: 558.21

venturebeat
Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro

<p>Google&#x27;s newest AI model is here:<a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-lite/"> Gemini 3.1 Flash-Lite</a [...]

Match Score: 216.51

venturebeat
Gemini 3 Flash arrives with reduced costs and latency — a powerful combo

<p>Enterprises can now harness the power of a large language model that&#x27;s near that of the state-of-the-art<a href="https://venturebeat.com/ai/google-unveils-gemini-3-claiming-t [...]

Match Score: 175.93

venturebeat
Researchers baked 3x inference speedups directly into LLM weights — witho

<p>As agentic AI workflows multiply the cost and latency of long reasoning chains, a team from the University of Maryland, Lawrence Livermore National Labs, Columbia University and TogetherAI ha [...]

Match Score: 128.22

venturebeat
AI inference costs dropped up to 10x on Nvidia's Blackwell — but har

<p>Lowering the cost of inference is typically a combination of hardware and software. A new analysis released Thursday by Nvidia details how four leading inference providers are reporting 4x to [...]

Match Score: 116.90

venturebeat
Microsoft says ungoverned AI agents could become corporate 'double age

<p>Microsoft today announced the general availability of <a href="https://www.microsoft.com/en-us/microsoft-agent-365">Agent 365</a> and <a href="https://www.micros [...]

Match Score: 116.71

Microsoft expands its SLM lineup with new multimodal and mini Phi-4 models
Microsoft expands its SLM lineup with new multimodal and mini Phi-4 models

<p><img width="1456" height="816" src="https://the-decoder.com/wp-content/uploads/2025/02/microsoft_ai_neural_network_illustration.png" class="attachment-fu [...]

Match Score: 112.49

venturebeat
Microsoft launches 3 new AI models in direct shot at OpenAI and Google

<p><a href="https://www.microsoft.com/en-us">Microsoft</a> on Wednesday launched <a href="https://microsoft.ai/news/today-were-announcing-3-new-world-class-mai-mode [...]

Match Score: 110.89