Inference Engine Architecture

Taalas Etches AI Models Onto Transistors To Rocket Boost Inference

Adding big blocks of SRAM to collections of AI tensor engines, or better still, a waferscale collection of such engines, turbocharges AI inference, as has ...

Morningstar

Quadric, Inference Engine for On-Device AI Chips, Raises $30M Series C as Design Wins Accelerate Across Edge LLMs, Automotive, and Enterprise

Tripling product revenues, comprehensive developer tools, and scalable inference IP for vision and LLM workloads, position Quadric as the platform for on-device AI. ACCELERATE Fund, managed by BEENEXT ...

VentureBeat

Pipeshift cuts GPU usage for AI inferences 75% with modular inference engine

DeepSeek’s release of R1 this week was a ...

Reuters

Fortytwo Introduces ‘Swarm Inference’: A New AI Architecture That Outperforms Frontier Models on Key Benchmarks

MOUNTAIN VIEW, CA, October 31, 2025 (EZ Newswire) -- Fortytwo research lab today announced benchmarking results for its new AI architecture, known as Swarm Inference. Across key AI ...

New Atlas

Next-level AI engine comes top in LLM speed showdown

Responses to AI chat prompts not snappy enough? California-based generative AI company Groq has a super quick solution in its LPU Inference Engine, which has recently outperformed all contenders in ...

insideHPC

VAST Data Redesigns Inference Architecture for Agentic AI with NVIDIA

NEW YORK – VAST Data, the AI Operating System company, today announced a new inference architecture that enables the NVIDIA Inference Context Memory Storage Platform – deployments for the era of ...

The Next Platform

Cerebras Trains Llama Models To Leap Over GPUs

It was only a few months ago when waferscale compute pioneer Cerebras Systems was bragging that a handful of its WSE-3 engines lashed together could run circles around Nvidia GPU instances based on ...

Forbes

Distributed Intelligence Is Here—And Reshaping Device Architecture

Iri Trashanski, Chief Strategy Officer at Ceva, is shaping the future of the Smart Edge with extensive experience across tech sectors. AI inference is happening across a network of local ...

SiliconANGLE

AI inference startup Runware raises $50M to make AI run faster

Artificial intelligence startup Runware Ltd. wants to make high-performance inference accessible to every company and application developer after raising $50 million in Series A funding. It’s backed ...
