25 TOPS at 5W. Fanless AI Inference for Any Enclosure.

Deepx - 25 TOPS at 5W. Fanless AI Inference for Any Enclosure.

 

From embedded modules to high-density PCIe cards and complete systems, DEEPX delivers production AI inference where cloud and GPU solutions cannot operate. Exclusively available from Macnica.

 

The DEEPX product line spans from a 42mm M.2 module for drones and embedded devices to high-density PCIe inference cards for multi-channel surveillance servers. The DX-M and DX-H families are deployed today in security systems, smart factories, robotics platforms, and transportation infrastructure worldwide. All products share the same SDK and model framework support.

 

Macnica is the exclusive authorized distributor for North America, providing design-in support, evaluation hardware, and supply chain planning for production volumes, covering the full engagement from concept through volume ramp. Contact Macnica to request an evaluation kit or discuss your application with a field application engineer.

 

 

 

DEEPX NPU Architecture Built Around Power, Thermal, and Precision

 

GPUs overheat in sealed enclosures, low-power NPUs cannot sustain production frame rates on real detection models, and cloud inference introduces round-trip latency and data egress that industrial and safety-critical systems cannot tolerate. DEEPX NPU architecture is designed specifically around these failure modes, with power envelope, thermal characteristics, and quantization fidelity treated as primary design constraints rather than secondary considerations. All performance claims are independently benchmarked against production workloads.

 

Deepx NPU Infographic

 

The DEEPX IQ8 quantization engine addresses the precision trade-off that limits most edge NPU deployments. Standard INT8 quantization reduces memory bandwidth and compute requirements but introduces accuracy degradation that requires per-layer calibration to recover. IQ8 performs INT8-width computation while preserving the numerical range of FP32 inference, delivering full-speed throughput without model accuracy loss and without manual calibration. For teams deploying YOLO-family or custom detection models, this removes a significant pre-production engineering step.

 

 

Thermal performance is validated in direct hardware comparison. Running identical workloads, the DX-M1 board surface measured 35.5 degrees C versus 60.7 degrees C for a competing chip. In sealed enclosures without active cooling, that 25-degree delta is the margin between a qualified fanless design and one that requires thermal redesign or derating at operating temperature.

 

 

 

DEEPX Products: Chips, modules, PCIe cards, and Complete Systems

 

The DEEPX portfolio covers two distinct deployment scenarios. The DX-M Series are bare chips and M.2 modules for embedding directly into host systems: robots, cameras, vehicles, and edge devices. The DX-H Series are low-profile PCIe cards for servers and workstations running multi-channel video analytics and AI inference at scale. All products share the same DXNN SDK and support the same model frameworks.

 

DEEPX Products - Chips & modules

 

 

DX-M1 SoCs and M2 Modules

DX-M1 SoCs and M2 Modules

Embedded chip with external memory, industrial and automotive grades

 

The DX-M1 is the flagship embedded NPU. Its external memory architecture lets designers size LPDDR4x or LPDDR5 independently of the chip, supporting up to 8GB for workloads running multiple simultaneous models. Available as a bare chip or as an M.2 2280 module with 4GB LPDDR5 onboard. AEC-Q100 Grade 3 qualified, with Grade 2 available on request. This is the only DEEPX chip with automotive-grade certification.

 

Best for: Designs requiring memory flexibility, automotive-grade certification, or multi-model inference. Robotics platforms, industrial inspection systems, and smart city nodes.

 

AI performance

25 TOPS (INT8)

Power

1 to 5W

Memory

Up to 8GB external LPDDR5

Chip package

17 x 17mm FCBGA

Module

M.2 2280, M-Key

Host interface

PCIe Gen3 x4

Temp range

-40 to 85 degrees C / AEC-Q100 Grade 3

Status

Mass production

DX-M1M SoCs and M.2 Modules

DX-M1M SoCs and M.2 Modules

Compact all-in-one chip and module with integrated memory, half the footprint of DX-M1

 

The DX-M1M integrates 2GB LPDDR4x directly into the chip package, eliminating the external DRAM device and reducing module length to 42mm. The result is a complete AI accelerator in a form factor that fits where the M1 cannot: drones, dashcams, and handheld devices. Hardware-based security includes a Root of Trust and True Random Number Generator.

 

Best for: Drones, ADAS systems, dashcams, autonomous mobile robots, and AIoT devices where size, weight, and power budget are the binding constraints.

 

AI performance

25 TOPS (INT8)

Power

3W typical

Memory

2GB integrated LPDDR4x

Chip package

21 x 21mm FCL-BGA

Module

M.2 2242, M+B Key

Host interface

PCIe Gen3 x2 (module)

Temp range

-40 to 85 degrees C industrial

Status

Mass production Q4 2025

DX-H1 Quattro PCIe Card

DX-H1 Quattro PCIe Card

100 TOPS inference card, four DX-M1 chips on one low-profile PCIe slot

 

The DX-H1 Quattro installs into a standard PCIe x16 slot and delivers 100 TOPS at 20W TDP, enough to process up to 128 AI video channels from a single workstation, or 768 channels from a 2U server with six cards. It works alongside existing NVR or decoder infrastructure; no onboard video codec is included. The host motherboard must support PCIe bifurcation x4x4x4x4, splitting the x16 slot into four independent x4 lanes. Verify BIOS support before specifying. Compatible with Dell, HPE, Lenovo, Supermicro, and KAYTUS servers.

 

Best for: Adding AI inference to existing camera and NVR infrastructure. Confirm BIOS bifurcation x4x4x4x4 support before selecting. Compatible with Dell, HPE, Lenovo, Supermicro, and KAYTUS servers.

 

AI performance

100 TOPS (INT8)

Power

20W TDP

Memory

16GB LPDDR5

Video codec

Not included

Form factor

Low-profile PCIe card

Interface

PCIe x16 (bifurcation x4x4x4x4 required)

Dimensions

68.1 x 165.9 x 16mm with heatsink

Max channels

128 channels per workstation (YOLOv5s at 25fps)

Status

Mass production

DX-H1 V-NPU PCIe Card

DX-H1 V-NPU PCIe Card

Decode and infer on one card, full H.264/H.265 video pipeline with onboard codec

 

The DX-H1 V-NPU adds hardware H.264/H.265 codec blocks alongside its dual NPUs, enabling the card to ingest, decode, and analyze live camera streams without involving the host CPU. One card handles 64 channels of 1080p decode and 50 TOPS of AI inference simultaneously, making it the right choice for new-build AI surveillance systems where eliminating separate video servers is a design goal. The PCIe interface uses x8 electrical lanes in a physical x16 slot, compatible with a broader range of server motherboards than the Quattro. DEEPX reports 82% lower hardware cost and 85% electricity savings versus GPU setups in 1,000-channel deployments over five years.

Best for: New-build AI CCTV infrastructure where the goal is to consolidate decode and inference onto one card. DEEPX reports 82% lower hardware cost and 85% electricity savings versus GPU setups in 1,000-channel deployments over five years.

 

AI performance

50 TOPS (INT8)

Power

30W TDP

Memory

24GB LPDDR5

Video codec

H.264/H.265: 64CH decode, 32CH encode at 1080p 30fps

Form factor

Low-profile PCIe card

Interface

PCIe x16 physical, x8 electrical

Dimensions

68.1 x 164.3 x 18mm with heatsink

Max channels

64 channels per workstation (YOLOv5s at 30fps)

Status

Available

 

 

From Chips to Complete Systems: AI Systems with DEEPX NPUs

 

For teams that need a validated AI system without a custom hardware design cycle, the DX-AIPlayer series integrates DX-M1 NPU acceleration with onboard compute, storage, and I/O in a production-ready enclosure. Compact and expanded configurations are available to match application I/O and compute requirements.

 

DX-AIBOX-M1-N97-08064-A
DX-AIBOX-M1-i5-08128-i

 

 

Part Number

DX-AIBOX-M1-N97-08064-A

DX-AIBOX-M1-i5-08128-i

Dimensions (mm)

95 x 95 x 55 mm

81 x 150 x190 mm

CPU

Intel® Processor N97

13th Gen Intel® Core™ i5-1340PE

Storage

eMMC Onboard 64GB

2.5'' SATA 128GB

Memory

RAM: 8GB LPDDR5

RAM: 8GB DDR4

I/O Interfaces

HDMI 2.0b x 1/DP 1.2 x 1

Gb Ethernet (full speed) RJ-45 x 2

RS-232/422/485 x 2

USB 2.0 x 2 (via 10 Pin Header x 1) USB 3.2 Gen 2 x 3 (Type-A)

HDMI/DP,

3 x2.5GbE LAN

4 x COM

6 x USB

1 x 12-bit DIO

Certification

KC, CE, and FCC

Power

12V DC-in, 5A

12~28V DC, RoHS

Weight

450g

1.88kg

Operating Temperature

0°C ~ 60° C

-20°C ~ 60°C

 

 

Applications Where DEEPX Is Deployed

 

DEEPX hardware is deployed across industries where real-time inference, low power consumption, and fanless operation are hard requirements. 

 

Deepx Applications

 

 

DXNN® SDK: One Toolchain for Every DEEPX Product

 

The DXNN® SDK is the unified software stack for all DEEPX hardware. It provides a compiler that takes models from standard training frameworks and produces optimized binaries for DEEPX NPUs, applying IQ8 quantization automatically without manual per-layer tuning. The runtime supports concurrent multi-model execution and manages memory across chips.

 

The SDK supports x86 and ARM aarch-64 host architectures, runs on Windows and major Linux distributions, and is compatible with containerized deployments via Docker. Software partners including Edge Impulse, CVEDIA, NetworkOptix, Nota AI, and others have pre-integrated DEEPX hardware into their platforms using the DXNN SDK. 

 

DXNN® SDK: One Toolchain for Every DEEPX Product

 

 

Contact Macnica Americas

 

Macnica provides evaluation hardware, design-in support, and production supply for all DEEPX products in America and Canada. Whether you are evaluating a chip for a new design or planning a production order, the team handles the full engagement from first sample through volume ramp.

 

Contact Macnica to request evaluation hardware, speak to an expert, or get a quote

 

Would you like to be added to our mailing list to receive the Macnica Americas Newsletter?

Newsletter Signup