25 TOPS at 5W. Fanless AI Inference for Any Enclosure.

From embedded modules to high-density PCIe cards and complete systems, DEEPX delivers production AI inference where cloud and GPU solutions cannot operate. Exclusively available from Macnica.

The DEEPX product line spans from a 42mm M.2 module for drones and embedded devices to high-density PCIe inference cards for multi-channel surveillance servers. The DX-M and DX-H families are deployed today in security systems, smart factories, robotics platforms, and transportation infrastructure worldwide. All products share the same SDK and model framework support.

Macnica is the exclusive authorized distributor for North America, providing design-in support, evaluation hardware, and supply chain planning for production volumes, covering the full engagement from concept through volume ramp. Contact Macnica to request an evaluation kit or discuss your application with a field application engineer.

Architecture
Products
Complete Systems
Applications
Software
Contact Us

DEEPX NPU Architecture Built Around Power, Thermal, and Precision

GPUs overheat in sealed enclosures, low-power NPUs cannot sustain production frame rates on real detection models, and cloud inference introduces round-trip latency and data egress that industrial and safety-critical systems cannot tolerate. DEEPX NPU architecture is designed specifically around these failure modes, with power envelope, thermal characteristics, and quantization fidelity treated as primary design constraints rather than secondary considerations. All performance claims are independently benchmarked against production workloads.

The DEEPX IQ8 quantization engine addresses the precision trade-off that limits most edge NPU deployments. Standard INT8 quantization reduces memory bandwidth and compute requirements but introduces accuracy degradation that requires per-layer calibration to recover. IQ8 performs INT8-width computation while preserving the numerical range of FP32 inference, delivering full-speed throughput without model accuracy loss and without manual calibration. For teams deploying YOLO-family or custom detection models, this removes a significant pre-production engineering step.

Thermal performance is validated in direct hardware comparison. Running identical workloads, the DX-M1 board surface measured 35.5 degrees C versus 60.7 degrees C for a competing chip. In sealed enclosures without active cooling, that 25-degree delta is the margin between a qualified fanless design and one that requires thermal redesign or derating at operating temperature.

DEEPX Products: Chips, modules, PCIe cards, and Complete Systems

The DEEPX portfolio covers two distinct deployment scenarios. The DX-M Series are bare chips and M.2 modules for embedding directly into host systems: robots, cameras, vehicles, and edge devices. The DX-H Series are low-profile PCIe cards for servers and workstations running multi-channel video analytics and AI inference at scale. All products share the same DXNN SDK and support the same model frameworks.

DX-M1 SoCs & M.2
DX-M1M SoCs & M.2
DX-H1 Quattro PCIe
DX-H1 V-NPU PCIe

DX-M1 SoCs and M2 Modules

Embedded chip with external memory, industrial and automotive grades

The DX-M1 is the flagship embedded NPU. Its external memory architecture lets designers size LPDDR4x or LPDDR5 independently of the chip, supporting up to 8GB for workloads running multiple simultaneous models. Available as a bare chip or as an M.2 2280 module with 4GB LPDDR5 onboard. AEC-Q100 Grade 3 qualified, with Grade 2 available on request. This is the only DEEPX chip with automotive-grade certification.

Best for: Designs requiring memory flexibility, automotive-grade certification, or multi-model inference. Robotics platforms, industrial inspection systems, and smart city nodes.

AI performance	25 TOPS (INT8)
Power	1 to 5W
Memory	Up to 8GB external LPDDR5
Chip package	17 x 17mm FCBGA
Module	M.2 2280, M-Key
Host interface	PCIe Gen3 x4
Temp range	-40 to 85 degrees C / AEC-Q100 Grade 3
Status	Mass production

Get the DX-M1 datasheet

Get the DX-M1 M.2 datasheet

DX-M1M SoCs and M.2 Modules

Compact all-in-one chip and module with integrated memory, half the footprint of DX-M1

The DX-M1M integrates 2GB LPDDR4x directly into the chip package, eliminating the external DRAM device and reducing module length to 42mm. The result is a complete AI accelerator in a form factor that fits where the M1 cannot: drones, dashcams, and handheld devices. Hardware-based security includes a Root of Trust and True Random Number Generator.

Best for: Drones, ADAS systems, dashcams, autonomous mobile robots, and AIoT devices where size, weight, and power budget are the binding constraints.

AI performance	25 TOPS (INT8)
Power	3W typical
Memory	2GB integrated LPDDR4x
Chip package	21 x 21mm FCL-BGA
Module	M.2 2242, M+B Key
Host interface	PCIe Gen3 x2 (module)
Temp range	-40 to 85 degrees C industrial
Status	Mass production Q4 2025

Get the DX-M1M datasheet

Get the DX-M1M M.2 datasheet

DX-H1 Quattro PCIe Card

100 TOPS inference card, four DX-M1 chips on one low-profile PCIe slot

The DX-H1 Quattro installs into a standard PCIe x16 slot and delivers 100 TOPS at 20W TDP, enough to process up to 128 AI video channels from a single workstation, or 768 channels from a 2U server with six cards. It works alongside existing NVR or decoder infrastructure; no onboard video codec is included. The host motherboard must support PCIe bifurcation x4x4x4x4, splitting the x16 slot into four independent x4 lanes. Verify BIOS support before specifying. Compatible with Dell, HPE, Lenovo, Supermicro, and KAYTUS servers.

Best for: Adding AI inference to existing camera and NVR infrastructure. Confirm BIOS bifurcation x4x4x4x4 support before selecting. Compatible with Dell, HPE, Lenovo, Supermicro, and KAYTUS servers.

AI performance	100 TOPS (INT8)
Power	20W TDP
Memory	16GB LPDDR5
Video codec	Not included
Form factor	Low-profile PCIe card
Interface	PCIe x16 (bifurcation x4x4x4x4 required)
Dimensions	68.1 x 165.9 x 16mm with heatsink
Max channels	128 channels per workstation (YOLOv5s at 25fps)
Status	Mass production

Get the DX-H1 Quatro datasheet

DX-H1 V-NPU PCIe Card

Decode and infer on one card, full H.264/H.265 video pipeline with onboard codec

The DX-H1 V-NPU adds hardware H.264/H.265 codec blocks alongside its dual NPUs, enabling the card to ingest, decode, and analyze live camera streams without involving the host CPU. One card handles 64 channels of 1080p decode and 50 TOPS of AI inference simultaneously, making it the right choice for new-build AI surveillance systems where eliminating separate video servers is a design goal. The PCIe interface uses x8 electrical lanes in a physical x16 slot, compatible with a broader range of server motherboards than the Quattro. DEEPX reports 82% lower hardware cost and 85% electricity savings versus GPU setups in 1,000-channel deployments over five years.

Best for: New-build AI CCTV infrastructure where the goal is to consolidate decode and inference onto one card. DEEPX reports 82% lower hardware cost and 85% electricity savings versus GPU setups in 1,000-channel deployments over five years.

AI performance	50 TOPS (INT8)
Power	30W TDP
Memory	24GB LPDDR5
Video codec	H.264/H.265: 64CH decode, 32CH encode at 1080p 30fps
Form factor	Low-profile PCIe card
Interface	PCIe x16 physical, x8 electrical
Dimensions	68.1 x 164.3 x 18mm with heatsink
Max channels	64 channels per workstation (YOLOv5s at 30fps)
Status	Available

Get the DX-H1 V-NPU datasheet

From Chips to Complete Systems: AI Systems with DEEPX NPUs

For teams that need a validated AI system without a custom hardware design cycle, the DX-AIPlayer series integrates DX-M1 NPU acceleration with onboard compute, storage, and I/O in a production-ready enclosure. Compact and expanded configurations are available to match application I/O and compute requirements.

Part Number	DX-AIBOX-M1-N97-08064-A	DX-AIBOX-M1-i5-08128-i
Dimensions (mm)	95 x 95 x 55 mm	81 x 150 x190 mm
CPU	Intel® Processor N97	13th Gen Intel® Core™ i5-1340PE
Storage	eMMC Onboard 64GB	2.5'' SATA 128GB
Memory	RAM: 8GB LPDDR5	RAM: 8GB DDR4
I/O Interfaces	HDMI 2.0b x 1/DP 1.2 x 1 Gb Ethernet (full speed) RJ-45 x 2 RS-232/422/485 x 2 USB 2.0 x 2 (via 10 Pin Header x 1) USB 3.2 Gen 2 x 3 (Type-A)	HDMI/DP, 3 x2.5GbE LAN 4 x COM 6 x USB 1 x 12-bit DIO
Certification	KC, CE, and FCC
Power	12V DC-in, 5A	12~28V DC, RoHS
Weight	450g	1.88kg
Operating Temperature	0°C ~ 60° C	-20°C ~ 60°C

Request a quote

Applications Where DEEPX Is Deployed

DEEPX hardware is deployed across industries where real-time inference, low power consumption, and fanless operation are hard requirements.

DXNN® SDK: One Toolchain for Every DEEPX Product

The DXNN® SDK is the unified software stack for all DEEPX hardware. It provides a compiler that takes models from standard training frameworks and produces optimized binaries for DEEPX NPUs, applying IQ8 quantization automatically without manual per-layer tuning. The runtime supports concurrent multi-model execution and manages memory across chips.

The SDK supports x86 and ARM aarch-64 host architectures, runs on Windows and major Linux distributions, and is compatible with containerized deployments via Docker. Software partners including Edge Impulse, CVEDIA, NetworkOptix, Nota AI, and others have pre-integrated DEEPX hardware into their platforms using the DXNN SDK.

DXNN® SDK: One Toolchain for Every DEEPX Product

Contact Macnica Americas

Macnica provides evaluation hardware, design-in support, and production supply for all DEEPX products in America and Canada. Whether you are evaluating a chip for a new design or planning a production order, the team handles the full engagement from first sample through volume ramp.

Contact Macnica to request evaluation hardware, speak to an expert, or get a quote

First/Given Name *

Last/Family Name *

Company *

Title

Email Address *

Phone Number *

City *

State/Province *

Country

Subject *

Message

Would you like to be added to our mailing list to receive the Macnica Americas Newsletter?

Newsletter Signup

Yes