25 TOPS at 5W. Fanless AI Inference for Any Enclosure.
From embedded modules to high-density PCIe cards and complete systems, DEEPX delivers production AI inference where cloud and GPU solutions cannot operate. Exclusively available from Macnica.
The DEEPX product line spans from a 42mm M.2 module for drones and embedded devices to high-density PCIe inference cards for multi-channel surveillance servers. The DX-M and DX-H families are deployed today in security systems, smart factories, robotics platforms, and transportation infrastructure worldwide. All products share the same SDK and model framework support.
Macnica is the exclusive authorized distributor for North America, providing design-in support, evaluation hardware, and supply chain planning for production volumes, covering the full engagement from concept through volume ramp. Contact Macnica to request an evaluation kit or discuss your application with a field application engineer.
DEEPX NPU Architecture Built Around Power, Thermal, and Precision
GPUs overheat in sealed enclosures, low-power NPUs cannot sustain production frame rates on real detection models, and cloud inference introduces round-trip latency and data egress that industrial and safety-critical systems cannot tolerate. DEEPX NPU architecture is designed specifically around these failure modes, with power envelope, thermal characteristics, and quantization fidelity treated as primary design constraints rather than secondary considerations. All performance claims are independently benchmarked against production workloads.
The DEEPX IQ8 quantization engine addresses the precision trade-off that limits most edge NPU deployments. Standard INT8 quantization reduces memory bandwidth and compute requirements but introduces accuracy degradation that requires per-layer calibration to recover. IQ8 performs INT8-width computation while preserving the numerical range of FP32 inference, delivering full-speed throughput without model accuracy loss and without manual calibration. For teams deploying YOLO-family or custom detection models, this removes a significant pre-production engineering step.
Thermal performance is validated in direct hardware comparison. Running identical workloads, the DX-M1 board surface measured 35.5 degrees C versus 60.7 degrees C for a competing chip. In sealed enclosures without active cooling, that 25-degree delta is the margin between a qualified fanless design and one that requires thermal redesign or derating at operating temperature.
DEEPX Products: Chips, modules, PCIe cards, and Complete Systems
The DEEPX portfolio covers two distinct deployment scenarios. The DX-M Series are bare chips and M.2 modules for embedding directly into host systems: robots, cameras, vehicles, and edge devices. The DX-H Series are low-profile PCIe cards for servers and workstations running multi-channel video analytics and AI inference at scale. All products share the same DXNN SDK and support the same model frameworks.
- DX-M1 SoCs & M.2
- DX-M1M SoCs & M.2
- DX-H1 Quattro PCIe
- DX-H1 V-NPU PCIe
DX-M1 SoCs and M2 Modules
Embedded chip with external memory, industrial and automotive grades
The DX-M1 is the flagship embedded NPU. Its external memory architecture lets designers size LPDDR4x or LPDDR5 independently of the chip, supporting up to 8GB for workloads running multiple simultaneous models. Available as a bare chip or as an M.2 2280 module with 4GB LPDDR5 onboard. AEC-Q100 Grade 3 qualified, with Grade 2 available on request. This is the only DEEPX chip with automotive-grade certification.
Best for: Designs requiring memory flexibility, automotive-grade certification, or multi-model inference. Robotics platforms, industrial inspection systems, and smart city nodes.
AI performance | 25 TOPS (INT8) |
Power | 1 to 5W |
Memory | Up to 8GB external LPDDR5 |
Chip package | 17 x 17mm FCBGA |
Module | M.2 2280, M-Key |
Host interface | PCIe Gen3 x4 |
Temp range | -40 to 85 degrees C / AEC-Q100 Grade 3 |
| Status | Mass production |
DX-M1M SoCs and M.2 Modules
Compact all-in-one chip and module with integrated memory, half the footprint of DX-M1
The DX-M1M integrates 2GB LPDDR4x directly into the chip package, eliminating the external DRAM device and reducing module length to 42mm. The result is a complete AI accelerator in a form factor that fits where the M1 cannot: drones, dashcams, and handheld devices. Hardware-based security includes a Root of Trust and True Random Number Generator.
Best for: Drones, ADAS systems, dashcams, autonomous mobile robots, and AIoT devices where size, weight, and power budget are the binding constraints.
AI performance | 25 TOPS (INT8) |
Power | 3W typical |
Memory | 2GB integrated LPDDR4x |
Chip package | 21 x 21mm FCL-BGA |
Module | M.2 2242, M+B Key |
Host interface | PCIe Gen3 x2 (module) |
Temp range | -40 to 85 degrees C industrial |
| Status | Mass production Q4 2025 |
DX-H1 Quattro PCIe Card
100 TOPS inference card, four DX-M1 chips on one low-profile PCIe slot
The DX-H1 Quattro installs into a standard PCIe x16 slot and delivers 100 TOPS at 20W TDP, enough to process up to 128 AI video channels from a single workstation, or 768 channels from a 2U server with six cards. It works alongside existing NVR or decoder infrastructure; no onboard video codec is included. The host motherboard must support PCIe bifurcation x4x4x4x4, splitting the x16 slot into four independent x4 lanes. Verify BIOS support before specifying. Compatible with Dell, HPE, Lenovo, Supermicro, and KAYTUS servers.
Best for: Adding AI inference to existing camera and NVR infrastructure. Confirm BIOS bifurcation x4x4x4x4 support before selecting. Compatible with Dell, HPE, Lenovo, Supermicro, and KAYTUS servers.
AI performance | 100 TOPS (INT8) |
Power | 20W TDP |
Memory | 16GB LPDDR5 |
Video codec | Not included |
Form factor | Low-profile PCIe card |
Interface | PCIe x16 (bifurcation x4x4x4x4 required) |
Dimensions | 68.1 x 165.9 x 16mm with heatsink |
Max channels | 128 channels per workstation (YOLOv5s at 25fps) |
| Status | Mass production |
DX-H1 V-NPU PCIe Card
Decode and infer on one card, full H.264/H.265 video pipeline with onboard codec
The DX-H1 V-NPU adds hardware H.264/H.265 codec blocks alongside its dual NPUs, enabling the card to ingest, decode, and analyze live camera streams without involving the host CPU. One card handles 64 channels of 1080p decode and 50 TOPS of AI inference simultaneously, making it the right choice for new-build AI surveillance systems where eliminating separate video servers is a design goal. The PCIe interface uses x8 electrical lanes in a physical x16 slot, compatible with a broader range of server motherboards than the Quattro. DEEPX reports 82% lower hardware cost and 85% electricity savings versus GPU setups in 1,000-channel deployments over five years.
Best for: New-build AI CCTV infrastructure where the goal is to consolidate decode and inference onto one card. DEEPX reports 82% lower hardware cost and 85% electricity savings versus GPU setups in 1,000-channel deployments over five years.
AI performance | 50 TOPS (INT8) |
Power | 30W TDP |
Memory | 24GB LPDDR5 |
Video codec | H.264/H.265: 64CH decode, 32CH encode at 1080p 30fps |
Form factor | Low-profile PCIe card |
Interface | PCIe x16 physical, x8 electrical |
Dimensions | 68.1 x 164.3 x 18mm with heatsink |
Max channels | 64 channels per workstation (YOLOv5s at 30fps) |
| Status | Available |
From Chips to Complete Systems: AI Systems with DEEPX NPUs
For teams that need a validated AI system without a custom hardware design cycle, the DX-AIPlayer series integrates DX-M1 NPU acceleration with onboard compute, storage, and I/O in a production-ready enclosure. Compact and expanded configurations are available to match application I/O and compute requirements.
Part Number | DX-AIBOX-M1-N97-08064-A | DX-AIBOX-M1-i5-08128-i |
Dimensions (mm) | 95 x 95 x 55 mm | 81 x 150 x190 mm |
CPU | Intel® Processor N97 | 13th Gen Intel® Core™ i5-1340PE |
Storage | eMMC Onboard 64GB | 2.5'' SATA 128GB |
| Memory | RAM: 8GB LPDDR5 | RAM: 8GB DDR4 |
I/O Interfaces | HDMI 2.0b x 1/DP 1.2 x 1 Gb Ethernet (full speed) RJ-45 x 2 RS-232/422/485 x 2 USB 2.0 x 2 (via 10 Pin Header x 1) USB 3.2 Gen 2 x 3 (Type-A) | HDMI/DP, 3 x2.5GbE LAN 4 x COM 6 x USB 1 x 12-bit DIO |
Certification | KC, CE, and FCC | |
Power | 12V DC-in, 5A | 12~28V DC, RoHS |
| Weight | 450g | 1.88kg |
Operating Temperature | 0°C ~ 60° C | -20°C ~ 60°C |
Applications Where DEEPX Is Deployed
DEEPX hardware is deployed across industries where real-time inference, low power consumption, and fanless operation are hard requirements.
DXNN® SDK: One Toolchain for Every DEEPX Product
The DXNN® SDK is the unified software stack for all DEEPX hardware. It provides a compiler that takes models from standard training frameworks and produces optimized binaries for DEEPX NPUs, applying IQ8 quantization automatically without manual per-layer tuning. The runtime supports concurrent multi-model execution and manages memory across chips.
The SDK supports x86 and ARM aarch-64 host architectures, runs on Windows and major Linux distributions, and is compatible with containerized deployments via Docker. Software partners including Edge Impulse, CVEDIA, NetworkOptix, Nota AI, and others have pre-integrated DEEPX hardware into their platforms using the DXNN SDK.
Contact Macnica Americas
Macnica provides evaluation hardware, design-in support, and production supply for all DEEPX products in America and Canada. Whether you are evaluating a chip for a new design or planning a production order, the team handles the full engagement from first sample through volume ramp.
Contact Macnica to request evaluation hardware, speak to an expert, or get a quote