[Case Study] Nvidia/Sopon + FPGA + AI High-Performance Edge Computing Box: Vehicle 3D Perception System
Invested in and participated by CAS Source Computing, the company boasts a top-tier team, decades of research accumulation, and successful technology development and implementation. The core team members have focused on IoT and edge computing research for decades, possessing extensive experience in the R&D and implementation of large-scale intelligent IoT systems and "device-edge-cloud" integrated computing systems.
The team has participated in numerous national-level fundamental scientific research projects in the IoT field, earning high recognition both domestically and internationally. Currently, the company holds dozens of intellectual property rights, has obtained qualifications as a National High-tech Enterprise and a Zhongguancun High-tech Enterprise, and received the "Double Soft" certification from the China Software Industry Association. It has been honored with titles such as CAS STS Innovation and Entrepreneurship Guidance Project, Zhongguancun Golden Seed Enterprise, Haidian District Germination Enterprise, and Beijing Golden Bridge Project Enterprise, and was awarded the Science and Technology Achievement Transformation Award by the Beijing Branch of the Chinese Academy of Sciences.

The applications of 3D vision perception technology in the automotive field are mainly divided into exterior and interior applications. Exterior applications include autonomous driving and assisted driving 360-degree 3D surround view, exterior identity recognition, etc. Interior applications include driver detection and in-cabin interaction. 3D vision perception technology enables smart hardware to perceive, think, judge, and make decisions like humans, greatly accelerating the process of terminal intelligence. The smart vehicle sector is the next 'main stage' where 3D vision perception technology will shine, leading to the mass production and integration of automotive-grade 3D cameras, lidar, and other products into vehicles.
-
- Product Overview
The XM-AIBOX-32 Smart Edge Analysis All-in-One Machine is a high-performance, low-power edge computing product. Equipped with the BM1684X main chip, it delivers INT8 computing power up to 32 TOPS, FP16/BF16 computing power up to 16 TFLOPS, and FP32 computing power up to 2 TFLOPS. It can simultaneously process 32 channels of high-definition video, supporting hardware decoding for 32 channels of 1080P HD video and encoding for 12 channels.
This product highly integrates high-precision AI intelligent algorithms based on computer vision and deep learning networks, along with a comprehensive intelligent video management platform. The AI intelligent algorithms cover various scenarios such as parks, communities, construction sites, and campuses, and can be combined and configured as needed for specific scenes. The comprehensive intelligent video management platform supports front-end device management, real-time video preview, alarm push, forensic capture, online algorithm loading and optimization, and large-screen display of data situation analysis. The device is easy to operate, plug-and-play, and features rich northbound API interfaces to empower upper-layer business application platforms.
-
- Product Features
Ultra-High Performance Computing and Codec Capabilities
- Supports up to 32 TOPS of INT8 peak computing power;
- Supports up to 16 TFLOPS of FP16/BF16 half-precision computing power;
- Supports 2 TFLOPS of FP32 high-precision computing power;
- Supports hardware decoding for up to 32 channels of H.264/H.265 1080P@25FPS video;
- Supports hardware encoding for up to 12 channels of H.264/H.265 1080P@25FPS video.
Built-in Rich AI Algorithms
- Built-in with over 30 AI algorithms, supporting free combination and custom configurations;
(Supports algorithms such as personnel structuring / facial recognition / vehicle structuring / license plate recognition / flame detection / smoke detection / smoking detection / phone call detection / mobile phone usage detection / no mask detection / personnel absenteeism detection / personnel sleeping on duty detection / personnel fall detection / personnel electrostatic discharge / area crowd counting / insufficient area occupancy / area overcrowding / abnormal area occupancy / area intrusion detection / work uniform detection / safety helmet detection / reflective vest detection / electric vehicle detection / regulated parking (illegal parking) / entrance/exit passenger flow statistics / perimeter fence climbing intrusion / personnel boundary crossing detection / area loitering detection / fire lane obstruction / fire escape obstruction / garbage not in bin detection / overflowing garbage bin detection / garbage disposal reminder / camera abnormal displacement detection, etc.)
- Each video channel supports up to 3 AI analysis tasks running simultaneously;
- Supports up to 32 video AI analysis tasks running simultaneously; when exceeding 32 AI analysis tasks, polling analysis can be performed.
Rich Interfaces, Flexible Deployment
- Supports rich interfaces: 1000M Ethernet port, USB3.0/USB2.0, HDMI, RS-485, RS-232;
- Supports wide operating temperature environment from -20℃ to +60℃;
- Supports IP30 protection level, supports fanless cooling (subject to specific model);
- Adapts to support SATA storage, supports 2TB storage capacity (subject to specific model);
- Optional support for LTE wireless backhaul function (subject to specific model);
- Northbound interfaces: Supports HTTP protocol, MQTT protocol, GB28281
- Southbound interfaces: Supports GB28281, Onvif, RTSP
High Reliability, Encryption Protection
- Supports high-capacity eMMC, can be developed to support primary/secondary partitions;
- Supports abnormal fault alarm and protection handling mechanisms;
- Supports programmable encryption chip for privacy information protection.
Easy-to-use Toolchain, Flexible Development
- One-stop deep learning development toolkit Sophon SDK;
- Supports mainstream deep learning frameworks such as Caffe/DarkNet/TensorFlow/PyTorch/MXNet/ONNX/PaddlePaddle;
- Supports mainstream network models for classification and detection, supports custom operator development;
Supports Docker containerization for rapid deployment of algorithm applications.
-
- Technical Specifications
Specifications
XM-AIBOX-32
Technical Specifications
Chip
SOC
BM1684X
CPU
AI Computing Power
INT8
32 TOPS
BF16/FP16
16 TFLOPS
FP32
2 TFLOPS
Video/Image Codec
Video Decoding Capability
H.264/H.265: 1080P @800fps
Video Decoding Resolution
8K / 4K / 1080P / 720P / D1 / CIF
Video Encoding Capability
H.264/H.265:1080P @300fps
Video Encoding Resolution
4K / 1080P / 720P / D1 / CIF
Image Encoding/Decoding Capability
600 images/second (JPEG)
Max Image Decoding Resolution
32768 * 32768
Memory and Storage
Memory
16 GB
eMMC
64 GB
External Interfaces
Ethernet Port
10/100/1000Mbps adaptive *2
USB
USB3.0 *2, USB2.0 *2
Storage
MicroSD *1
Display
HDMI *1
Serial Port
RS232 *1/RS485 *1
Extended Storage
SSD (Optional)
M.2 SSD
Wireless Function
4G/5G Wireless Module (Optional)
Mini-PCIE 4G Module/M.2 5G Module
Antenna
SMA Female *1(LTE)
SMA Female *4(5G), requires motherboard replacement
SMA Female *2 (Wi-Fi)
SMA Female *1(BT)
SIM
Standard SIM Card Slot
Wi-Fi/BT
Wi-Fi supports 802.11a/b/g/n/ac
BT5.0
Physical Specifications
Dimensions
LengthWidthHeight
210 mm * 130 mm * 44.5 mm
Power Supply and Consumption
Power Supply
DC 12V
Typical Power Consumption
≤20W
Note: Hard drive, 4G/5G functions are optional and not standard product configurations. Typical power consumption does not include hard drive or wireless module power consumption.