Nvidia/Sophon + FPGA + High-Performance AI Edge Computing Box: Intelligent Early Warning System for Large Machinery
Sany Heavy Industry Co., Ltd., founded by Sany Group in 1994, rapidly rose by breaking the traditional "technophobia" of the Chinese people and insisting on independent innovation. In July 2011, Sany Heavy Industry, with a market capitalization of US$21.584 billion, was listed among the Financial Times Global 500, making it the only Chinese machinery enterprise to date to make the list. In January 2012, Sany Heavy Industry acquired Putzmeister, the "world's number one concrete machinery brand" from Germany, fundamentally altering the global industry competitive landscape. Sany Heavy Industry's parent company, Sany Group, was officially established in Lianyuan, Hunan, in 1989. The company's name originates from its founding vision: "To build a first-class enterprise, cultivate first-class talent, and make first-class contributions." For many years, Sany has upheld "Quality Changes the World" as its mission, striving to contribute a world-class brand to the Chinese nation.

Safety solutions based on visual AI intelligent algorithms can be designed for heavy-duty trucks such as large vehicles, commercial vehicles, mining engineering machinery, port engineering machinery, park engineering vehicles, dump trucks, mixers, and freight trucks. By utilizing in-vehicle cameras for panoramic blind spot recognition, driver voice warnings, in-cabin displays, and loudspeakers, image data is collected for intelligent recognition and early warning alerts. When AI vision cameras detect pedestrians or vehicles approaching within 3 to 5 meters of the vehicle, the in-cabin display immediately magnifies and shows the video image on the right side, helping the driver enhance visual perception. When the distance of pedestrians or vehicles exceeds a certain threshold, the AI device immediately triggers audible and visual early warning alerts.
-
- Product Overview
The XM-AIBOX-16 intelligent edge analysis all-in-one machine is a high-performance, low-power edge computing product. It is equipped with the domestic TPU chip BM1684, offering INT8 computing power up to 17.6 TOPS, capable of simultaneously processing 16 channels of high-definition video, and supporting hardware decoding of 32 channels of 1080P HD video and encoding of 2 channels.
This product highly integrates high-precision AI intelligent algorithms based on computer vision and deep learning networks, as well as a comprehensive intelligent video management platform. The AI intelligent algorithms cover various algorithms for scenarios such as industrial parks, communities, construction sites, and campuses, which can be combined on demand and configured by scenario. The comprehensive intelligent video management platform supports front-end device management, real-time video preview, alarm push, forensic snapshot, online algorithm loading and optimization, and large-screen display of data situational analysis. The device is easy to operate, plug-and-play, and also features rich northbound API interfaces to empower upper-layer business application platforms.
-
- Product Features
Ultra-High Performance Computing and Codec Capabilities
- Supports peak INT8 computing power up to 17.6 TOPS or high-precision FP32 computing power of 2.2 TFLOPS;
- Supports full-process handling of up to 16 channels of 1080P HD video;
- Supports hardware decoding of up to 32 channels of H.264/H.265 1080P@25FPS video;
- Supports hardware encoding of up to 2 channels of H.264/H.265 1080P@25FPS video.
Rich Built-in AI Algorithms
- Built-in with over 30 AI algorithms, supporting free combination and custom configurations;
(Supports Person Structuring / Face Recognition / Vehicle Structuring / License Plate Recognition / Flame Detection / Smoke Detection / Smoking Detection / Phone Call Detection / Mobile Phone Play Detection / Mask-Wearing Detection / Personnel Absenteeism Detection / Personnel Sleeping on Duty Detection / Personnel Fall Detection / Personnel Static Elimination / Area People Counting / Insufficient People in Area / Overcrowding in Area / Abnormal Number of People in Area / Area Intrusion Detection / Work Uniform Detection / Safety Helmet Detection / Reflective Vest Detection / E-bike Detection / Standardized Parking (Illegal Parking) / Entrance/Exit Flow Statistics / Perimeter Crossing Intrusion / Personnel Boundary Crossing Detection / Area Loitering Detection / Fire Lane Occupancy / Fire Escape Route Occupancy / Un-bucketed Waste Detection / Overflowing Waste Bin Detection / Waste Disposal Reminder / Camera Abnormal Displacement Detection, and other algorithms)
- Each video channel supports up to 3 AI analysis tasks running simultaneously;
- Supports up to 16 video AI analysis tasks running simultaneously; when exceeding 16 AI analysis tasks, polling analysis can be performed.
Rich Interfaces, Flexible Deployment
- Supports rich interfaces: 1000M Ethernet port, USB3.0/USB2.0, HDMI, RS-485, RS-232;
- Supports wide operating temperature range from -20℃ to +60℃;
- Supports IP30 protection rating, fanless cooling (subject to specific model);
- Adapts to support SATA storage, with 2TB storage capacity (subject to specific model);
- Optional support for LTE wireless backhaul (subject to specific model);
- Northbound interfaces: Supports HTTP protocol, MQTT protocol, GB28281
- Southbound interfaces: Supports GB28281, Onvif, RTSP
High Reliability, Encryption Protection
- High-capacity eMMC supports development of primary/secondary partitions;
- Supports abnormal fault alarm and protection mechanisms;
- Supports programmable encryption chip for privacy information protection.
User-Friendly Toolchain, Flexible Development
- One-stop deep learning development toolkit: Sophon SDK;
- Supports mainstream deep learning frameworks such as Caffe/DarkNet/TensorFlow/PyTorch/MXNet/ONNX/PaddlePaddle;
- Supports mainstream network models for classification and detection, and custom operator development;
Supports Docker containerization for rapid deployment of algorithm applications.
-
- Technical Specifications
Specifications
Model
XM-AIBOX-16
Technical Specifications
Chip
TPU
BM1684
CPU
AI Computing Power
INT8
17.6 TOPS
FP32
2.2 TFLOPS
Video/Image Codec
Video Decoding Capability
H.264/H.265: 1080P @960fps
Video Decoding Resolution
8K / 4K / 1080P / 720P / D1 / CIF
Video Encoding Capability
H.264/H.265: 1080P @50fps
Video Encoding Resolution
4K / 1080P / 720P / D1 / CIF
Image Encoding/Decoding Capability
480 frames/sec@1080P
Max Image Decoding Resolution
32768 * 32768
Memory and Storage
Memory
12 GB
eMMC
32 GB
External Interfaces
Ethernet Port
10/100/1000Mbps Adaptive *2
USB
USB3.0 *2
N/A
Storage
MicroSD *1
Display
HDMI *1
N/A
Serial Port
RS232 *1/RS485 *1/Custom I/O
Extended Storage
SSD (Optional)
2TB (Optional)
N/A
Wireless Functionality
4G/5G Wireless Module (Optional)
Mini-PCIe 4G Module
Antenna (Optional)
SMA Female *2 (Wi-Fi)
SMA Female *4 (5G)
SIM
Standard SIM Card Slot
Wireless Functionality (Non-Optional)
5G Wireless Module
M.2 5G Module
Physical Specifications
Dimensions
LWH
188 mm * 148 mm * 44.5 mm
Power Supply and Consumption
Power Supply
DC 12V
Typical Power Consumption
≤23.5W
≤19W
Note: Hard drive and wireless functionality are optional and not standard product configurations. Typical power consumption does not include hard drive or wireless module power consumption.
-
- Product Appearance