Back to Blog

Nvidia/Sophon + FPGA + High-Performance AI Edge Computing Box: Intelligent Early Warning System for Large Machinery

#人工智能

Sany Heavy Industry Co., Ltd., founded by Sany Group in 1994, rapidly rose by breaking the traditional "technophobia" of the Chinese people and insisting on independent innovation. In July 2011, Sany Heavy Industry, with a market capitalization of US$21.584 billion, was listed among the Financial Times Global 500, making it the only Chinese machinery enterprise to date to make the list. In January 2012, Sany Heavy Industry acquired Putzmeister, the "world's number one concrete machinery brand" from Germany, fundamentally altering the global industry competitive landscape. Sany Heavy Industry's parent company, Sany Group, was officially established in Lianyuan, Hunan, in 1989. The company's name originates from its founding vision: "To build a first-class enterprise, cultivate first-class talent, and make first-class contributions." For many years, Sany has upheld "Quality Changes the World" as its mission, striving to contribute a world-class brand to the Chinese nation.

Safety solutions based on visual AI intelligent algorithms can be designed for heavy-duty trucks such as large vehicles, commercial vehicles, mining engineering machinery, port engineering machinery, park engineering vehicles, dump trucks, mixers, and freight trucks. By utilizing in-vehicle cameras for panoramic blind spot recognition, driver voice warnings, in-cabin displays, and loudspeakers, image data is collected for intelligent recognition and early warning alerts. When AI vision cameras detect pedestrians or vehicles approaching within 3 to 5 meters of the vehicle, the in-cabin display immediately magnifies and shows the video image on the right side, helping the driver enhance visual perception. When the distance of pedestrians or vehicles exceeds a certain threshold, the AI device immediately triggers audible and visual early warning alerts.

    1. Product Overview

The XM-AIBOX-16 intelligent edge analysis all-in-one machine is a high-performance, low-power edge computing product. It is equipped with the domestic TPU chip BM1684, offering INT8 computing power up to 17.6 TOPS, capable of simultaneously processing 16 channels of high-definition video, and supporting hardware decoding of 32 channels of 1080P HD video and encoding of 2 channels.

This product highly integrates high-precision AI intelligent algorithms based on computer vision and deep learning networks, as well as a comprehensive intelligent video management platform. The AI intelligent algorithms cover various algorithms for scenarios such as industrial parks, communities, construction sites, and campuses, which can be combined on demand and configured by scenario. The comprehensive intelligent video management platform supports front-end device management, real-time video preview, alarm push, forensic snapshot, online algorithm loading and optimization, and large-screen display of data situational analysis. The device is easy to operate, plug-and-play, and also features rich northbound API interfaces to empower upper-layer business application platforms.

    1. Product Features

Ultra-High Performance Computing and Codec Capabilities

  • Supports peak INT8 computing power up to 17.6 TOPS or high-precision FP32 computing power of 2.2 TFLOPS;
  • Supports full-process handling of up to 16 channels of 1080P HD video;
  • Supports hardware decoding of up to 32 channels of H.264/H.265 1080P@25FPS video;
  • Supports hardware encoding of up to 2 channels of H.264/H.265 1080P@25FPS video.

Rich Built-in AI Algorithms

  • Built-in with over 30 AI algorithms, supporting free combination and custom configurations;

(Supports Person Structuring / Face Recognition / Vehicle Structuring / License Plate Recognition / Flame Detection / Smoke Detection / Smoking Detection / Phone Call Detection / Mobile Phone Play Detection / Mask-Wearing Detection / Personnel Absenteeism Detection / Personnel Sleeping on Duty Detection / Personnel Fall Detection / Personnel Static Elimination / Area People Counting / Insufficient People in Area / Overcrowding in Area / Abnormal Number of People in Area / Area Intrusion Detection / Work Uniform Detection / Safety Helmet Detection / Reflective Vest Detection / E-bike Detection / Standardized Parking (Illegal Parking) / Entrance/Exit Flow Statistics / Perimeter Crossing Intrusion / Personnel Boundary Crossing Detection / Area Loitering Detection / Fire Lane Occupancy / Fire Escape Route Occupancy / Un-bucketed Waste Detection / Overflowing Waste Bin Detection / Waste Disposal Reminder / Camera Abnormal Displacement Detection, and other algorithms)

  • Each video channel supports up to 3 AI analysis tasks running simultaneously;
  • Supports up to 16 video AI analysis tasks running simultaneously; when exceeding 16 AI analysis tasks, polling analysis can be performed.

Rich Interfaces, Flexible Deployment

  • Supports rich interfaces: 1000M Ethernet port, USB3.0/USB2.0, HDMI, RS-485, RS-232;
  • Supports wide operating temperature range from -20℃ to +60℃;
  • Supports IP30 protection rating, fanless cooling (subject to specific model);
  • Adapts to support SATA storage, with 2TB storage capacity (subject to specific model);
  • Optional support for LTE wireless backhaul (subject to specific model);
  • Northbound interfaces: Supports HTTP protocol, MQTT protocol, GB28281
  • Southbound interfaces: Supports GB28281, Onvif, RTSP

High Reliability, Encryption Protection

  • High-capacity eMMC supports development of primary/secondary partitions;
  • Supports abnormal fault alarm and protection mechanisms;
  • Supports programmable encryption chip for privacy information protection.

User-Friendly Toolchain, Flexible Development

  • One-stop deep learning development toolkit: Sophon SDK;
  • Supports mainstream deep learning frameworks such as Caffe/DarkNet/TensorFlow/PyTorch/MXNet/ONNX/PaddlePaddle;
  • Supports mainstream network models for classification and detection, and custom operator development;

Supports Docker containerization for rapid deployment of algorithm applications.

    1. Technical Specifications

Specifications

Model

XM-AIBOX-16

Technical Specifications

Chip

TPU

BM1684

CPU

8-core A53@2.3GHz

AI Computing Power

INT8

17.6 TOPS

FP32

2.2 TFLOPS

Video/Image Codec

Video Decoding Capability

H.264/H.265: 1080P @960fps

Video Decoding Resolution

8K / 4K / 1080P / 720P / D1 / CIF

Video Encoding Capability

H.264/H.265: 1080P @50fps

Video Encoding Resolution

4K / 1080P / 720P / D1 / CIF

Image Encoding/Decoding Capability

480 frames/sec@1080P

Max Image Decoding Resolution

32768 * 32768

Memory and Storage

Memory

12 GB

eMMC

32 GB

External Interfaces

Ethernet Port

10/100/1000Mbps Adaptive *2

USB

USB3.0 *2

N/A

Storage

MicroSD *1

Display

HDMI *1

N/A

Serial Port

RS232 *1/RS485 *1/Custom I/O

Extended Storage

SSD (Optional)

2TB (Optional)

N/A

Wireless Functionality

4G/5G Wireless Module (Optional)

Mini-PCIe 4G Module

Antenna (Optional)

SMA Female *2 (Wi-Fi)

SMA Female *4 (5G)

SIM

Standard SIM Card Slot

Wireless Functionality (Non-Optional)

5G Wireless Module

M.2 5G Module

Physical Specifications

Dimensions

LWH

188 mm * 148 mm * 44.5 mm

Power Supply and Consumption

Power Supply

DC 12V

Typical Power Consumption

≤23.5W

≤19W

Note: Hard drive and wireless functionality are optional and not standard product configurations. Typical power consumption does not include hard drive or wireless module power consumption.

    1. Product Appearance