Back to Blog

[Case Study] Nvidia/Sopon + FPGA + AI High-Performance Edge Computing Box: Vehicle 3D Perception System

#人工智能#边缘计算

Invested in and participated by CAS Source Computing, the company boasts a top-tier team, decades of research accumulation, and successful technology development and implementation. The core team members have focused on IoT and edge computing research for decades, possessing extensive experience in the R&D and implementation of large-scale intelligent IoT systems and "device-edge-cloud" integrated computing systems.

The team has participated in numerous national-level fundamental scientific research projects in the IoT field, earning high recognition both domestically and internationally. Currently, the company holds dozens of intellectual property rights, has obtained qualifications as a National High-tech Enterprise and a Zhongguancun High-tech Enterprise, and received the "Double Soft" certification from the China Software Industry Association. It has been honored with titles such as CAS STS Innovation and Entrepreneurship Guidance Project, Zhongguancun Golden Seed Enterprise, Haidian District Germination Enterprise, and Beijing Golden Bridge Project Enterprise, and was awarded the Science and Technology Achievement Transformation Award by the Beijing Branch of the Chinese Academy of Sciences.

The applications of 3D vision perception technology in the automotive field are mainly divided into exterior and interior applications. Exterior applications include autonomous driving and assisted driving 360-degree 3D surround view, exterior identity recognition, etc. Interior applications include driver detection and in-cabin interaction. 3D vision perception technology enables smart hardware to perceive, think, judge, and make decisions like humans, greatly accelerating the process of terminal intelligence. The smart vehicle sector is the next 'main stage' where 3D vision perception technology will shine, leading to the mass production and integration of automotive-grade 3D cameras, lidar, and other products into vehicles.

    1. Product Overview

The XM-AIBOX-32 Smart Edge Analysis All-in-One Machine is a high-performance, low-power edge computing product. Equipped with the BM1684X main chip, it delivers INT8 computing power up to 32 TOPS, FP16/BF16 computing power up to 16 TFLOPS, and FP32 computing power up to 2 TFLOPS. It can simultaneously process 32 channels of high-definition video, supporting hardware decoding for 32 channels of 1080P HD video and encoding for 12 channels.

This product highly integrates high-precision AI intelligent algorithms based on computer vision and deep learning networks, along with a comprehensive intelligent video management platform. The AI intelligent algorithms cover various scenarios such as parks, communities, construction sites, and campuses, and can be combined and configured as needed for specific scenes. The comprehensive intelligent video management platform supports front-end device management, real-time video preview, alarm push, forensic capture, online algorithm loading and optimization, and large-screen display of data situation analysis. The device is easy to operate, plug-and-play, and features rich northbound API interfaces to empower upper-layer business application platforms.

    1. Product Features

Ultra-High Performance Computing and Codec Capabilities

  • Supports up to 32 TOPS of INT8 peak computing power;
  • Supports up to 16 TFLOPS of FP16/BF16 half-precision computing power;
  • Supports 2 TFLOPS of FP32 high-precision computing power;
  • Supports hardware decoding for up to 32 channels of H.264/H.265 1080P@25FPS video;
  • Supports hardware encoding for up to 12 channels of H.264/H.265 1080P@25FPS video.

Built-in Rich AI Algorithms

  • Built-in with over 30 AI algorithms, supporting free combination and custom configurations;

(Supports algorithms such as personnel structuring / facial recognition / vehicle structuring / license plate recognition / flame detection / smoke detection / smoking detection / phone call detection / mobile phone usage detection / no mask detection / personnel absenteeism detection / personnel sleeping on duty detection / personnel fall detection / personnel electrostatic discharge / area crowd counting / insufficient area occupancy / area overcrowding / abnormal area occupancy / area intrusion detection / work uniform detection / safety helmet detection / reflective vest detection / electric vehicle detection / regulated parking (illegal parking) / entrance/exit passenger flow statistics / perimeter fence climbing intrusion / personnel boundary crossing detection / area loitering detection / fire lane obstruction / fire escape obstruction / garbage not in bin detection / overflowing garbage bin detection / garbage disposal reminder / camera abnormal displacement detection, etc.)

  • Each video channel supports up to 3 AI analysis tasks running simultaneously;
  • Supports up to 32 video AI analysis tasks running simultaneously; when exceeding 32 AI analysis tasks, polling analysis can be performed.

Rich Interfaces, Flexible Deployment

  • Supports rich interfaces: 1000M Ethernet port, USB3.0/USB2.0, HDMI, RS-485, RS-232;
  • Supports wide operating temperature environment from -20℃ to +60℃;
  • Supports IP30 protection level, supports fanless cooling (subject to specific model);
  • Adapts to support SATA storage, supports 2TB storage capacity (subject to specific model);
  • Optional support for LTE wireless backhaul function (subject to specific model);
  • Northbound interfaces: Supports HTTP protocol, MQTT protocol, GB28281
  • Southbound interfaces: Supports GB28281, Onvif, RTSP

High Reliability, Encryption Protection

  • Supports high-capacity eMMC, can be developed to support primary/secondary partitions;
  • Supports abnormal fault alarm and protection handling mechanisms;
  • Supports programmable encryption chip for privacy information protection.

Easy-to-use Toolchain, Flexible Development

  • One-stop deep learning development toolkit Sophon SDK;
  • Supports mainstream deep learning frameworks such as Caffe/DarkNet/TensorFlow/PyTorch/MXNet/ONNX/PaddlePaddle;
  • Supports mainstream network models for classification and detection, supports custom operator development;

Supports Docker containerization for rapid deployment of algorithm applications.

    1. Technical Specifications

Specifications

XM-AIBOX-32

Technical Specifications

Chip

SOC

BM1684X

CPU

8-core A53@2.3GHz

AI Computing Power

INT8

32 TOPS

BF16/FP16

16 TFLOPS

FP32

2 TFLOPS

Video/Image Codec

Video Decoding Capability

H.264/H.265: 1080P @800fps

Video Decoding Resolution

8K / 4K / 1080P / 720P / D1 / CIF

Video Encoding Capability

H.264/H.265:1080P @300fps

Video Encoding Resolution

4K / 1080P / 720P / D1 / CIF

Image Encoding/Decoding Capability

600 images/second (JPEG)

Max Image Decoding Resolution

32768 * 32768

Memory and Storage

Memory

16 GB

eMMC

64 GB

External Interfaces

Ethernet Port

10/100/1000Mbps adaptive *2

USB

USB3.0 *2, USB2.0 *2

Storage

MicroSD *1

Display

HDMI *1

Serial Port

RS232 *1/RS485 *1

Extended Storage

SSD (Optional)

M.2 SSD

Wireless Function

4G/5G Wireless Module (Optional)

Mini-PCIE 4G Module/M.2 5G Module

Antenna

SMA Female *1(LTE)

SMA Female *4(5G), requires motherboard replacement

SMA Female *2 (Wi-Fi)

SMA Female *1(BT)

SIM

Standard SIM Card Slot

Wi-Fi/BT

Wi-Fi supports 802.11a/b/g/n/ac

BT5.0

Physical Specifications

Dimensions

LengthWidthHeight

210 mm * 130 mm * 44.5 mm

Power Supply and Consumption

Power Supply

DC 12V

Typical Power Consumption

≤20W

Note: Hard drive, 4G/5G functions are optional and not standard product configurations. Typical power consumption does not include hard drive or wireless module power consumption.