As AI, sensing technology and networking innovation continue to accelerate, using AI analysis to assess and triage input at the network’s edge will be critical, especially as data sources expand almost daily.
Applying AI analysis at large scale can be slow and complex, and it brings higher maintenance costs and software upkeep to onboard new AI models and additional hardware. With edge computing capabilities emerging in more remote locations, AI analysis can be placed closer to the sensors, reducing latency and increasing bandwidth.
Red Hat and NTT, in collaboration with NVIDIA and Fujitsu, have jointly developed a solution to enhance and extend the potential for real-time AI data analysis at the edge.
Built on the foundation of Red Hat OpenShift, the solution has received Proof of Concept (PoC) recognition from the Innovative Optical and Wireless Network (IOWN) Global Forum for its real-world viability and use cases.
This solution consists of the IOWN All-Photonics Network (APN) and data pipeline acceleration technologies in IOWN Data-Centric Infrastructure (DCI). NTT’s accelerated data pipeline for AI adopts Remote Direct Memory Access (RDMA) over APN to efficiently collect and process large amounts of sensor data at the edge.
Container orchestration technology from Red Hat OpenShift provides greater flexibility to operate workloads within the accelerated data pipeline across geographically distributed and remote data centres.
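As a rough illustration of how such orchestration can place workloads at disaggregated sites (a sketch, not the PoC's deployment code), the following Python snippet uses the official Kubernetes Python client to schedule a GPU-backed inference pod onto a remote data-centre node. The container image, namespace, and node label are hypothetical; `nvidia.com/gpu` is the resource name advertised when the NVIDIA device plugin or GPU Operator is installed.

```python
from kubernetes import client, config

# Load credentials for the target cluster; in-cluster config would also work.
config.load_kube_config()
core = client.CoreV1Api()

inference_container = client.V1Container(
    name="ai-inference",
    image="registry.example.com/edge/inference:latest",  # hypothetical image
    resources=client.V1ResourceRequirements(
        # Request one GPU; this resource is advertised by the NVIDIA device plugin.
        limits={"nvidia.com/gpu": "1"},
    ),
)

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="edge-ai-inference"),
    spec=client.V1PodSpec(
        containers=[inference_container],
        restart_policy="Never",
        # Hypothetical label identifying nodes in the remote data centre.
        node_selector={"topology.example.com/site": "remote-dc"},
    ),
)

core.create_namespaced_pod(namespace="edge-ai", body=pod)  # namespace is an assumption
```

In an OpenShift environment the same request would typically be expressed as a Deployment or through an operator-managed custom resource; the point is that the scheduler, not the application, decides which GPU-equipped node runs the workload.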
NTT and Red Hat have successfully demonstrated that this solution can reduce power consumption while achieving lower latency for real-time AI analysis at the edge.
Delivering the PoC
The proof of concept evaluated a real-time AI analysis platform with Yokosuka City as the sensor installation base and Musashino City as the remote data centre, the two connected via APN. Even when many cameras were accommodated, the latency required to aggregate sensor data for AI analysis was reduced by 60% compared to conventional AI inference workloads.
The IOWN PoC testing also demonstrated that the power consumption required for AI analysis of each camera at the edge could be reduced by 40% compared with conventional technology. This real-time AI analysis platform allows the GPU to be scaled up to accommodate a larger number of cameras without the CPU becoming a bottleneck.
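To make that scaling behaviour concrete, here is a minimal, illustrative PyTorch sketch (not the PoC software) that batches frames from many cameras into a single GPU inference pass, so per-camera CPU work stays roughly flat as streams are added. The model, frame size, and camera count are assumptions.

```python
import torch
import torchvision

NUM_CAMERAS = 64             # assumed number of simultaneously ingested streams
FRAME_SHAPE = (3, 224, 224)  # assumed per-frame tensor shape after preprocessing

# Any GPU-resident model would do; ResNet-50 is used purely as a placeholder.
model = torchvision.models.resnet50(weights=None).eval().cuda()

@torch.no_grad()
def analyse_batch(frames: torch.Tensor) -> torch.Tensor:
    """Run one inference pass over a batch of frames already resident on the GPU."""
    return model(frames).argmax(dim=1)

# In the PoC-style pipeline the frames would arrive in accelerator memory via the
# accelerated data pipeline; here a dummy batch is allocated directly on the GPU.
frames = torch.rand((NUM_CAMERAS, *FRAME_SHAPE), device="cuda")
labels = analyse_batch(frames)
print(labels.shape)  # one prediction per camera frame
```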
A trial calculation suggests that, if 1,000 cameras were accommodated, power consumption could be reduced by a further 60%. The highlights of the proof of concept for this solution are as follows:
- Accelerated data pipeline for AI inference, provided by NTT, which uses RDMA over APN to fetch large-scale sensor data from local sites directly into accelerator memory in a remote data centre, avoiding the protocol-handling overhead of the conventional network. Data processing for AI inference is then completed within the accelerator with less CPU control overhead, improving the power efficiency of AI inference (a rough illustrative sketch follows this list).
- Large-scale, real-time AI data analysis, powered by Red Hat OpenShift, which supports Kubernetes operators to minimize the complexity of implementing hardware-based accelerators (GPUs, DPUs, etc.), enabling improved flexibility and easier deployment across disaggregated sites, including remote data centres.
- This PoC uses NVIDIA A100 Tensor Core GPUs and NVIDIA ConnectX-6 NICs for AI inference.
- The solution helps set the stage for intelligent, AI-enabled technologies that allow businesses to scale sustainably. With it, organisations can benefit from:
- Reduced overhead associated with collecting large amounts of data;
- Enhanced data collection that can be shared between metropolitan areas and remote data centres for quicker AI analysis;
- The ability to utilize locally available and potentially renewable energy, such as solar or wind;
- Increased security for area management, with video cameras acting as sensor devices.
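The RDMA-over-APN pipeline described in the first highlight depends on network and accelerator hardware that a short code sample cannot reproduce. As a loose stand-in for the underlying idea (staging data into accelerator memory and finishing the processing there with minimal CPU protocol handling), the following PyTorch sketch copies frames from pinned host memory to the GPU asynchronously and keeps all per-frame work on the device; a real RDMA or GPUDirect path would bypass even the host staging buffer. Buffer shapes and sizes are assumptions.

```python
import torch

NUM_FRAMES = 32
FRAME_SHAPE = (3, 720, 1280)  # assumed camera frame resolution

stream = torch.cuda.Stream()

# Page-locked (pinned) host staging buffer; pinned memory enables truly
# asynchronous host-to-device copies.
host_buf = torch.zeros((NUM_FRAMES, *FRAME_SHAPE), pin_memory=True)
dev_buf = torch.empty_like(host_buf, device="cuda")

with torch.cuda.stream(stream):
    # Asynchronous copy: the CPU enqueues the transfer and moves on.
    dev_buf.copy_(host_buf, non_blocking=True)
    # Example on-device preprocessing (normalisation) kept entirely on the GPU,
    # mirroring the idea of completing data processing within the accelerator.
    dev_buf = dev_buf / 255.0

stream.synchronize()
```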