The `xInfer` Model Zoo API

Welcome to the xInfer::zoo. This is the high-level, "batteries-included" API for xInfer.

The zoo is a collection of pre-packaged, hyper-optimized, and incredibly easy-to-use solutions for the world's most common AI tasks.

The Philosophy: Solutions, Not Just Tools

While the Core Toolkit provides the powerful, low-level "engine parts" for building custom pipelines, the zoo provides the finished "F1 car."

Each class in the zoo is a complete, end-to-end pipeline that abstracts away all the complexity of pre-processing, inference, and post-processing. The goal is to let you solve a complex problem like real-time object detection or image generation with just two lines of C++ code:

One line to initialize the pipeline from a pre-built engine.
One line to predict.

This is the power of the zoo. It gives you the full, state-of-the-art performance of a custom C++/CUDA/TensorRT application with the simplicity of a high-level library.

Key Features of All `zoo` Classes

Performance by Default: Every zoo class is built on top of the hyper-performant xInfer::core::InferenceEngine and uses fused CUDA kernels from the preproc and postproc modules wherever possible.
Simple, Task-Oriented API: You don't interact with raw tensors. You provide a cv::Mat image and get back a std::vector<BoundingBox> or a std::string. The API is designed around the final answer, not the intermediate steps.
Seamless Hub Integration: Most zoo classes have special constructors that can download a pre-built, perfectly optimized engine for your hardware directly from the Ignition Hub. This provides a "zero-setup" user experience.
Robust and Production-Ready: These classes are designed to be used directly in your final application. They are efficient, safe, and easy to integrate.

The `zoo` Catalog of Solutions

The zoo is organized by domain. Explore the available pipelines below to find the solution you need.

🖼️ Computer Vision

Tools for understanding and analyzing visual information from images and video. This is the most mature and comprehensive part of the zoo.

Tasks: Image Classification, Object Detection, Semantic & Instance Segmentation, Pose Estimation, Face Recognition, OCR, and many more.

➡️ Explore the Vision API

✨ Generative AI

Powerful pipelines for creating novel content, from images and audio to 3D models.

Tasks: Text-to-Image (Diffusion), Image Generation (GANs), Super-Resolution, Style Transfer, Text-to-Speech.

➡️ Explore the Generative API

📝 Natural Language Processing (NLP)

High-throughput, low-latency solutions for understanding and processing human language.

Tasks: Text Classification, Named Entity Recognition (NER), Sentence Embeddings for RAG, Summarization, and Translation.

➡️ Explore the NLP API

🎧 Audio & Signal Processing

Real-time pipelines for analyzing audio signals, from speech to environmental sounds.

Tasks: Speech Recognition, Audio Classification, Speaker Identification, Music Source Separation.

➡️ Explore the Audio & DSP API

📈 Time Series

Specialized solutions for forecasting and analyzing sequential data.

Tasks: Forecasting, Anomaly Detection, and Classification.

➡️ Explore the Time Series API

🧊 3D & Spatial Computing

Cutting-edge pipelines for processing 3D data from sensors like LIDAR.

Tasks: 3D Reconstruction (Gaussian Splatting), Point Cloud Detection, Point Cloud Segmentation.

➡️ Explore the 3D API

🌍 Geospatial

Specialized tools for analyzing satellite and aerial imagery.

Tasks: Building & Road Segmentation, Change Detection, Maritime Object Detection.

➡️ Explore the Geospatial API

⚕️ Medical Imaging

High-performance pipelines for medical image analysis.

Tasks: Tumor Detection, Cell Segmentation, Retinal Abnormality Scanning, Artery Analysis.

➡️ Explore the Medical API

📄 Document AI

Pipelines for understanding the structure and content of documents.

Tasks: Table Extraction, Signature Detection, Handwriting Recognition.

➡️ Explore the Document AI API

🚀 Specialized & RL

Hyper-specialized, high-value solutions for specific industries and advanced applications.

Tasks: Reinforcement Learning Policies, Financial (HFT) Models, Physics Simulation, Genomics.

➡️ Explore the Specialized API

The xInfer Model Zoo API