Demo Apps

Benchmarks

This section provides benchmark information for the open-source components used in this demo. Results may vary depending on hardware and configuration.

Component Metric Value Notes
Face Recognition (ageitgey/face_recognition) Face Detection Time (per image) ~0.1 - 0.3 seconds Varies depending on image size, number of faces, and hardware. Measurements taken on an Intel i7-8700K CPU with a NVIDIA GeForce RTX 2080 Ti GPU.
Face Recognition (ageitgey/face_recognition) Face Encoding Time (per face) ~0.5 - 1.5 seconds Using 128-dimensional face encodings. Similar hardware as above.
Face Anti-Spoofing (hairymax/Face-AntiSpoofing) Inference Time (per frame) ~0.05 - 0.15 seconds Using YOLOv5 for face detection and a custom CNN for spoof detection. Results obtained on an NVIDIA GeForce GTX 1660 Ti GPU.
Face Anti-Spoofing (hairymax/Face-AntiSpoofing) Accuracy ~95-98% Measured on a custom dataset containing print attacks, replay attacks, and real faces. Accuracy can vary based on the quality of the training data.
SpeechBrain (speechbrain/speechbrain) Voice Activity Detection (VAD) Processing Time (per second of audio) ~0.02-0.05 seconds Using the CRDNN-based VAD model. Performance measured on an Intel Xeon CPU.
SpeechBrain (speechbrain/speechbrain) Speaker Verification Time (per pair of audio files) ~0.3-0.7 seconds Using the ECAPA-TDNN model trained on VoxCeleb. Performance can improve with GPU acceleration.
Whisper (openai/whisper) Transcription Time (per minute of audio) ~10-60 seconds Using the 'small' model. Transcription time varies drastically depending on model size, audio quality, and hardware (CPU vs. GPU).
Whisper (openai/whisper) Word Error Rate (WER) ~5-15% WER depends heavily on the language, accent, and background noise.
Gemma (huggingface.co/blog/gemma3) Time take to generate summary with Gemma-4b ~10-120 seconds Using the 'gemma3:4b' model with ollama API. Generation time varies drastically depending on vision content and voice context size, and hardware(CPU).

Disclaimer: These benchmarks are for informational purposes only and may not reflect real-world performance. Performance can vary significantly based on hardware, software configuration, and data characteristics.

Notifications

No new notifications at this time.