Emotion Multimodal Hybrid App

A new standalone project built on the voice-emotion baseline concept. It does not modify the existing:

emotion-voice-app (GitHub)
ShiroOnigami23/emotion-voice-engine (HF model)

Goal

Unified hybrid emotion detection with three modalities from one video clip:

Audio signal extracted from video
Facial expression from extracted key frames
Temporal video branch (frame sequence)

The final output is a single emotion prediction fused from all three branches.

Emotion classes (expanded): angry, calm, disgust, fear, happy, neutral, sad, surprise
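
The fusion rule itself is not described here, so the following is only a minimal sketch of one common option, weighted late fusion: each branch produces a probability vector over the eight classes, and the fused prediction is the class with the highest weighted average. The function names, the branch keys (audio, face, video), and the equal weights are assumptions for illustration, not the project's actual code or the contents of fusion_config.json.

```python
from typing import Dict, List

# Class order taken from the list above.
EMOTIONS: List[str] = [
    "angry", "calm", "disgust", "fear", "happy", "neutral", "sad", "surprise",
]


def fuse_probabilities(
    branch_probs: Dict[str, List[float]],
    weights: Dict[str, float],
) -> List[float]:
    """Weighted average of per-branch class probabilities (hypothetical helper)."""
    total_weight = sum(weights[name] for name in branch_probs)
    if total_weight <= 0:
        raise ValueError("branch weights must sum to a positive value")

    fused = [0.0] * len(EMOTIONS)
    for name, probs in branch_probs.items():
        if len(probs) != len(EMOTIONS):
            raise ValueError(f"branch {name!r} must return {len(EMOTIONS)} probabilities")
        for i, p in enumerate(probs):
            # Dividing by the total weight keeps the fused vector summing to 1.
            fused[i] += weights[name] * p / total_weight
    return fused


def predict_emotion(
    branch_probs: Dict[str, List[float]],
    weights: Dict[str, float],
) -> str:
    """Return the label with the highest fused probability."""
    fused = fuse_probabilities(branch_probs, weights)
    best_index = max(range(len(fused)), key=fused.__getitem__)
    return EMOTIONS[best_index]


if __name__ == "__main__":
    # Made-up probabilities: audio is undecided, face and video lean "happy".
    uniform = [0.125] * 8
    leaning_happy = [0.05, 0.05, 0.05, 0.05, 0.65, 0.05, 0.05, 0.05]
    probs = {"audio": uniform, "face": leaning_happy, "video": leaning_happy}
    equal_weights = {"audio": 1.0, "face": 1.0, "video": 1.0}
    print(predict_emotion(probs, equal_weights))
```

Averaging probabilities rather than taking a majority vote lets a confident branch outweigh an uncertain one, which is one reason late fusion is a frequent first choice for multimodal classifiers.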

Live Demo

Hugging Face Space (connected app): https://huggingface.co/spaces/ShiroOnigami23/emotion-multimodal-app
Hugging Face model repo: https://huggingface.co/ShiroOnigami23/emotion-multimodal-engine

Training (Kaggle)

Kernel path: kaggle_kernel/

Datasets used:

uwrfkaggler/ravdess-emotional-speech-audio
ejlok1/cremad
adrivg/ravdess-emotional-speech-video
astraszab/facial-expression-dataset-image-folders-fer2013

Note: FER2013 folder labels are numeric (0..6) and are mapped internally to emotion classes.
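
As a rough illustration of that mapping, the sketch below assumes the folders follow the original FER2013 index order (0 = angry through 6 = neutral). The project's internal mapping may differ, and the names used here are hypothetical. Note that FER2013 has no "calm" class, so that label can only come from the other datasets.

```python
# Assumed index order of the original FER2013 release; verify against the dataset.
FER2013_INDEX_TO_EMOTION = {
    0: "angry",
    1: "disgust",
    2: "fear",
    3: "happy",
    4: "sad",
    5: "surprise",
    6: "neutral",
}


def fer_folder_to_emotion(folder_name: str) -> str:
    """Translate a numeric folder name such as "3" into an emotion label."""
    index = int(folder_name.strip())  # raises ValueError for non-numeric names
    if index not in FER2013_INDEX_TO_EMOTION:
        raise ValueError(f"unexpected FER2013 folder label: {folder_name!r}")
    return FER2013_INDEX_TO_EMOTION[index]


if __name__ == "__main__":
    for folder in ["0", "3", "6"]:
        print(folder, "->", fer_folder_to_emotion(folder))
```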

Run

Start a long training run without waiting for it to finish:

python scripts/start_kaggle_training.py --owner aryanchande23l

Check the run status later:

python scripts/check_kaggle_status.py --owner aryanchande23l

Relaunch automatically until a compatible GPU is allocated (here, up to 5 attempts):

python scripts/relaunch_until_gpu_compatible.py --owner aryanchande23l --max-attempts 5
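
The internals of relaunch_until_gpu_compatible.py are not shown here. The sketch below only illustrates the general bounded-retry pattern that a --max-attempts flag suggests. The launch and is_compatible callables and the GPU names are placeholders, not the script's real logic.

```python
from typing import Callable, Optional


def relaunch_until_compatible(
    launch: Callable[[], str],
    is_compatible: Callable[[str], bool],
    max_attempts: int = 5,
) -> Optional[int]:
    """Return the 1-based attempt that got a compatible GPU, or None if none did."""
    for attempt in range(1, max_attempts + 1):
        gpu_name = launch()  # placeholder: start a run and report the allocated GPU
        if is_compatible(gpu_name):
            return attempt
    return None


if __name__ == "__main__":
    # Simulated allocations with made-up GPU names.
    allocations = iter(["gpu-a", "gpu-a", "gpu-b"])
    result = relaunch_until_compatible(
        launch=lambda: next(allocations),
        is_compatible=lambda name: name == "gpu-b",
        max_attempts=5,
    )
    print("compatible on attempt:", result)
```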

When the run completes, download the outputs:

kaggle kernels output aryanchande23l/emotion-multimodal-hybrid-trainer-v1 -p kaggle_pull

Expected outputs:

audio_model.pt
face_model.pt
video_model.pt
fusion_config.json
metrics.json
run_version.json
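
Before uploading, it can help to confirm that the download actually contains every file listed above. This is a small standalone sketch, not one of the repository's scripts. The missing_outputs helper is hypothetical, and kaggle_pull is the directory used in the download command.

```python
from pathlib import Path
from typing import List

# File names copied from the expected outputs list above.
EXPECTED_OUTPUTS = [
    "audio_model.pt",
    "face_model.pt",
    "video_model.pt",
    "fusion_config.json",
    "metrics.json",
    "run_version.json",
]


def missing_outputs(outputs_dir: str) -> List[str]:
    """Return the expected file names that are absent from outputs_dir."""
    root = Path(outputs_dir)
    return [name for name in EXPECTED_OUTPUTS if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_outputs("kaggle_pull")
    if missing:
        print("Missing outputs:", ", ".join(missing))
    else:
        print("All expected outputs are present.")
```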

Upload Model to Hugging Face

set HF_TOKEN=YOUR_TOKEN
python scripts/upload_to_hf.py --model-repo-id ShiroOnigami23/emotion-multimodal-engine --outputs-dir kaggle_pull

Run Locally

pip install -r requirements.txt
streamlit run app.py

Publish HF Space

set HF_TOKEN=YOUR_TOKEN
python scripts/publish_space.py --space-id ShiroOnigami23/emotion-multimodal-app

Android APK Release

The Android wrapper project lives under android_app/ and opens the deployed HF Space.

CI workflow: .github/workflows/android-release.yml
Trigger a release by pushing a tag (for example, v1.0.0) or by using workflow dispatch.
The signed APK is uploaded to GitHub Releases automatically.

Safety Notice

This is an affect-recognition research tool. It is not a diagnostic, clinical, legal, or hiring decision system.
