Collaborative generation of unique audiovisual experiences using NFC identity cards
Updated Jan 20, 2021 - TypeScript
All content produced for the PF (Projeto FEUP) course unit of the Informatics and Computing Engineering programme at FEUP
Multitasking multimodal AI material that focuses on human interaction and assistance
Utilizing a multimodal architecture to predict the appropriate speaker turn in a dialogue.
A notebook to learn about ML for astronomy through BTSbot.
Visuo-haptic integration during texture exploration
In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library.
[COLM 2024] ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Experimental HPC-accelerated deep learning research, a next-gen R&D AI project with a Scala API. 🚀
This repo collects Multi-modal Machine Learning papers.
AMR extension for the spatial domain, with grounded frame of reference tracking
Accepted at The Web Conference 2024.
Multi-angle Lip Multimodal Video Data
Research on a unified OCR interface or an OCR-ability-enhanced multimodal model
Comparison of multimodal models for Emotion Detection on IEMOCAP
The first public transport search written in Flutter Web
A repository of Video Language papers, code and datasets.
Localized Multimodal Large Language Model (MLLM) integrated with Streamlit and Ollama for text and image processing tasks.