Research
I am interested in computer vision and have also worked in vision-language integration.
Currently, I am focused on developing vision-language-action models, leveraging advanced techniques in object detection, dense video captioning, and action generation to create systems that can interpret and act upon visual data with a high degree of accuracy and context-awareness.
For a full list of publications please see Google Scholar.
|
|
A Neural ODE and Transformer‑based Model for Temporal Understanding and Dense Video Captioning
Sainithin Artham, Soharab Hossain Shaikh
Multimedia Tools and Applications, 2024
arXiv / code, models, data, project page
Dense video captioning with Neural ODE
|
|
Pred-AHCP: Robust feature selection enabled Sequence Specific Prediction of Anti-Hepatitis C Peptides via Machine Learning
Akash Saraswat, Utsav Sharma, Aryan Gandotra, Lakshit Wasan, Sainithin Artham, Arijit Maitra, Bipin Singh
Journal of Chemical Information and Modeling, 2024
arXiv / code, models, data, project page
We developed an explainable ML model that harnesses the amino acid sequence of a peptide to predict its potential as an anti-HepC (AHC) agent
|
This guy is good at website design.
|
|