Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2 
Published in BCCA 2020, 2020
Chain FL: Decentralized federated machine learning via blockchain.
Download here
Published in ChartQA @ CVPR 2021, 2021
Integrating image data extraction and table parsing methods for chart question answering.
Download here
Published in ACL 2022, 2022
Chart-to-Text: A large-scale benchmark for chart summarization.
Download here
Published in ACL 2022 Findings, 2022
ChartQA: A benchmark for question answering about charts with visual and logical reasoning.
Download here
Published in EuroVis 2022, 2022
Chart Question Answering: State of the art and future directions.
Download here
Published in EMNLP 2023, 2023
UniChart: A universal vision-language pretrained model for chart comprehension and reasoning.
Download here
Published in arXiv Preprint, 2023
Do LLMs Work on Charts? Designing few-shot prompts for chart question answering and summarization.
Download here
Published in AIFinSI @ AAAI 2024, 2024
LongFin: A multimodal document understanding model for long financial domain documents.
Download here
Published in ACL 2024, 2024
ChartInstruct: Instruction tuning for chart comprehension and reasoning.
Download here
Published in EMNLP 2024, 2024
An extensive investigation into the capabilities and limitations of Large Vision Language Models for chart comprehension and reasoning.
Download here
Published in RBFM @ NeurIPS 2024, 2024
BigDocs is a permissively-licensed dataset for training vision-language models on document and code tasks.
Download here
Published in arXiv Preprint, 2025
Apriel-1.5-15b-Thinker.
Download here
Published in arXiv Preprint, 2025
LLM-based data science agents: A survey of capabilities, challenges, and future directions.
Download here
Published in arXiv Preprint, 2025
Improving GUI Grounding with Explicit Position-to-Coordinate Mapping.
Download here
Published in arXiv Preprint, 2025
Scope: Selective Cross-modal Orchestration of Visual Perception Experts.
Download here
Published in ACL Industry Track 2025, 2025
Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reasoning?
Download here
Published in IEEE MLSP 2025, 2025
Learning or Cheating? Assessing Data Contamination in Large Vision-Language Models.
Download here
Published in IEEE MLSP 2025, 2025
Colflor: Towards Bert-Size Vision-Language Document Retrieval Models.
Download here
Published in ICLR 2025, 2025
BigDocs is a permissively-licensed dataset for training vision-language models on document and code tasks.
Download here
Published in COLING 2025 Industry Track, 2025
ChartGemma is a visual instruction-tuned model for chart reasoning.
Recommended citation: Masry, A., Thakkar, M., Bajaj, A., Kartha, A., Hoque, E., & Joty, S. (2025). ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild. COLING 2025. https://arxiv.org/abs/2407.04172v1
Published in EMNLP 2025 Industry Track, 2025
COLMATE: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval.
Download here
Published in ACL 2025 Findings, 2025
ChartQAPro: A more diverse and challenging benchmark for chart question answering.
Download here
Published in COLM 2025, 2025
BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning.
Download here
Published in NeurIPS 2025, 2025
AlignVLM: Bridging vision and language latent spaces for multimodal understanding.
Download here
Published in EACL 2026, 2026
DashboardQA: Benchmarking Multimodal Agents for Question Answering on Interactive Dashboards.
Download here
Undergraduate course, Office of Learning and Teaching (KOLT), Koç University, 2019
Role: Undergraduate Teaching Assistant
Period: Spring Semester, 2019
Location: Istanbul, Turkey
Graduate and Undergraduate courses, York University, 2021
Role: Teaching Assistant
Period: January 2021 – April 2022 & September 2024 – Present
Location: Toronto, Ontario