Research Scientist- Vision and Language

Posted 27 February 2024
Salary $200000-$320000
LocationSan Jose
Job type Permanent
Discipline Data & AI
Contact NameAlexis Navarro
Remote working Hybrid/Flexible

Job description

About the Opportunity: Join a dynamic team at the forefront of empowering content understanding and creation through cutting-edge research and development in computer vision (CV) and natural language processing (NLP). Our mission centers around pioneering R&D in multi-modal understanding, vision and language integration, foundation models, and audio/music understanding and generation, all aimed at enhancing content creation. The team comprises a blend of seasoned research scientists and engineers dedicated to advancing research in multi-modality and applying these findings to elevate user experiences.

Key Responsibilities:

  • Engage in state-of-the-art research and development in CV and NLP, focusing on multi-modality and the intersection of vision and language.
  • Publish and disseminate cutting-edge research findings, contributing to the organization's reputation in the scientific community.
  • Translate research insights into product innovations, exploring new product concepts with CV/NLP at their core.


  • Proven research and engineering background in CV and NLP, with a special interest in multi-modal understanding, vision and language applications (e.g., video captioning, visual question answering, text-to-video retrieval).
  • Experience handling large-scale datasets and developing foundation models to work with such volumes.
  • Proficiency in language models and their application in various tasks.
  • Knowledge in the domain of audio/music understanding and generation.
  • A track record of publications in top-tier scientific venues (CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, EMNLP, ACL, COLING, etc.).
  • Excellent algorithmic and programming skills, with proficiency in Python and popular deep learning frameworks.
  • Strong teamwork capabilities alongside the ability to work independently and excellent communication skills.

This position is ideal for individuals passionate about pushing the boundaries of CV and NLP research and eager to apply their findings in real-world applications. If you are driven by innovation and collaboration, we welcome you to apply for this exciting role in a team that values pioneering research and the impact it has on content creation and understanding.