
Applications Open · June–August 2026 · 12 Weeks · Remote-Friendly
Build the Future of Visual AI With Us
Every major lab is racing to improve language models. We're building something different.
At Memories.ai, we've created the world's first Large Visual Memory Model (LVMM) — AI infrastructure that can process, understand, and remember video at scale. Think of it as long-term memory for machines, but for everything they see.
This summer, we're inviting a small group of ambitious researchers to join us and push the boundaries of what's possible in visual AI.
What Is the Research Fellowship?
The Memories.ai Research Fellowship is a 12-week intensive research program (running from June 1 to August 30, 2026) for PhD students, Master's students, and early-career researchers working in computer vision, NLP, multimodal AI, or data systems.
You won't be shadowing anyone or working on side projects. You'll be embedded with the Memories.ai research team, tackling real, unsolved problems at the frontier of video intelligence and memory systems.
We're selecting 5–8 fellows for Summer 2026.
What You'll Work On
Our research spans some of the most exciting and challenging areas in AI today:
Egocentric Video Understanding
First-person perspective modeling: teaching AI to understand the world as humans experience it, from wearable cameras and AR devices.
Large Visual Memory Models (LVMM)
Our flagship research direction: building architectures that give AI systems persistent, retrievable visual memory across long time horizons.
Real-Time Video Intelligence
Edge AI and streaming inference: processing live video feeds with low latency for real-world applications.
Multimodal Memory Systems
Cross-modal retrieval and reasoning: connecting what AI sees, hears, and reads into unified memory representations.
Data Infrastructure at Scale
Scalable pipelines for processing, indexing, and retrieving massive video datasets — the unglamorous backbone that makes everything else possible.
What You'll Get
We don't just offer a research topic and wish you luck. We give you everything you need to do world-class work.
- Unlimited Cloud Compute — Full access to AWS and GCP compute credits. No rationing GPU hours. Train the models you need to train.
- State-of-the-Art GPU Clusters — Access to our dedicated infrastructure for large-scale model training and experimentation.
- Proprietary Datasets — Work with Memories.ai's proprietary video datasets and memory architectures — data you won't find anywhere else.
- 1:1 Mentorship — Weekly one-on-one sessions with senior researchers on the Memories.ai team offering dedicated, personalized mentorship.
- Access to World-Class Advisors — Connect with industry experts and academic collaborators across our research network.
- Publication Support — We actively support fellows in publishing at top-tier venues: ICLR, NeurIPS, CVPR, and more. Your research here can go on your CV.
- Visa Sponsorship — The fellowship is remote-friendly, and international applicants are welcome. We provide visa sponsorship and immigration support so the best researchers can join regardless of location.
Who Should Apply?
This fellowship is designed for researchers who:
- Are currently pursuing a PhD or Master's degree (or recently completed one) in computer science, AI/ML, or a related field
- Have hands-on experience with deep learning frameworks (PyTorch, JAX, etc.)
- Have published or are working toward publications in relevant venues
- Are excited about video understanding, multimodal AI, or large-scale data systems
- Move fast, think deeply, and want to work on problems that haven't been solved yet
We value intellectual curiosity and a builder mentality over pedigree. If you're doing interesting work, no matter where, we want to hear from you.
Program Details
- Duration: 12 weeks (June 1 – August 30, 2026)
- Cohort Size: 5–8 fellows
- Format: Remote-friendly (with optional in-person collaboration)
- Final Presentations: August 25–30, 2026
How to Apply
Ready to build the future of visual AI?
Fill out the application form. We'll review applications on a rolling basis.
Questions? Reach out to us at [email protected].
About Memories.ai
Memories.ai is building the infrastructure for machines to remember what they see. Our team is working on the Large Visual Memory Model — a new class of AI system that processes, understands, and retrieves video at scale. We're a research-first company backed by a team of world-class engineers and scientists who believe the next frontier of AI isn't just language — it's vision, memory, and multimodal understanding.
Learn more at memories.ai.