InternshipFull-time

[Intern] AI Research Scientist

VLA, World Model

제2판교 IT센터

상시 채용

Project Overview

WoRV is developing a World Foundation Model (WFM) for indoor multi-environment mobile manipulation robots as part of the "Physical AI Leading Technology Development" national R&D consortium. We train our own VLA (Vision-Language-Action) model that integrates video, language, and action — and we close the entire cycle from top-tier conference publication to real-robot demonstration and share it publicly.

To support this, we have on-premise designed and built our own 96 H100 GPU high-performance training cluster and a 600+ TB · 200 GB/s parallel storage system, and we are building a multimodal training dataset of 10,000+ hours through our own collection pipeline based on UMI rigs and bimanual robot platforms. On top of this infrastructure, we are building world models at the frontier of the field.

We are looking for passionate researchers to join this project. We welcome applications from anyone interested in World Model / VLA research and development.

WoRV is a fast-growing in-house startup. The process bends to fit the role and the candidate — we'd rather get to know you well than push you through a fixed funnel.

Research Topics

The World Model research team works at the intersection of the following topics. Interns participate deeply in one track based on their interests and skills; the match is determined together during the technical interview.

Long-context multimodal modeling — learning and inference architectures for long-horizon video, language, and action sequences
Policy learning for real-robot deployment — models that simultaneously satisfy latency, robustness, and generalization requirements
VLA / Robotics Foundation Model training pipelines — pre-training, fine-tuning, and evaluation cycle operations
Closed-loop and OOD evaluation — quantifying generalization performance on simulation and real-robot benchmarks
Scaling analysis & data curation — training data mixture design and scaling law validation
Bridging WFM → RFM — training and alignment strategies that turn World Foundation Models into Robotics Foundation Models

Focus topics may shift as the field evolves and research progresses — we concentrate resources on the highest-impact direction at each moment.

Responsibilities

VLA model training and large-scale experiment management
Experiment design, ablation studies, and result analysis
Reproducing public benchmarks and using evaluation frameworks such as vla-eval (co-developed with AI2)
Closed-loop evaluation in simulation and on real robots
Co-authoring top-tier conference papers (co-authorship possible based on contribution)

Requirements

Practical deep learning research and implementation experience (PyTorch, project-level or above)
Multi-GPU distributed training experience
Ability to design ablations and manage experiment reproducibility
Ability to read and write English academic papers
Available for full-time 3+ months

Preferred Qualifications

Experience training or reproducing VLA / VLM models (π0, GR00T, etc.)
Research experience in robot policy learning: Imitation Learning, Reinforcement Learning, etc.
Experience implementing Diffusion / Flow Matching policy models
Experience designing training data mixtures or validating scaling laws
First-author publication at a top-tier venue (NeurIPS, ICLR, CVPR, ICRA, IROS)

Work Arrangement

Full-time 3 months (extension or conversion to full-time negotiable)
Second Pangyo IT Center (on-site)
Competitive internship compensation
Priority consideration for full-time Research Engineer position after the internship

Research Infrastructure

On-premise 96 H100 GPU training cluster (CORE Cluster) — in-house designed and built, research-dedicated, no queue
Government-funded Blackwell training cluster — multiple B200 nodes secured through national R&D programs, with further capacity expansion planned
In-house distributed storage cluster — 600+ TB · 200 GB/s bandwidth
10,000+ hours of multimodal training data
In-house data collection pipeline (UMI rigs, bimanual robot platforms, etc.)
Weekly 1:1 mentoring, paper reading, and research sync
Full sponsorship for international conference presentations

Application Notes

Resume + cover letter (PDF)
Portfolio (GitHub, papers, project links)
Select 1–2 research topics of interest and write 1–2 paragraphs on why (in the Google Form's Additional Self-PR section)

Selection Process

Internship period: At least 2 months from the joining date (negotiable depending on the applicant's academic schedule)
The selection process may change; an executive interview may be added depending on the results of the technical interview.
Application results will be sent sequentially to candidates who pass the resume screening.

Additional Notes

This is an experiential internship during the vacation period; contract extension may be considered depending on return-to-school status and performance evaluation.

Hiring Process

Application

Coding Test

& Assignment

1st Interview

Technical interview

2nd Interview

Culture-fit interview

Offer

Hired

Join WoRV's World Model project as a short-term research intern

A focused R&D internship where you pair 1:1 with a full-time researcher and own a complete segment of the training, evaluation, and publication cycle

Apply Coffee Chat