Changho Choi

Computer Science undergraduate at Korea University (expected Aug 2026) specializing in computer vision and generative models. With over six years of applied machine learning experience across vision and audio—including research and engineering roles at Maum.AI, Pion Corporation, OptimizerAI, and the Vision & AI Lab. I am actively seeking PhD opportunities in computer vision and general machine learning.

Email  /  CV  /  GitHub  /  Google Scholar  /  LinkedIn  /  X

profile photo

Publications

project image

MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing


Changho Choi, Minho Kim, Jinkyu Kim
CVPR Findings, 2026
arxiv / code /

A causal sequential visual encoder that achieves input-size agnostic charateristic like human vision by using a pure Mamba2 backbone. We designed a strictly unidirectional approach and relative move embeddings, MambaEye ensures translation invariance and adaptability to arbitrary resolutions.

project image

SAOInstruct: Free-form Audio Editing using Natural Language Instructions


Michael Ungersböck, Florian Grötschla, Luca A. Lanzendörfer, June Young Yi, Changho Choi, Roger Wattenhofer
NeurIPS, 2025
arxiv / code / website /

A model based on Stable Audio Open for flexible, free-form natural language audio editing. Trained on a novel dataset of audio editing triplets, it generalizes to real-world audio and unseen instructions.

project image

LatentSwap: An Efficient Latent Code Mapping Framework for Face Swapping


Changho Choi, Minho Kim, Junhyeok Lee, Hyoung-Kyu Song, Younggeun Kim, Seungryong Kim
Arxiv preprint, 2024
arxiv / code /

A lightweight face swapping framework that generates latent codes for pre-trained generators without external datasets. It features a fast training process with a simple three-term loss, producing high-resolution results comparable to state-of-the-art models.




Work & Research Experience

Besides my work on the research and publications above, a sampling of my past works

project image

Vision & AI Lab, Korea University


Undergraduate Researcher
2025-03 ~ 2025-12

Conducted research on causal sequential visual encoders under Prof. Jinkyu Kim. Proposed novel vision encoder architecture, MambaEye.

project image

OptimizerAI


AI Researcher
2024-10 ~ 2024-12

Contributed to research and development for the Text-to-SFX Model v2, achieving state-of-the-art human preference win rates.

project image

Pion Corporation (vcat.ai)


AI Engineer
2023-06 ~ 2024-10

Developed deep learning models for optimal product image recommendation and online image clustering using CLIP and Triton Inference Server.

project image

Maum.AI


AI Scientist
2019-03 ~ 2023-05

Led the computer vision research team and worked on various CV tasks (Super Resolution, Face Swapping) through open-source utilization and paper implementation.




Other Projects

These include coursework, side projects and unpublished research work.

project image

HifiFace PyTorch Implementation


projects
2021-12
code /

Developed an unofficial PyTorch implementation of HifiFace, the state-of-the-art face swapping model at the time (Stars: 300+).

project image

FaceShifter Pytorch Implementation


projects
2020-10
code /

Implemented FaceShifter, one of the first GAN-based face swapping models, from scratch in PyTorch (Stars: 600+).


Design and source code from Jon Barron's website