
AmirHossein (Amir) Razlighi

Computer Vision Researcher  |  3D Vision & Graphics  |  Generative Models

About Me

I'm currently an MSc student in Computer Science at UCY and a researcher in 3D Computer Vision and Generative AI at the GrUVi lab @ SFU. I am very interested in how computers perceive the world in 3D and how we can generate realistic worlds using machine learning! I am always open to discussions and connections on research, projects, and potential collaborations. Please feel free to reach out if you want to chat about anything related to computer vision or machine learning, or just want to say hi!

Contact:

Education

University of Cyprus MSc. Computer Science
Jan 2026 – Present
Sharif University of Technology BSc. Computer Engineering
October 2020 – July 2025
Cumulative GPA: 19.27 / 20.0  |  Major GPA: 19.65 / 20.0  |  Ranked in top 10% among 194 students.
Salam High School Diploma in Mathematics and Physics
2016 – 2019
GPA: 19.86 / 20.0

Research Experiences

GrUVi lab @SFU Research Assistant
Jan 2026 – Present
Supervisors: Prof. Ali (Arash) Mahdavi-Amiri & Prof. Daniel Cohen-Or
Working on video generative models and 3D computer vision.
INSAIT Summer Research Intern
Jun 2024 – Oct 2024
Supervisor: Dr. Danda Pani Paudel
Continued the project started at ETH. Worked on Dynamic 3D Reconstruction using Signed-Distance Functions (SDFs) integrated into the Gaussian Splatting pipeline. Created a dataset of challenging deformable scenes (breaking sphere, growing plant, etc.) and designed a complete pipeline for 3D scene encoding and geometry/color reconstruction.
ETH Zurich Research Assistant
Jun 2023 – Jul 2024
Supervisor: Dr. Danda Pani Paudel
Worked on NeRF models for 3D reconstruction of dynamic scenes. Studied SDFs to disentangle geometry and appearance. Developed morphing between two SDFs based on RGB images and designed SDF-based regularizers for learning continuous morphing in inverse rendering.
University of Toronto Research Assistant (Summer Internship)
Jun 2023 – May 2023
Supervisors: Dr. Florian Shkurti and Dr. Animesh Garg
Worked on vision-based robotic manipulation. Studied 3D scene encoding and reconstruction using NeRF models; implemented and customized SRT (Scene Representation Transformer). Customized the ARNOLD dataset for canonical cube views of robot scenes. Worked with the PerAct model and implemented custom agents in IsaacSim.

Industrial Experience

Tapsi Co. Data Scientist — MAP Team
Mar 2025 – Jan 2026

Worked on ETA and navigation for ride-hailing.

Promoted Jul 2025
Senior Expert Data Scientist

Worked on and supervised the full process (from R&D to production) of two projects:

  • A GNN for spatio-temporal modeling of cities and ETA prediction.
  • An attention-based post-processing model for enhancing ETA predictions.

  Together, these projects improved several important business metrics, including the accuracy of the final ETA and of map coloring.

    Started Mar 2025
    Expert Data Scientist
    Tapsi Co. Junior AR Systems Engineer
    Feb 2022 – Sep 2022  (University CO-OP)
    • First member of the R&D team "Tapsi Lab," using computer vision to improve ride-hailing UX.
    • Built the Driver-Arrived AR project from scratch, detecting driver position via camera + GPS integration.
    • Used TensorFlow.js to run the complete AR experience entirely client-side.
    • Created a customized version of AR.js with custom rendering and data-integration layers.
    • Collaborated cross-functionally with design, management, business, and DevOps teams.

    Publications

    Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta Distributions

    AmirHossein Naghi Razlighi, Elaheh Badali Golezani, Shohreh Kasaei

    3D Gaussian Splatting enables high-quality real-time rendering but often produces millions of splats, resulting in excessive storage and computational overhead. We propose a novel lossy compression method based on learnable confidence scores modeled as Beta distributions. Each splat's confidence is optimized through reconstruction-aware losses, enabling pruning of low-confidence splats while preserving visual fidelity. The proposed approach is architecture-agnostic and can be applied to any Gaussian Splatting variant. In addition, the average confidence values serve as a new metric to assess the quality of the scene. Extensive experiments demonstrate favorable trade-offs between compression and fidelity compared to prior work.

    N4DE: Neural 4D Evolution under Large Topological Changes from 2D Images

    AmirHossein Naghi Razlighi, Tiago Novello, Asen Nachkov, Thomas Probst, Danda Paudel

    In this work, we address the challenges in extending 3D neural evolution to 4D under large topological changes by proposing two novel modifications. More precisely, we introduce (i) a new architecture to discretize and encode the deformation and learn the SDF and (ii) a technique to impose the temporal consistency. (iii) Also, we propose a rendering scheme for color prediction based on Gaussian splatting. Furthermore, to facilitate learning directly from 2D images, we propose a learning framework that can disentangle the geometry and appearance from RGB images. This method of disentanglement, while also useful for the 4D evolution problem that we are concentrating on, is also novel and valid for static scenes.

    MARS: Multi-task Action Prediction for Robot Manipulation based on Scene Representation

    Haoping Xu, Richard Hanxu*, AmirHossein Naghi Razlighi*, Alan Aspuru-Guzik, Florian Shkurti, Animesh Garg

    This paper presents MARS (Multi-task Action Prediction for Robot Manipulation based on Scene Representation), a novel approach to robot manipulation tasks using a transformer-based framework. MARS leverages Scene Representation Transformers (SRT) to render orthogonal novel views from RGB multi-view observations, enabling the prediction of the next key frame's best end-effector pose and gripper state. The model is trained using the PerAct synthetic dataset, which includes expert demonstrations for a variety of tasks. By conditioning on language descriptions and utilizing keyframe-based behavioral cloning, MARS effectively generalizes across multiple tasks, demonstrating robust performance in complex robotic manipulation scenarios.


    Projects  (GitHub)

    Tiny NeRF

    A PyTorch implementation of tiny NeRF (Neural Radiance Field). A much smaller version of the original that can be trained even on CPU in a reasonable time.

    3D Computer Vision  |  Computer Graphics  |  PyTorch
    GitHub
    Ray Tracing from Scratch

    Basic ray tracing concepts implemented in Python using only NumPy and Matplotlib, for understanding and experimenting with fundamental ideas.

    Computer Graphics
    GitHub
    Marching Squares

    Python implementation of the Marching Squares algorithm for 2D terrain generation, with binary/float value systems and various noise generators.

    Computer Graphics
    GitHub
    Semantic Scholar Search Engine

    Three-phase search engine: (1) TF-IDF index + compression, (2) ML-based ranking (Naive Bayes, Neural Nets, LMs), (3) web scraping + Streamlit frontend.

    Information Retrieval  |  Machine Learning  |  Web Scraping
    GitHub
    BlurSim

    Introduces a new loss function for accurately comparing differences between two blurred images.

    Image Processing  |  Machine Learning  |  PyTorch
    GitHub
    CMinus Compiler

    CMinus-to-Python compiler with four phases: scanner, parser, error analysis, and semantic analysis.

    Compiler Design  |  Software Engineering
    GitHub

    Teaching & Volunteering

    Teaching Assistant — Sharif University of Technology

    Oct 2021 – July 2025  |  All positions are voluntary / unpaid.

    • Adv. 3D Computer Vision (Graduate) — Quizzes on multi-view reconstruction; lecture series on NeRF and Gaussian Splatting. Prof. Shohreh Kasaei
    • Modern Information Retrieval (Head of Project) — Led 3-phase final project: Classical IR, ML in IR, Deep methods + LLMs. Dr. Mahdieh Soleymani
    • Machine Learning (Head TA, Project Section) — Managed team of 10; CV and NLP final project. Dr. Fatemeh Seyed Salehi
    • Convex Optimization — Convex Functions questions; managed Duality homework grading. Dr. Amir Najafi
    • Digital Image Processing (Graduate) — Deep learning slides and teaching sessions; quizzes on image transformations and morphological operations. Prof. Shohreh Kasaei
    • Scientific and Technical Presentation (Head TA). Prof. Shohreh Kasaei
    • Artificial Intelligence — Computer Vision project designer. Dr. M. H. Rohban
    • Artificial Intelligence — Head of final project design. Dr. M. H. Rohban, Dr. M. Soleymani
    • Fundamentals of Computer Vision (Head of Workshops) — Exercises on 3D Vision. Prof. Shohreh Kasaei
    • Database Design. Dr. M. Varmazyar
    • Linear Algebra (2 semesters) — Re-designed all slides; theoretical exercise design. Dr. Maryam Ramezani
    • Probability and Statistics. Dr. Mahdi Jafari
    • Advanced Programming in Java (3 semesters) — Held 3 OOP workshops; designed GUI phase of course project.
    Other Volunteering Activities
    • Vice President of Students' Scientific Chapter (SSC)
    • Head of Technical Beta Team (8th WSS) — Created an R&D chatbot trained on FAQ data to help conference participants get instant answers. GitHub
    • Founder of Byte Publication. LinkedIn
    • Enrolled in "RTI" Program (Oxford University)
    • Member of "3D Vision" Reading Group (University of Toronto)

    Hobbies

    Outside of work and studies, I enjoy playing video games and reading science fiction novels! You can find me via the links below: