Skip to content
View Srijito354's full-sized avatar

Block or report Srijito354

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Srijito354/README.md

bg

It's Srijito Ghosh, here!

Tech Stack:

About Me:

  • I am a 3rd Computer Science and Engineering undergraduate student at the University Insitute of Technology, The University of Burdwan.
  • My current work involves reading and understanding seminal research work in the field of Generative AI and Robotics, while trying to implement them from scratch and making them adapt to my hardware and institutional infrastructure constraints. Such constraints make searching for efficiency not just an option but rather a requirement.

What I am currently working on:

  • I have started reading research material required for contributing to lerobot.
  • I recently completed reading the Swin Transformer and SmolVLA papers to better understand implementing efficient architectures to achieve efficiency and Edge-AI goals without compromising on model performance and precision.

Brief on what I have worked on:

I had started with building basic image classifiers in the senior year of high school, post which I moved onto using APIs in my first year, but realised that architectural understanding is what interests me the most. This made me start reading research articles and their related blog-posts by the end of my second year examinations. My considerable projects include:

  • Creating a conditional diffusion simulator on 2D point clouds. Click here
  • Building a Behavioral Cloning agent from scratch and testing it on the Udacity self-driving simulator. This involved manually collecting equal laps of data by driving both in the clock-wise and anti-clockwise directions to cancel out the left-turn bias in the data from the default set-up, and creating an action adapter for a Resnet-18 backbone to predict steering and throttle values. Click here
  • Making a CLIP-based VQA (Visual Question Answering) model, with everything from the image and text encoder being implemented from scratch. Click here

srijito74596 https://www.linkedin.com/in/srijito-ghosh-243726256 the_panther16

Pinned Loading

  1. Behavioral_cloning_vs_Generalisation Behavioral_cloning_vs_Generalisation Public

    Python

  2. Behavioral_cloning_self_driving Behavioral_cloning_self_driving Public

    Python

  3. lossfunk_intro_project lossfunk_intro_project Public

    Jupyter Notebook 1

  4. Diffusion_simulator_clean Diffusion_simulator_clean Public

    Python

  5. Latent-CLIP Latent-CLIP Public

    Jupyter Notebook