shotsan/readme.md

✋ I'm Santosh.

I have a PhD in Network Control and Learning from Texas A&M University, under Dr. P. R. Kumar.

Current Research Interests

  1. Applying Machine Learning for Large Scale Problems
  2. Training Neural Nets and Large Language Models

I was a TA for ECEN 740 Machine Learning in '22 and '24, the primary ML course in Electrical and Computer Engineering at Texas A&M.

I specialize in designing, building, and deploying plug-and-play intelligent systems.




Pinned

  1. Double-Descent-of-Neural-Networks (Public)

    Double Descent of Neural Networks

    Jupyter Notebook

  2. GLM-130B (Public)

    Forked from THUDM/GLM-130B

    GLM-130B: An Open Bilingual Pre-Trained Model

    Python

  3. gpt-fast (Public)

    Forked from pytorch-labs/gpt-fast

    Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

    Python

  4. hand-written-digits (Public)

    JavaScript

  5. attention_mechanims (Public)

    Forked from facebookresearch/xformers

    Hackable and optimized Transformers building blocks, supporting a composable construction.

    Python

  6. AppAgent (Public)

    Forked from mnotgod96/AppAgent

    AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

    Python