Skip to content
View akhilpandey95's full-sized avatar
🐒
finding pebbles
🐒
finding pebbles

Organizations

@MozillaIndia @webcompat @fossasia @pClub-gu @asq-ai @NIU-DATA-Lab

Block or report akhilpandey95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
akhilpandey95/README.md

About Me

I am Research Scientist at the Center for Science of Science and Innovation working for Dashun Wang at the Kellogg School of Management, Northwestern University. As a Research Scientist, I oversee the lab's overall AI efforts, data analytics, and data management. I work closely with other team members to ensure these initiatives align with the lab's goals and contribute to its broader research objectives. My research interests lie in the intersection of Large Language models, Science of Science, Representational Learning, and Uncertainty Quantification.

Previously, I was a Ph.D student advised by Dr. Hamed Alhoori, and my dissertation fully funded by the NSF grant. focussed on building a predictive modeling framework to investigate the Reproducibility crisis in AI. Previously, I was a Givens Research Associate and an Argonne Leadership Computing Facility Graduate Student researcher.

My most recent projects include training reasoning models and doing tests on scaling hypothesis for test time compute for scientific text generation. Specifically, finetuning and performance tuning DPO, ORPO, and recently GRPO on different open weight models. More can be found here. Additionally, I'm interested in conducting research investigating the topic of credit assignment problem for qualitative texts in science (peer-reviews). More specifically, if there exists a scientific task, Creating reward signals for LLM reasoning beyond math/programming domains is hard is a well agreed upon notion. More about this can be found in project LMRSD.

  • πŸ”­ I’m currently working on fine-tuning large language models, implementing LLM agents, LLM's + Graphs.
  • 🌱 I’m currently exploring Ways bring GraphRAG to the larger scholarly ecosystem, and Science of Science data
  • πŸ‘― I’m looking to collaborate on Building microservices for LLMs; Agentic LLM tooling as a service layer
  • πŸ€” I’m always looking for help with Building Knowledge Graphs within Citation Networks
  • πŸ’¬ Ask me about Large Language models, Graph Learning, Uncertainty Quantification, Neural Architecture Search.
  • πŸ“« How to reach me: @akhilpandey95
  • πŸ˜„ Pronouns: (he/him)
  • ⚑ Fun fact: Haskell has type inference, meaning it can automatically determine the type of a data by looking at how it is created.

Active projects:


Pinned Loading

  1. uncertainty uncertainty Public

    Work as part of ANL summer 2020 research on uncertainity quanitification methods in graph neural networks

    Python 1

  2. gnnNAS gnnNAS Public

    Work as part of ANL summer 2022 research with emphasis on utilizing symbolic programming to perform NAS on graph neural networks

    Python 2

  3. reproducibilityproject/IRSI reproducibilityproject/IRSI Public

    Reproducibility is an important feature of science; experiments are retested, and analyses are repeated.we examine a myriad of features in scholarly articles published in computer science conferenc…

    1

  4. reproducibilityproject/NLRR reproducibilityproject/NLRR Public

    Code, data, and supplemental information for the paper "Navigating the landscape of Reproducible Research: A predictive modeling study

    Python 1

  5. s1 s1 Public

    Experiments on test-time scaling approaches for reasoning LM's to enforce better <think> or <wait> capabilities.

    Jupyter Notebook 1

  6. LMRSD LMRSD Public

    Examining the spectrum of feature representation choices on Science of Science problems

    Python