Welcome! I'm a software developer at North Carolina State University.


Interests

My interest is in the field of data science and software development.

Education

  • M.S., Computer Science, 2018 - 2020
    Department of Computer Science
    North Carolina State University, USA

  • B.E., Computer Science and Engineering, 2014 - 2018
    Department of Computer Science & Engineering
    Visvesvaraya Technological University, India

Courses

    Graduate courses
  • Design and Analysis of Algorithms
  • Software Engineering
  • Operating Systems
  • Database Management System
  • Neural Networks
  • Algorithms for Data Guided Business Intelligence
  • Artificial Intelligence
  • Automated Learning and Data Analysis
  • Machine Learning for User-Adaptive Systems
  • IoT Analytics

Experience

  • Summer Research

    Solar Forecasting using LSTM
    University of Wisconsin–Madison, USA
    Jun 2019 - Aug 2019

    This study focuses on developing a solar forecasting model to address the challenge of daily and seasonal variation in solar irradiance and load usage in rural solar home systems.

    • The forecasting methodology accounts for variation in weather data as well as battery utilization, leading to more optimal energy usage.
    • Performed analysis of multivariate time series field data from real homes power and weather data leading to normalized RMSE of 0.11 in the predictions.

  • Data Science Intern

    LUCI : Land use change analysis of Indian subcontinent using Machine Learning
    Indian Space Research Organisation, India
    Jan 2018 - June 2018

    This project uses machine learning techniques to better predict, analyze and track land use and land cover changes in the Indian subcontinent using satellite remote sensing data from Indian Space Research Organization (ISRO) to develop a thematic map depicting land cover.

    • Developed a machine learning model to predict land cover change using satellite images for segmentation and classification of terrain using deep learning techniques using OpenCV, Keras, and Python.
    • Collaborated with ISRO scientists to optimize model with 20% higher accuracy than statistical methods, reported analytical insights, and designed a dashboard for easy processing and visualization of large-scale, high-dimensional data.
    • Confirmed hypothesis of land change due to seasonal variation and wrote Python scripts to automate data cleaning tasks (report) (code).
  • Summer Research Intern

    Scene Classification of Multispectral Satellite Images Using Convolutional Neural Networks (CNN)
    Indian Space Research Organisation, India
    Jun 2017 - Aug 2017

    The aim of this work is to show how a convolutional neural network can be applied to remote sensing multispectral images to achieve segmentation. This exploration uses National Remote Sensing Centre, India (NRSC), obtained remotely sensed data.

    • The ROIs (regions of interest) selected, comprise of four classes i.e. water bodies, land, forest, and snow
    • Unsupervied techniques like KMeans were used for segmentation but gave poor results.
    • Various design choices of the CNN architecture were analyzed. An accuracy of 92% for segmentation using labelled input data was achieved which shows that CNNs are a viable tool for solving segmentation tasks in the area of remote sensing (report).
  • Software Developer Intern

    Internet of Drones
    Indian Institute of Science, India
    Jun 2016 - Aug 2016

    The project involved developing android applications for remote sensing applicatons.

    • The application MapCam, was mainly used to enable control of a remote drone by a user and act as a sensor and data acquisition entity for aerial mapping by sensor data logging, capturing high resolution images using a smartphone, geotagging and appending camera calibration information to the images.
    • MapCam was developed as a cost efficient solution to incorporate sensor and image acquisition features to a drone, using Android Studio with Java.
    • Led a cross functional team of 3, which was responsible for complete lifecycle development of the application, from initial requirement gathering to design, coding, testing, documentation and implementation (presentation).

Projects

Educational Data Mining in Computer Science Education

The project involves developing a baseline model for the CSEDM 2019 Data Challenge. The goal of the project is to use previous students' programming process data to predict whether future students will succeed at a given programming task.

  • The task is to build a model that can predict, given a student's performance up until a given problem, whether that student will succeed at his first attempt at that problem.
  • The dataset containing records of students' attempts of a set of programming problems, including whether each attempt was correct or incorrect, and the code submitted, etc.
  • Analyzed coding behavior to find correlation between problem attempt order and performance using linear regression
Text classification using semi-supervised techniques

The project involves using semi supervised techniques for text classification.

  • Text classification of a massive volumne of data usually requires large amount of labeled data to train an model, which is not always available. Hence, to improve the accuracy, unlabeled documents can be used to augment the labeled data.
  • We explored self learning, expectation maximization and graph based methods with a supervised multinomial NB as a baseline model, and conclude that the self learning algorithm gives the best performance (report).

Skills

  • Languages : Python (expert), C (strong), Java (intermediate), SQL (intermediate), R (basic), HTML
  • Tools and Platforms : Numpy, SciPy, Scikit Learn, Pandas, Jupyter, Tensorflow, GraphX, Apache Spark, Keras, Git, OSI-layer
  • Platforms : Linux, Windows, XINU