Nikhil Agrawal

Senior Software Engineer, Samsung Research Institute India-Bangalore.
nikhilagrawal741@gmail.com

The best way to predict future is to create it..!

Welcome to my Website!

Education

Indian Institute of Tecnology, Kharagpur

Master of Technology - [M.Tech]
Computer Science Engineering

CGPA: 8.93

July 17 - May 19

Institute of Engineering and Tecnology, DAVV

Bachelour of Engineering - [B.E.]
Computer Science Engineering

Percentage: 77.33

August 13 - May 17

Projects

M.Tech Thesis:- Improving aspect based ranking in clinical trials

python-3

-Information Retrieval, Natural Language Processing
Clinical Trials are crucial for the practice of evidence-based medicine. It provides updated and important health-related information for the patients. Different stakeholders like trial volunteers, trial investigators, and meta-analyses researchers often need to search for trials having similar eligibility features. Clinical trials are the first source of information available to users in terms of new drugs and new treatments for the disease. Apart from clinical trials, there are also other sources of information eg: PubMed, adverse events that can be leveraged to provide ranked results of clinical trials. In this research work, we develop an automated method that can be applied across all classes of disease to retrieve relevant trials on the basis of UMLS concepts overlapped between the disease information provided by the user as a query and clinical trials. We propose and compare the different relevancy based approaches and found out that the inclusion of synonyms words provided better results in terms of precision value. We also ranked the retrieved clinical trials on different aspects like relevancy, adversity, recency, and popularity. Also, we found out that there’s a high negative Spearman rank’s correlation coefficient existing between popularity and recency which is expected as the paper published earlier has high chances of more citations. Also at the end, we provided the possible limitations of the methods.

May 18 - May 19

Finding the influential users in the Twitter network

Python-2

-Complex Network
The goal of the project was to use temporal pattern of retweets combined with structural information of the network to identify the best set of influential users that can be targeted for viral diffusion in the Twitter network.

Jan 18 - Mar 18

Automatic Concept Map Generation from Text-based Learning Material

Python-3, Solr, StanfordCoreNLP, PyGraphviz

-Natural Language Processing, Information Retrieval
Generated a Concept Map from a document by first converting text to simple language, identify important entities, finding the weighted relationship between entities, and then finally obtain a visual representation of the relations between entities.

Apr 18 - Jun 18

Apparel Recommendation

Sckit-Learn, Tensorflow

-Deep Learning, Machine Learning, Natural Language Processing
Content-based recommendation of apparels by calculation of weighted score (syntactic, semantic, and image similarity) between products obtained from Amazon real-world dataset.

May 18 - Jul 18

Skills

Programming Languages & Tools
Subjects
  • Machine Learning
  • Natural Language Processing
  • Information Retrieval
  • Deep Learning
  • Complex Network

Publications

Towards an Aspect-based Ranking Model for Clinical Trial Search

python-3
Clinical Trials are crucial for the practice of evidence-based medicine. It provides updated and important health-related information for the patients. Different stakeholders like trial volunteers, trial investigators, and meta-analyses researchers often need to search for trials having similar eligibility features. Clinical trials are the first source of information available to users in terms of new drugs and new treatments for the disease. Apart from clinical trials, there are also other sources of information eg: PubMed, adverse events that can be leveraged to provide ranked results of clinical trials. In this research work, we develop an automated method that can be applied across all classes of disease to retrieve relevant trials on the basis of UMLS concepts overlapped between the disease information provided by the user as a query and clinical trials. We propose and compare the different relevancy based approaches and found out that the inclusion of synonyms words provided better results in terms of precision value. We also ranked the retrieved clinical trials on different aspects like relevancy, adversity, recency, and popularity. Also, we found out that there’s a high negative Spearman rank’s correlation coefficient existing between popularity and recency which is expected as the paper published earlier has high chances of more citations. Also at the end, we provided the possible limitations of the methods.

May 18 - May 19

Resume