Work In Progress Financial Data Science Part1 Cloud Infrastructure

In this article, I will demonstrate how to use AWS Step Function. AWS Step Function is a service that lets you orchestrate multiple AWS services in a workflow. I will use it to analyze financial data with some simple code examples. The code examples are written in AWS CDK, which is a tool that helps you create cloud resources with your preferred programming language.

Read More

Machine Learning Model Inference

Inference is the process of using machine learning model to predict the outcome of a given input record. Inference typically requires low latency to ensure a smooth customer experience. This post will outline a few options for optimizing inference operation.

Read More

In Demand Generative Ai Skills For The Future

The domain of Generative AI is advancing swiftly, leading to a surge in the need for adept professionals adept at steering through its intricate terrain. Pioneering firms are on the lookout for gifted contributors who can aid in crafting cutting-edge AI models. Let’s delve into the expertise that’s gaining prominence in this trailblazing field.

Read More

Fact checking NLP: Selecting the right evidence using BERT

In order to verify a claim, we can utilize a knowledge corpus like Wikipedia. Checking a claim generally involves three steps 1) relevant document retrieval from knowledge corpus, 2) relevant sentence retrieval from the documents, 3) identify whether the claim is supported by the evidence sentences.

Read More

My Picks From Neurips 2020

NeurIPS 2020 is virtual this year. As a result, not only the talks were virtual, but also the networking and poster sessions were held online. I got to experience gather.town for the first time. It felt like playing video games at times. I changed my avatar many times :D
All the keynotes had sign language interpretation. I thought it was cool!
Below are some of the talks that I enjoyed watching or reading.

Read More

Machine Learning For Autonomous Vehicle

Autonomous vehicle (AV) heavily utilizes machine learning for various tasks. Majority of these tasks are related to its perception. The perception module helps the vehicle sees the world. Recent advancements in deep learning have improved the perception for autonomous.

Read More

Math Behind Random Forest

Random forest is one of the widely used machine learning models for supervised learning task. It is robust to missing values in dataset as well as to outliers. It is an ensemble of many decision trees. Therefore, it achieves good accuracy in practice. In this post, I will present detail mathematics of how a Random forest works.

Read More

Parallel Computing For Python Workload

Modern day’s computer processor comes with multiple cores. Utilizing different cores often vastly reduces runtime of programs. This is helpful in the context where program manipulates large of amount of data. This tutorial will list out some ways to enable parallelization of Python code involving Pandas data frame.

Read More

Relational Machine Learning

(Colab Notebook for the blog post) Data in non-Euclidean space
In real world data often live in non-euclidean space. Examples include social networks, point clouds, etc. Such data contains topological information and are non-linear in nature. Typical machine learning models treat data point as independent to each other. In this post we will look at a model that exploits the inter-relationship of the data points and apply them to perform machine learning task such classification. First we look at some non-euclidean data

Read More

You're up and running!

Next you can update your site name, avatar and other options using the _config.yml file in the root of your repository (shown below).

Read More