CommonLounge is a community of learners who learn together. Get started with the featured resources above, ask questions and discuss related stuff with everyone.
1.
discussion
Exploring Vowpal Wabbit with the Avazu Clickthrough Prediction Challenge
In online advertising, click-through rate (CTR) is a very important metric for evaluating ad performance. As a result, click prediction systems are essential and widely used for sponsored search and real-time bidding.
For this competition, we have provided 11 days worth of Avazu data to build and test prediction models. Can you find a strategy that beats standard classification algorithms? The winning models from this competition will be released under an open-source license.
Data Fields
id: ad identifier
click: 0/1 for non-click/click
hour: format is YYMMDDHH, so 14091123 means 23:00 on Sept. 11, 2014 UTC.
In this hands-on assignment, we'll apply the NumPy python library to explore a dataset. The dataset we'll be using is a medical dataset with information about some patients on metrics like glucose, insulin levels, and other metrics related to diabetes. The assignment will serve two primary objectives - (a) practice NumPy on a realistic task, and (b) learn how to get a feel for a large dataset (also known as data cleaning and data exploration).
Dataset description
We'll be using the following dataset: diabetes.csv. Open the file in your favorite text editor and have a look.
The following are the column names: Pregnancies, Glucose, BloodPressure, SkinThickness, Insulin, BMI, DiabetesPedigreeFunction, Age, Outcome.
Quiz time!
The quiz will guide you through the rest of the assi...
One of the most prominent field of application of machine learning in sports, and a lot of people love sports statistics. It is an excellent domain for practicing data exploration and visualization. In fact, most machine learning work that perform well on sports data is 90% data exploration and 10% model building.
Cricsheet has a bunch of cricket data available for download.
We provide ball-by-ball data for Men’s and Women’s T...
Yes you should try multiple models, and try to reason which one is performing better on this particular dataset and why. This includes SVM, logistic regression, K nearest neighbors, etc
I encourage everyone to share the resources they find most useful for each olympiad, and feel free to start discussions, ask questions, invite more friends, etc.
I constructed the dataset using WordNet. The problem is that the dataset will have a lot of false negatives. For example, a camel is a herbivore, but that connection might be missing in WordNet.
I am attempting to make a ground based robot traverse from Point A to Point B through an Obstacle Course having fixed and moving obstacles (Imagine a marketplace or a railway station), with input from a camera feed (a group of CCTV cameras/ live feed from a drone over the area). What kind of ML algorithm will enable the system to plan the shortest path(s) for the ground robot to follow under varying circumstances (moving obstacles at certain speeds travelling randomly over the task plane)?
You'd could use something like A* search (which is an algorithm, nothing to do with machine learning). However, you could use reinforcement learning techniques in figuring out when to explore,...
Read more… (55 words)
Read more (55 words)
NaN.
discussion
Undergraduate Thesis Ideas
Hey guys I am new here! I scrolled through the content and I have to say I'm really excited to join this community!
I started learning machine learning through Prof. Andrew Ng. on Coursera last month and I'm really starting to enjoy it a lot. I'm thinking about doing my undergraduate thesis on machine learning next year but since I am fairly new to ML I don't really know where to start looking for ideas of anything I can do my thesis on as an undergraduate on ML.
If anyone can guide me through this or give me some ideas I would really appreciate it a lot!
A thesis requires a lot of work and thought. My advice is to find a way to make it fun and find a subject you are passionate about where you are able to use ML. Or if there is a problem you se...