Some time ago, I tried to scrape every Bay Area profile off LinkedIn until the site blocked my entire office network (lesson learned: use a proxy). This was Bad because we were (and still are!) hiring.
The goal was to collect enough data to create a set of classifiers that could estimate a person’s salary from their LinkedIn profile.
I scraped all the Bay Area salary information from GlassDoor. LinkedIn profiles were then decomposed using Latent Semantic Indexing and mapped to salary estimates based on users’ current job titles.
Now when we encounter a new profile, we can perform a similarity query, find the nearest matching profiles, and return their salaries.
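A minimal sketch of that idea, assuming a toy corpus and plain NumPy rather than the TensorFlow code behind the actual service. The profile texts, salary figures, the number of latent dimensions, and the helper names (`build_vocab`, `to_counts`, `estimate_salary`) are all made up for illustration. Averaging the neighbors' salaries is also an assumption; the post only says the nearest profiles' salaries are returned.

```python
import numpy as np

# Hypothetical toy data: profile text and a salary already mapped from job title.
profiles = [
    "software engineer python machine learning",
    "senior software engineer java backend",
    "data scientist python statistics machine learning",
    "product manager roadmap strategy",
    "marketing manager brand strategy",
]
salaries = np.array([130000, 150000, 140000, 135000, 110000], dtype=float)

def build_vocab(docs):
    """Map each distinct word to a column index."""
    words = sorted({w for d in docs for w in d.split()})
    return {w: i for i, w in enumerate(words)}

def to_counts(doc, vocab):
    """Bag-of-words count vector; words outside the vocabulary are ignored."""
    vec = np.zeros(len(vocab))
    for w in doc.split():
        if w in vocab:
            vec[vocab[w]] += 1.0
    return vec

vocab = build_vocab(profiles)
A = np.array([to_counts(d, vocab) for d in profiles])  # profiles x terms

# LSI step: truncated SVD of the profile-term matrix, keeping k latent topics.
k = 2
_, _, Vt = np.linalg.svd(A, full_matrices=False)
Vk = Vt[:k].T            # terms x k projection into the latent space
doc_latent = A @ Vk      # known profiles in the latent space

def estimate_salary(new_profile, n_neighbors=2):
    """Project a new profile, rank known profiles by cosine similarity,
    and average the salaries of the nearest ones."""
    q = to_counts(new_profile, vocab) @ Vk
    norms = np.linalg.norm(doc_latent, axis=1) * np.linalg.norm(q)
    sims = (doc_latent @ q) / np.where(norms == 0, 1.0, norms)
    nearest = np.argsort(-sims)[:n_neighbors]
    return float(salaries[nearest].mean()), nearest.tolist()

estimate, neighbors = estimate_salary("python machine learning engineer")
print(estimate, neighbors)
```

A real profile corpus would need a much larger vocabulary, some term weighting such as TF-IDF, and far more than two latent dimensions.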
Previously this was all done using Python libraries, which made it too slow for public consumption. I finally got around to rewriting it all using Google’s TensorFlow libraries. The only remaining speed bump is the roundabout way I pull a user’s LinkedIn profile.
Here it is, go play with it.
I’ll write more about TensorFlow some other day, but for now I need to spend less time on this and more time on stuff that won’t get me fired.
Many thanks to Aronima, TingTing, and Wenjie. GlassBowl would not have happened without them.
9 thoughts on “A Machine Learning Model for Salary Estimation”
Hello, unfortunately this service appears to be down at the moment. Thank you.
Although the page isn’t loading for me right now (Hacker News effect?), assuming your page works the way I think it does/should, I must say that this is a VERY clever honeypot for collecting warm lead LinkedIn profiles. Can’t fool me! 😀
Is glassbowl down?
Try again… the server gets overloaded because I didn’t think to build it in a scalable manner.
Yep! Now I am able to access it! I was looking for more details about the models you were using, though 🙂
Is it just similar to document clustering?
It’s a type of soft clustering, yes 🙂
Still can’t get this to work… My profile is http://www.linkedin.com/in/kelly-nicholes-ab096122. The error message tells me to use www and not jp.linkedin.com. I’m not using jp.linkedin.com.
Ah. Make sure you use https://www.linkedin.com/in/kelly-nicholes-ab096122
I should put in a patch for that, thanks for bringing it to my attention.