OisinMoran · 7 years ago
I think there are some flaws in this analysis that can be confirmed when the code is released.

I think the reason Scala seems surprisingly popular is that it was simple string matching, so a post that said "We are looking for someone who can build scalable infrastructure" would count as one mention of "scala".

Could be wrong though.

Also see: https://news.ycombinator.com/reply?id=18725655&goto=item%3Fi...
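
A minimal sketch of the suspected failure mode, assuming the analysis counted a skill whenever its name appeared anywhere in the post text. The function names are hypothetical and not taken from the author's code:

```python
import re

post = "We are looking for someone who can build scalable infrastructure"

def mentions_substring(text: str, term: str) -> bool:
    # Naive check: true whenever the term appears anywhere, even inside a longer word.
    return term.lower() in text.lower()

def mentions_word(text: str, term: str) -> bool:
    # Whole-word check: \b requires a word boundary on both sides of the term.
    return re.search(rf"\b{re.escape(term)}\b", text, re.IGNORECASE) is not None

print(mentions_substring(post, "scala"))  # True: false positive from "scalable"
print(mentions_word(post, "scala"))       # False
```

The only difference between the two checks is the pair of `\b` anchors, which is why a bug like this is easy to introduce and easy to miss.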

AznHisoka · 7 years ago
This is why 90% of data science is the messy, grunt work of just simply cleaning/normalizing the data. The analysis is the easy part.
screye · 7 years ago
> grunt work

Honestly, it is an insult to call good data preprocessing skills grunt work.

It may not be as attractive, but it requires a lot of understanding of the nature of the data and the way in which patterns present themselves.

rhizome · 7 years ago
I worked for a company, which is still in business, 80% of whose work involves "cleaning/normalizing" data, basically everything that isn't sales and client services.

Market research is all about getting different kinds of data and massaging it into queryable and productizable form (reports).

z3t4 · 7 years ago
Often it can be faster to enter data manually than to make a script collect it automatically with the same accuracy.
chefandy · 7 years ago
VaLUAble. tRUST.

I would be pretty surprised to see these two so much more commonly requested than Javascript.

mchaver · 7 years ago
Probably the same for rust, looking at the October 2018 data set with https://kennytilton.github.io/whoishiring/ rust gives 50 results, trust gives 21, \brust gives 26. Similar results for scala.
JshWright · 7 years ago
Yeah, similarly with "Excel". Are there really that many posts looking for people with deep spreadsheet experience? Seems more likely that it's a lot of false positives for "excellent" and "excel at".
osoba · 7 years ago
Excel is probably mislabelled, it should say HTML instead. Look at the list above
fredbo22 · 7 years ago
Yup. JavaScript, Node.js, Node, npm, React, Angular, Vue, Express and a few more words mean pretty much the same thing and are fishing for the same pool of developers. (Yes, I know there's a difference between Angular and React.)
jenscow · 7 years ago
> Yes, I know there's a difference between Angular and React.

When can you start?

adrianmonk · 7 years ago
I've noticed that in the hiring threads, a particular company is always hiring, and the email contact username is "austin". It would be easy to pick this up as a location even though it's a person's name.

This analysis has Austin as one of the top 10 locations, and I'm curious whether or not that is skewing the numbers. There is a steady stream of jobs actually in Austin, so either seems possible.
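
One way to guard against this, sketched under the assumption that locations are matched against the raw post text: drop email addresses before looking for city names. The email pattern is deliberately rough, and `strip_emails`, `mentions_city` and the sample post are made up for illustration:

```python
import re

# Rough email pattern; good enough for a sketch, not a full address parser.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")

def strip_emails(text: str) -> str:
    # Replace each address with a space so neighbouring words do not merge.
    return EMAIL.sub(" ", text)

def mentions_city(text: str, city: str) -> bool:
    cleaned = strip_emails(text)
    return re.search(rf"\b{re.escape(city)}\b", cleaned, re.IGNORECASE) is not None

post = "Acme | Backend Engineer | NYC | Apply: austin@acme.example"
print(mentions_city(post, "Austin"))  # False once the address is removed
```

Without the stripping step a word-boundary match would still fire here, because `@` counts as a non-word character and so "austin" looks like a standalone word.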

NorthOf33rd · 7 years ago
Same with product manager. Many descriptions list working with product management as a requirement. There have not been 425 product management jobs listed this year. Trust me, I've applied to most of them.
lettergram · 7 years ago
You’re very likely correct. See the compare by volume:

https://hnprofile.com/compare?search=scala,python,rust

Not the only issue I see, for instance my company posted for “machine learning engineers”, yet that isn’t in the results at all.

I think they are doing regex search in the strings, as opposed to identifying the words and doing comparisons

fulafel · 7 years ago
Here's a link with more languages in case anyone else got curious: https://hnprofile.com/compare?search=scala,python,rust,cloju...
ashelmire · 7 years ago
Gotta add some word boundaries to that regex!
NicoJuicy · 7 years ago
I wonder what's included for the regex of .net. Eg: asp.net core, dotnet, c#, VB.net, f#, .net, xamarin,...

Because there are a multitude of possibilities.

emmanueloga_ · 7 years ago
Agreed. It looks like the author may be looking for something like a /.net/ regexp? I don't think .net is popular at all in HN job postings. A better regex would be pretty much what you suggest:

    /asp\.net|dotnet|c#|VB\.net|f#|xamarin/i
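
For anyone trying this in Python, here is a hedged sketch of the same idea. It assumes `\b` is not reliable for these terms, because `#` and a leading `.` are non-word characters, so it uses lookarounds meaning "not next to a word character" instead. The alias list is just the one floated in this thread, plus the bare `.net`:

```python
import re

DOTNET_ALIASES = ["asp.net", "dotnet", "c#", "vb.net", "f#", ".net", "xamarin"]

alternation = "|".join(re.escape(alias) for alias in DOTNET_ALIASES)

# (?<!\w) and (?!\w): the alias must not touch a word character on either side,
# so "example.net" (a domain) is rejected while " .NET " is accepted.
DOTNET = re.compile(rf"(?<!\w)(?:{alternation})(?!\w)", re.IGNORECASE)

def mentions_dotnet(text: str) -> bool:
    return DOTNET.search(text) is not None

print(mentions_dotnet("Stack: ASP.NET Core, SQL Server"))  # True
print(mentions_dotnet("Apply at careers.example.net"))     # False
```

Whether C# or Xamarin should count toward ".net" at all is a judgment call about the categories, not something the regex can settle.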

danieka · 7 years ago
It appears that the same problem occurs with locations. For example it would appear that the small town of Visby (Sweden) which is 1/100 of the size of Stockholm has 1/10 the jobs of Stockholm. Or more likely the location matches the company name Visby of startup visby.io.
TheAdamist · 7 years ago
Pittsburgh (120) seems to be beating Pittsburg (6) as well, so there's a lot of data cleanup to be done.
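
A small sketch of one cleanup approach, using the standard library's `difflib` to fold near-miss spellings into a canonical name. The city list and the 0.9 cutoff are assumptions chosen for illustration:

```python
import difflib

# Hypothetical canonical list; a real one would come from a gazetteer.
KNOWN_CITIES = ["Pittsburgh", "Portland", "Philadelphia", "San Francisco"]

def canonical_city(name: str, cutoff: float = 0.9) -> str:
    # get_close_matches returns the best candidates whose similarity is >= cutoff.
    matches = difflib.get_close_matches(name, KNOWN_CITIES, n=1, cutoff=cutoff)
    # Fall back to the original spelling when nothing is close enough.
    return matches[0] if matches else name

print(canonical_city("Pittsburg"))  # Pittsburgh
```

A blind merge can be wrong in the other direction: Pittsburg, California is a real city, so some of those 6 may be legitimate.
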
rhizome · 7 years ago
I think there's also a bug where the numbers don't account for "among those who mentioned [category]."
alehul · 7 years ago
Would there be an issue with just changing the string to be matched to "scala " or " rust"?
meritt · 7 years ago
There are numerous ways to fix the glaring flaws in this analysis but a very simple approach would be using regex word boundaries, that way it'll work not only for the space but for commas, periods, end-of-line, etc.
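
To make the difference concrete, a quick sketch comparing the padded-string idea with a word-boundary regex. The sample strings are made up:

```python
import re

def padded_match(text: str, needle: str) -> bool:
    # Plain substring test; the needle carries its own padding space, e.g. "scala ".
    return needle in text.lower()

def boundary_match(text: str, term: str) -> bool:
    # \b matches at any word/non-word transition, including start and end of text.
    return re.search(rf"\b{re.escape(term)}\b", text, re.IGNORECASE) is not None

samples = ["Backend in Scala, Kotlin", "We use Scala", "Scala and Akka", "scalable systems"]
for text in samples:
    print(f"{text!r}: padded={padded_match(text, 'scala ')} boundary={boundary_match(text, 'scala')}")
```

The padded version misses "Scala," and a line-ending "Scala", and a leading-space needle like " rust" would still match "a rusty codebase". The boundary version handles all of these.
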
jgtrosh · 7 years ago
The correct method is probably something like "\<$lang\>"
PhasmaFelis · 7 years ago
For the record, the original post has been edited: your suspicion was correct, and they've fixed it now.
vthallam · 7 years ago
OP here. As some people were commenting, Scala and excel I guess have some false positives which reflect the unusual popularity. I did try to limit the search to exact word, but think that I overlooked that part. I will try to fix the code and update the skills chart soon. Thanks for pointing it out.

Edit: I fixed the false positives using a word boundary on regex "\\bword\\b". The data now seems a little saner; I will keep looking for any false positives. Updated the data and regenerated the chart using the new data.

ticmasta · 7 years ago
I'd also expect some tough to catch exceptions for remote, as that's something I always look at and despite the posting guidelines, posters can't stop themselves from adding things like "sorry, no remote available at this time" or "part-time remote might be an option for the right person in some roles".

I get that you're trying to be complete, but if it's not a truly remote position, it's not remote. Put nothing about remote in the position or at the very most "REMOTE: NO". Working from home 10% of the time but needing to be in a SF office the rest of the time is not a remote position; it's the flexibility I'd expect from most jobs in this industry.

travisjungroth · 7 years ago
Try searching craigslist for a shared apartment for a couple.

"Sorry, no couples!"

lozenge · 7 years ago
I'm guessing they get a lot of speculative remote applications, so are trying to prevent that.
wooly_bully · 7 years ago
You've got some citizens of Null Island in your data!

https://en.wikipedia.org/wiki/Null_Island
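
A tiny sketch of the usual guard, assuming failed lookups end up as either a missing value or the exact point (0, 0). The helper name and the sample coordinates are illustrative only:

```python
from typing import Iterable, List, Optional, Tuple

Coord = Tuple[float, float]  # (latitude, longitude)

def drop_null_island(coords: Iterable[Optional[Coord]]) -> List[Coord]:
    # Keep only real coordinates: skip missing values and the exact (0, 0) point.
    return [c for c in coords if c is not None and c != (0.0, 0.0)]

points = [(37.7749, -122.4194), (0.0, 0.0), None, (40.7128, -74.0060)]
print(drop_null_island(points))
```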

notyourloops · 7 years ago
It's really cool, but it's not going to be useful until you correct these errors. You might want to provide a note to readers in the meantime.
vthallam · 7 years ago
Thanks. I added a note at the skills section about the false positives. Really should have paid attention, but I guess I was just excited to show it off, first :(. I am on it, fixing the regex.
hocuspocus · 7 years ago
There seem to be errors in geo-locations as well. No results for Zurich while it's the biggest hub in Switzerland. A suspiciously high number in Venice, Italy (I think it's safe to assume California here).
vthallam · 7 years ago
I completely relied on the python package "GeoText" for locations. I see Geneva and Lausanne from Switzerland; I will have to double check if the package missed it or there were not many posts on the hiring thread. Thanks for pointing it out.
dvaughan · 7 years ago
I don't know how best to handle "go" as well as "golang" without false positives.
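
One imperfect heuristic, sketched here as an assumption rather than a solution: accept "golang" in any casing, but accept "Go" only when capitalised and standing alone as a word.

```python
import re

GOLANG = re.compile(r"\bgolang\b", re.IGNORECASE)
# Case-sensitive on purpose: only the capitalised form counts as the language name.
GO_NAME = re.compile(r"\bGo\b")

def mentions_go(text: str) -> bool:
    return bool(GOLANG.search(text) or GO_NAME.search(text))

print(mentions_go("Stack: Go, Postgres, Kubernetes"))           # True
print(mentions_go("We want people who go the extra mile"))      # False
print(mentions_go("Go to our careers page for more details"))   # True: still a false positive
```

Sentence-initial "Go" and phrases like "Go-to-market" still slip through, so this reduces the noise without removing it.
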
onlyrealcuzzo · 7 years ago
Maybe it's in Facebook's word2vec?
dmode · 7 years ago
OP, thanks for the analysis. Really appreciate it. Quick question - Does San Francisco include abbreviations like "SF" or "San Fran" ? Does it also include jobs in the Valley towns like Palo Alto and San Mateo ?
mygo · 7 years ago
"We do not accept REMOTE applicants" would still be a false positive.

And same would go for visa. "We do not sponsor VISA"

I really think you're going to have to do some sort of sentiment analysis, either via human or machine.
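
Short of real sentiment analysis, a crude negation-cue check already catches the two examples above. This is only a sketch: the cue words, the 30-character window and the function name are all assumptions, and it will miss many phrasings.

```python
import re

# A negation cue shortly before the term, within the same clause.
NEGATION_BEFORE = re.compile(r"\b(?:no|not|non|never)\b[^.;|]{0,30}$", re.IGNORECASE)
# A negation right after the term, as in "REMOTE: NO".
NEGATION_AFTER = re.compile(r"\s*[:=-]?\s*(?:no|not)\b", re.IGNORECASE)

def is_affirmed(post: str, term: str) -> bool:
    pattern = re.compile(rf"\b{re.escape(term)}\b", re.IGNORECASE)
    for m in pattern.finditer(post):
        negated = (
            NEGATION_BEFORE.search(post[:m.start()])
            or NEGATION_AFTER.match(post, m.end())
        )
        if not negated:
            return True  # at least one mention is not negated
    return False

print(is_affirmed("We do not accept REMOTE applicants", "remote"))  # False
print(is_affirmed("We do not sponsor VISA", "visa"))                # False
print(is_affirmed("Acme | Senior Engineer | REMOTE", "remote"))     # True
```

Hedged offers such as "part-time remote might be an option" still count as remote here, which is where a human pass or a proper classifier would be needed.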

segmondy · 7 years ago
Edit 2 is still broken. I find it hard to believe there are fewer skills than the total number of jobs or positions posted.

dabernathy89 · 7 years ago
While you're at it, you've got `vuejs` and `vue.js` listed separately :)
bcherny · 7 years ago
Same for “react” and “reactJS”.
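
A sketch of how the variants could be folded together before counting, assuming a hand-maintained alias table. The table contents and `count_skills` are hypothetical:

```python
import re
from collections import Counter

# Hypothetical alias table: every variant maps to one canonical label.
ALIASES = {
    "vue": "Vue", "vuejs": "Vue", "vue.js": "Vue",
    "react": "React", "reactjs": "React", "react.js": "React",
}

# Longest variants first so "vue.js" is matched whole rather than as bare "vue".
_variants = sorted(ALIASES, key=len, reverse=True)
SKILL = re.compile(
    r"(?<!\w)(?:" + "|".join(re.escape(v) for v in _variants) + r")(?!\w)",
    re.IGNORECASE,
)

def count_skills(posts):
    counts = Counter()
    for post in posts:
        # A set, so each post contributes at most one mention per skill.
        found = {ALIASES[m.group(0).lower()] for m in SKILL.finditer(post)}
        counts.update(found)
    return counts

posts = ["Frontend in Vue.js and ReactJS", "We use vuejs", "react, react.js, React"]
print(count_skills(posts))
```

Counting once per post rather than once per occurrence keeps a post that repeats "React" five times from outweighing five separate posts.
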
dotdi · 7 years ago
Take that all you people crying "Scala is dead" after Java 8+ comes out. /s

Anyway, I've personally seen (and interviewed for) jobs that were advertising Scala (or plans to move to Scala, or that Scala experience is a plus) but actually meant there was one proof-of-concept project in Scala, 18 months ago, nobody ever touches it nowadays, the person who wrote it is long gone and by the way, our legacy JDK5 application is on fire, do you mind fixing that for the rest of your life?

markhalonen · 7 years ago
I suspect there are false positives from "scalability" and "scalable"...
dotdi · 7 years ago
I hope OP took that into account. Seems like pretty low hanging fruit when cleaning your data.
Cthulhu_ · 7 years ago
Oh for sure; I'd take any job listing with something cool with a huge grain of salt; hardly any proper software development employer actually uses hip technology, but they will use hip terms to try and draw in developers.

Hype-driven development is a problem, and hype-driven recruitment another one. Just be a good java developer. Maybe push for Kotlin slowly. Don't pursue Scala for the language's sake.

See also: blockchain, IoT, etc. I've seen a job advert that basically bolted those onto the not well hidden "we're just looking for decent java developers" job description.

mountaineer · 7 years ago
Scala's in good shape, but not quite so popular as the posted analysis presents, based on my findings of the same data:

https://www.hntrends.com/2018/nov-react-still-top-containers...

mountaineer · 7 years ago
Cool to see that other folks find this data interesting. I've been analyzing languages/frameworks/skills in the "Who is Hiring" posts for a few years (https://www.hntrends.com/) as well.

I know it's just one part of the analysis, but the skills list appears to be pretty far off of what I've been seeing. React gets over 200 a month by itself. Are you capturing all pages of the postings in each month's thread? Here's the data (counts) I have through November, broken down by month and term - https://www.hntrends.com/data/data-20181101.js

eddd · 7 years ago
The fact that Rust and Python are quite high in the rankings, makes me hopeful about the future of programming again. The fact that remote work is not growing is a bit concerning, but if people from the valley want to overpay - it is none of my business. I hope eventually the market will correct it.
ptd · 7 years ago
If you don’t mind sharing, why do you find the lack of growth in remote work concerning?
TarpitCarnivore · 7 years ago
Not OC, but for me it's just a real wonder why it's not better embraced at this stage of the game. We have all the tools to accommodate collaboration within remote teams and in (most) places the broadband to handle it. Add to this the continued funneling of companies into these metro areas where COL is high (NYC, SF, Seattle) and thus people may find themselves being forced into higher commute times just to attain a better COL situation.

I personally am commuting close to 2 hours each way, so 4 hours total, because the job market is much stronger in NYC than in my immediate (30-45 minute) area. If a job is open around here the salaries are almost 30-40% lower than NYC despite our COL still being high.

apercu · 7 years ago
Not OP but due to a lack of investment in public transportation in NA over the previous decades, many people spend a few hours a day commuting and in many cases this is done in private vehicles that pollute. For many in our profession there is no need to go to an office to work.
moduspol · 7 years ago
Not OP, but I'd like the world to be moving toward a paradigm where physical location is not a significant factor in career / pay / advancement. It makes things more "meritocratic," and it puts pressure on some of these big tech hubs to keep their costs of living competitive.

But I don't want to see it if the market doesn't justify it. I think today there is something beneficial to having people working together in an office, but I'd prefer to be proven wrong by some new remote-work management style (or something). And software development is one of the more ideal use cases for remote work, so if it's not expanding for us, it's less likely to expand in other industries.

souprock · 7 years ago
Rust I can see.

Python is deeply troubling. It is a regression from FORTRAN and COBOL. Long ago, we invented compile-time type checking. The benefits for software quality have been enormous. There isn't really a downside here, as there would be with the performance loss of garbage collection or bounds checking. Shaking out lots of bugs before even attempting to test the software is a wonderful advance that we made half a century ago. Python's incompatibility with compile-time optimization is also horrifying. The situation is so extreme that you can't even make a decent-performing JIT.

luord · 7 years ago
You're awfully unfamiliar with the history of computer science if you truly believe that dynamic typing is some sort of recent invention or that static typing is some panacea that solves all of your problems.

Both systems of type checking have existed as long as... Programming languages have existed, essentially and, sadly, so have the endless comparisons and flamewars.

shaklee3 · 7 years ago
Python has strong typing as of 3.6, although not at compile-time, since it doesn't compile.
lsiunsuex · 7 years ago
So the next report / analysis needs to be:

Of everyone who asked for work or applied for work via these posts - how many secured a position?

I've long given up on these posts trying to find freelance work or other permanent work. There's too much competition (here) to stand out from the flood of replies.

jedberg · 7 years ago
Anecdotally, the one time I posted a job, I had about 80 replies for a remote work junior engineering job where I specified the salary at $60k/yr in the post (this was in 2015). Of those 80, about 10 were promising and I probably would have made an offer to at least one or two if we hadn't shifted priorities and stopped the hiring process.
Domark · 7 years ago
I applied to matching jobs for three months about five years ago. Also did the same earlier this year for three months.

My advice: skip this site and weworkremotely.com. Both are complete wastes of time based on the responses.

If you’re in IT, your best bet is LinkedIn (for a referral) or Careerbuilder/Dice/Indeed. At least those will result in face to face interviews.

Also find companies you like and apply directly on their website.

vanrysss · 7 years ago
Check your map, it looks like you may be matching "Remote, or XXX" with the town of Remote, Or (Oregon).
sytelus · 7 years ago
Some highlights I saw -

* SF is the center of the tech world. The next biggest is just around half of its size (notwithstanding HN bias).

* Approx. 25% of jobs allow visa sponsorship

* HTML, Python, .Net overwhelmingly dominate everything else.

* Reduced interest in databases, SQL, Obj-C, Java

* No TensorFlow or PyTorch in demand

* 90% of jobs in development/software engineering, 10% in management and misc.

dmode · 7 years ago
It probably understates the SF market, TBH, since some of the largest employers in SV that hire en masse (Google, Facebook, Apple, Salesforce, Uber, etc.) rarely post jobs on HN. Also, Google, FB, Uber, Airbnb and others have a boot camp system, where job postings do not correlate to hiring since one posting is used to hire an entire bootcamp class.
patrickfreed · 7 years ago
>SF is center of the tech world. Next biggest is not even half of its size.

The article lists SF as having 2922 jobs and New York 1746 jobs
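
Working the ratio out from those two figures (a quick check, nothing more):

```python
sf_jobs, ny_jobs = 2922, 1746

ratio = ny_jobs / sf_jobs
print(f"New York has {ratio:.1%} of SF's job count")  # 59.8%
```

So on the article's own numbers the runner-up is closer to three fifths of SF than to less than half.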