Readit News logoReadit News
robz75 commented on Ask HN: Data engineers, What suck when working on exploratory data-related task?    · Posted by u/robz75
clejack · 2 months ago
The main issues for problems like this fall into 3 categories

- Things that prevent you from starting the job. Org silos, security, and permissions

- Things that prevent you from doing the job. This is primarily data cleaning.

- Things that make the job more difficult. This involves poor tooling, and you'll struggle to break the stranglehold that SQL and python-pandas have in this area. I'll also add plotting libraries to this. Many of them suck in a seemingly unavoidable way.

On the second and third points llms will most likely own these soon enough, though maybe there's room to build something small and local that's more efficient if the scope of the agent is reduced?

The first point is organizational generally, and it's very difficult to solve outside of integrating your system into an environment which is the strategy pursued by companies like snowflake and databricks.

robz75 · 2 months ago
What are the pain points your are facing with data cleaning? How do you handle it for now?
robz75 commented on Ask HN: Data engineers, What suck when working on exploratory data-related task?    · Posted by u/robz75
squircle · 2 months ago
Ah, well, rereading your original post I realize now this isn't necessarily painful for me. Perhaps though, the annoying aspect is seeing others use proprietary excel spreadsheets without a data lake. Conway's Law?

Does VS here mean Visual Studio? I would not call myself a data engineer, I just play one at work sometimes. Many hats, yknow?

robz75 · 2 months ago
"the annoying aspect is seeing others use proprietary excel spreadsheets without a data lake" => what's painful about that?

VS = compared to, versus

robz75 commented on Ask HN: Data engineers, What suck when working on exploratory data-related task?    · Posted by u/robz75
squircle · 2 months ago
Conversations and interviews > Jupyter notebook
robz75 · 2 months ago
Why? What's currently annoying about notebooks that you have to deal with compared to just directly going to users?
robz75 commented on Ask HN: Who is hiring? (July 2024)    · Posted by u/whoishiring
kgritesh · a year ago
I actually enjoy refactoring messy code ->and have worked and improved multiple code bases in my time. However, I don't like recording videos. Any other alternative to apply.
robz75 · a year ago
Thanks for your feedback, I can understand that it's not enoyable to record a video.

But for now we will keep these steps and this process since it's important for our recruitment process.

Deleted Comment

robz75 commented on Ask HN: Freelancer? Seeking freelancer? (June 2024)    · Posted by u/whoishiring
johnnyfived · a year ago
3-10 minute video is a crazy ask to put into the second page of the application, after already asking a large write-up in the first page. Don't have people invest their time and switch it up with something you know is off-putting and rely on sunken cost to get applicants. Bonus red flag points for no salary or hourly pay information.
robz75 · a year ago
Thanks for the feedback.

I can understand that it can feel like an investment to apply.

But for now those are important steps we need in our recruiting process.

Let me know if you have any other feedback :)

robz75 commented on Ask HN: Who is hiring? (June 2024)    · Posted by u/whoishiring
robz75 · a year ago
Why would this mean it's a lower tier company to you?
robz75 commented on Ask HN: Who is hiring? (June 2024)    · Posted by u/whoishiring
robz75 · a year ago
Thanks for your brutally honest feedback. What makes you think it's shit-tier company? How do you evaluate that?
robz75 commented on Ask HN: Who is hiring? (June 2024)    · Posted by u/whoishiring
facundo_olano · a year ago
I was curious about this position (the fact that you are upfront that you need to do a big refactor), but then the application form was very off-putting.

> Tell us about yourself in 3 KPIs, with a brief explanation and examples. (A KPI is a number that evaluate performance in a specific aspect.)

> Record and upload an unedited face-cam video (3 to 10 minutes) where you explain a problem you had this week and how you solved it on your own.

robz75 · a year ago
Thanks for the feedback. Indeed I don't want to get the wrong expectations & be as clear as possible. It will be a painful cleaning job at first (at least that's how refactoring is viewed by most).

And yes we do have a carefully crafted process. Those questions / requests are very meaningful for us.

u/robz75

KarmaCake day9June 11, 2024
About
Co-founder at https://evaboot.com/

Eager learner

View Original