Hacker Newsnew | past | comments | ask | show | jobs | submit | dserban's commentslogin

These roles are North America only, based on your careers page. Please consider adding a caveat to the "Remote" designation.

  Location: Remote / Worldwide
  Remote: Yes
  Willing to relocate: Yes
  Technologies: Kafka, Flink, Spark, Cassandra, Scala, Rust, Python, Zookeeper, Databricks, BigQuery, Redshift, Hive, Kinesis, Airbyte, Airflow, DBT, Aerospike, Snowflake, Qdrant, PrestoSQL, Trino, Clickhouse, SQL, Golang, gRPC, Terraform, CUDA
  Resume: https://drive.google.com/file/d/1lZuSpneLDVzzYoeaA8or-m5m2dfYAMB4/view
  Email: in the profile
[Remote Worldwide] I'm pursuing a role in the high-scalability distributed systems space. I'm a well-rounded Scala/Rust/Python dev, well-versed in data engineering, with deep knowledge of the internals of distributed datastores. I have experience with data modeling for high throughput database activity and a strong understanding of which workloads and data access patterns are scalable and what datastore the data should reside in. I have a highly confident ability to lead a data / MLops project from start to finish. Hire me.

Core Skills:

● Cassandra (Data Modeling, Troubleshooting Performance And Operational Issues)

● Apache Iceberg (Scaling, Tuning, Self-Hosted Setup)

● Stream Processing At Scale: Kafka, Flink, Spark Streaming, Storm

● Custom-Crafted Contextualized Embeddings, Vector-Based Semantic Search, Deep Intent Recognition In Search Engine Queries

● Languages: Scala, Rust, Python, SQL (proficient), Golang (ramping up)

Educational Background: Computer Science.

Solid experience working remotely and working with teams that are distributed geographically. I typically work Pacific Time hours.

Ask: $325K base, inflation-adjusted twice a year, 0.005% equity.


SEEKING WORK, Senior Data Engineer, Remote / Worldwide

Well-rounded Scala/Rust/Python dev, well-versed in data engineering, with deep knowledge of the internals of distributed datastores. I have experience with data modeling for high throughput database activity and a strong understanding of which workloads and data access patterns are scalable and what datastore the data should reside in. I have a highly confident ability to lead a data / MLops project from start to finish. Hire me.

Core Skills:

● Cassandra (Data Modeling, Troubleshooting Performance And Operational Issues)

● Apache Iceberg (Scaling, Tuning, Self-Hosted Setup)

● Stream Processing At Scale: Kafka, Flink, Spark Streaming, Storm

● Custom-Crafted TF-IDF / Embeddings, Vector-Based Semantic Search, Deep Intent Recognition In Search Engine Queries

● Languages: Scala, Rust, Python, SQL (proficient), Golang (ramping up)

Other Skills: Airflow, Aerospike, DBT, Snowflake, Qdrant, Databricks/DeltaLake, BigQuery, Redshift, Hive, Kinesis, PrestoSQL/Trino, ClickHouse, gRPC, Airbyte, Terraform, CUDA, Reverse-Engineering Search Engine Technologies.

Educational Background: Computer Science.

Solid experience working remotely and working with teams that are distributed geographically. I typically work Pacific Time hours.

E-mail address in the profile.


I'm not sure if I should be applying but before I do, I'm wondering what kind of novel or interesting data driven algorithms you refer to here. Are you referring to like adaptive query planners based on runtime statistics? Real-time detection of query latency spikes? Automated creation of secondary indexes based on usage patterns? Statistical smarts to decide what to cache and when? These would be sort of runofthemill for any modern database product, can you clarify if you are looking for any of these?


Don't rely on gut feeling or anecdotal evidence, there's good data you can look at.

Over a few years, I came to rely on proxy indicators that I found to be reliably in sync with the strength / weakness of the job market for software engineers.

One of these indicators is the business spending index (it has an official name, something about purchasing managers' sentiment index or similar). CNBC tends to show it a lot these days, you can't miss it.

Another one is "US Auto Loans Delinquent by 90 or More Days (I:USALD90)" which is a very good "finger to the wind" for how the overall economy is doing, since everyone needs a car.

Neither of these indicators are looking good these days.


Lovely suggestion, I'll have a look at the USALD90 ticker, I'm in the EU, so I also use the PMI but "Eurozone" Manufacturing Purchasing Managers Index (PMI), anyone can have a look here and get a basic idea - https://tradingeconomics.com/euro-area/manufacturing-pmi, just go to maximum time window.


I use it to extract either vocabulary or grammar patterns from snippets of text written in a foreign language I'm learning. I double-check all output, just as you do.


Looking at this, it's not long until OpenAI starts offering a version of this to ecommerce companies for their search backends, it's a space that is held back by corporate corruption, and it's ripe for disruption.


I'm still working through legal complications with my current employer, and in no mood to start my own company. The job market is also bad on the dataeng / datainfra / mlops side of things. My first guess is that interest rates still have to come down some. Giving up on software engineering and trying a different line of work is not an option for me, I was born with a passion for this.


> Is your company like this? If you have real info and not just suspicions, let’s name some names.

A certain company (the name starts with "A") is widely known for doing this, you might guess which company I refer to by deep-diving into my comments history.

It looks exactly like in this YT short: https://www.youtube.com/shorts/V5VAN6ldS9o


  > prefer Craigslist
Seconded. Upwork and Toptal aren't what they used to be.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: