
Cool, can you share your setup for Python and your current IDE?

Yes, sure: dev containers inside each project. That way the entire environment (debugger, all IDE plugins for linting, etc.) is standard across all devs, and the coding environment matches prod exactly.

IDE is Cursor.


https://github.com/MedUnes/go-kata-solutions suggests they intended to create the solutions too, but there doesn't seem to be any progress yet.


One of the things that made me think twice about self-hosting Postgres is securing the OS I host PG on. Any recommendations on where to start with that?


Can you get away without exposing it to the internet? Firewall it off altogether, or open it only to the address of a specific machine that needs access to it?


I tried this before, but since I often need to open a different browser even when a link comes from the same app, I ended up moving to https://github.com/will-stone/browserosaurus

Not to say you can't use both, though.


I've tried Airbyte, Sling, and dlt (besides building several tools from scratch).

My best bet for now would be dlt if you have a dedicated DE team, but Sling will get you a long way for moving data around your warehouse.
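
For anyone curious, this is roughly the shape of a minimal dlt pipeline (the resource, destination, and names below are made up just for illustration; dlt supports many destinations besides duckdb):

    import dlt

    # Any generator of dicts can act as a resource; this one is hypothetical.
    @dlt.resource(table_name="orders", primary_key="id", write_disposition="merge")
    def orders():
        yield {"id": 1, "status": "shipped"}
        yield {"id": 2, "status": "pending"}

    pipeline = dlt.pipeline(
        pipeline_name="orders_pipeline",
        destination="duckdb",      # swap for "bigquery", "snowflake", etc.
        dataset_name="raw",
    )
    print(pipeline.run(orders()))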


Hi, I've been looking for something like this! Do any of your customers have a success story migrating off BigQuery to your platform? And how do you compare to MotherDuck? (It looks like you built some of your stack on top of DuckDB.)


Yes, we've had many BigQuery / Snowflake converts. The reality is, most companies don't have 100 TB of data (which is what those platforms are optimized for). MotherDuck has a good post [0] on this:

> There were many thousands of customers who paid less than $10 a month for storage, which is half a terabyte. Among customers who were using the service heavily, the median data storage size was much less than 100 GB.

I'm a fan of what MotherDuck is doing. We're building something different (an opinionated, instant data stack), but yes, we both use DuckDB under the hood.

[0] https://motherduck.com/blog/big-data-is-dead/
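
For a sense of what "DuckDB under the hood" buys you at these data sizes, here's the kind of query you can run locally against a Parquet file, no load step needed (file name and columns are made up):

    import duckdb

    con = duckdb.connect("analytics.duckdb")
    # Query the Parquet file in place and keep results in a local database file.
    con.sql("""
        SELECT customer_id, sum(amount) AS total
        FROM read_parquet('events.parquet')
        GROUP BY customer_id
        ORDER BY total DESC
        LIMIT 10
    """).show()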


Has anyone tried comparing this with a Qwen VL based model? I've heard good things about its OCR performance compared to other self-hostable models, but I haven't really tried benchmarking it.


Yes, I'd like to see this repeated with any of the small VLMs like IBM Granite or the HF Smol models. Pretty much anything in the sub-7B range.
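
For reference, a rough sketch of an OCR-style pass with Qwen2-VL through Hugging Face transformers; the model id, prompt, and file name are just one possible setup and the preprocessing details may need adjusting for your transformers version:

    from PIL import Image
    from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

    model_id = "Qwen/Qwen2-VL-2B-Instruct"  # small, self-hostable variant
    model = Qwen2VLForConditionalGeneration.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto"
    )
    processor = AutoProcessor.from_pretrained(model_id)

    image = Image.open("scanned_page.png")  # placeholder input document
    messages = [{
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "Transcribe all text in this image verbatim."},
        ],
    }]
    prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
    inputs = processor(text=[prompt], images=[image], return_tensors="pt").to(model.device)

    output_ids = model.generate(**inputs, max_new_tokens=1024)
    # Decode only the newly generated tokens, skipping the prompt.
    print(processor.batch_decode(
        output_ids[:, inputs.input_ids.shape[1]:], skip_special_tokens=True
    )[0])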


Now you make me wonder if I could run this entirely inside PyScript.


I think you want something along the lines of DVC (github.com/iterative/dvc).
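
If it helps, DVC also has a small Python API for reading tracked files straight from a repo; the repo URL, path, and tag below are placeholders:

    import dvc.api

    # Read a DVC-tracked file from a given repo and revision (all placeholders).
    data = dvc.api.read(
        "data/train.csv",
        repo="https://github.com/example/my-ml-project",
        rev="v1.0",
    )
    print(data[:200])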


Looking at the syncer, it seems like it copies the whole table to CSV every time (?). Code: https://github.com/BemiHQ/BemiDB/blob/6d6689b392ce6192fe521a...

I can't imagine up to what scale you can keep doing this. Is there anything better we can do before resorting to Debezium to sync the data via CDC?

Edit: add code permalink
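
For context, a full-table re-sync is essentially this shape (not BemiDB's actual code; connection string and table name are placeholders):

    import psycopg2

    # Dump an entire table to CSV via COPY. This is the part that gets
    # expensive as the table grows, since every sync rewrites everything.
    conn = psycopg2.connect("postgresql://user:pass@localhost:5432/app")
    with conn, conn.cursor() as cur, open("orders.csv", "w") as f:
        cur.copy_expert("COPY orders TO STDOUT WITH (FORMAT csv, HEADER true)", f)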


Our initial approach was to implement periodic full table re-syncing. We're starting to work on CDC with logical replication for incremental syncing. Here is our roadmap https://github.com/BemiHQ/BemiDB#future-roadmap
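
For anyone unfamiliar, CDC with logical replication means publishing changes on the source database and consuming them from a replication slot instead of re-copying whole tables. The setup side looks roughly like this (names are placeholders, the source database needs wal_level=logical, and BemiDB's actual implementation may differ):

    import psycopg2

    conn = psycopg2.connect("postgresql://user:pass@localhost:5432/app")
    conn.autocommit = True  # run each statement in its own transaction
    with conn.cursor() as cur:
        # Publish changes for the tables to sync.
        cur.execute("CREATE PUBLICATION bemi_pub FOR TABLE orders, customers")
        # Create a logical replication slot using the built-in pgoutput plugin.
        cur.execute("SELECT pg_create_logical_replication_slot('bemi_slot', 'pgoutput')")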

