Readit News logoReadit News
renegat0x0 commented on I started programming when I was 7. I'm 50 now and the thing I loved has changed   jamesdrandall.com/posts/t... · Posted by u/jamesrandall
renegat0x0 · 3 hours ago
Programming is not art for me. I do not find it useful to gold plate solutions. I prefer getting the job done, sometimes by any means necessary for "the vehicle" to continue running.

AI often generates parts of code for my hobby projects, which allow me speed running with my implementation. It often generates errors, but I am also skilled, so I fix error in the code.

I use AI as boiler plate code generator, or documentation assist, for languages I do not use daily. These solutions I rarely use 1:1, but if I had to go through readme's and readthedocs, it would take me a lot longer.

Would there be more elegant solutions? often - yes. Does it really matter? For me - not.

renegat0x0 commented on More Mac malware from Google search   eclecticlight.co/2026/01/... · Posted by u/kristianp
shreyaspapi · 2 days ago
This is very close to something that happened to a friend of mine. They were trying to follow a MoltBot installation guide, but clicked on a different link that looked legitimate. That page instructed them to paste a command into Terminal. After running it, macOS immediately started asking for multiple permissions, which in hindsight was the big warning sign. But for someone who is non technical might have ran with it.
renegat0x0 · 2 days ago
This might sound stupid, but I have my own index, of trusted domains:

https://github.com/rumca-js/Internet-Places-Database

I start with it, to find stuff I know. If there is stuff I don't know and is important to me, I add it to my database.

Also it enforces me to verify each link I visit. So links I visit are mostly ok.

Though I sometimes use chatgpt for instructions, and if someone poinsed the well "well enough" it might spread malware.

renegat0x0 commented on Ask HN: What are you working on? (February 2026)    · Posted by u/david927
renegat0x0 · 2 days ago
Still working on

- https://github.com/rumca-js/Internet-Places-Database - map of the Internet domains

- https://github.com/rumca-js/Internet-feeds - database of RSS feeds

- https://github.com/rumca-js/yafr - very simple RSS reader

- https://github.com/rumca-js/crawler-buddy - crawling project

- https://github.com/rumca-js/Django-link-archive - another RSS reader

renegat0x0 commented on Battle-Testing Lynx at Allegro   blog.allegro.tech/2026/02... · Posted by u/tgebarowski
self_awareness · 5 days ago
Ale was boli ten InPost, nawet na przykładzie wciskacie te swoje OneBoxy.
renegat0x0 · 5 days ago
dziwnie sie czyta komentarz po polsku na takiej stronie jak ta
renegat0x0 commented on I built a search engine to index the un-indexable parts of Telegram   telehunt.org... · Posted by u/alenmangattu
renegat0x0 · 6 days ago
- "I built a search engine" sounds cool on hacker news, but in reality it is a "company product", right?

- do the links in the footer work? I tried clicking on github icon, and it appears to be broken

renegat0x0 commented on I built a search engine to index the un-indexable parts of Telegram   telehunt.org... · Posted by u/alenmangattu
Antibabelic · 6 days ago
Where is the search engine? The site says that it's a bot directory.
renegat0x0 · 6 days ago
wikipedia "A search engine is a software system that provides hyperlinks to web pages, and other relevant information on the Web in response to a user's query".

I think there can be different expectation connected to this term. It seems to be a "search engine" for bots. Bot directory does not have to have "search" functionality, right?

renegat0x0 commented on Updates to our web search products and Programmable Search Engine capabilities   programmablesearchengine.... · Posted by u/01jonny01
saltysalt · 19 days ago
I built my own web search index on bare metal, index now up to 34m docs: https://greppr.org/

People rely too much on other people's infra and services, which can be decommissioned anytime. The Google Graveyard is real.

renegat0x0 · 19 days ago
I made also something for my own search needs. It's just an SQLite table of domains, and places. I have your search engine there also ;-)

https://github.com/rumca-js/Internet-Places-Database

Demo for most important ones https://rumca-js.github.io/search

renegat0x0 commented on Waiting for dawn in search: Search index, Google rulings and impact on Kagi   blog.kagi.com/waiting-daw... · Posted by u/josephwegner
KellyCriterion · 20 days ago
Scraping is hard. Very good scraping is even harder. And today, being a scraping business is veeery difficult; there are some "open"/public indices, but none of these other indices ever took off
renegat0x0 · 20 days ago
Scraping is hard, and is not hard that much at the same time. There are many projects about scraping, so with a few lines you can do implement scraper using curl cffi, or playwright.

People complain that user-agent need to be filled. Boo-hoo, are we on hacker news, or what? Can't we just provide cookies, and user-agent? Not a big deal, right?

I myself have implemented a simple solution that is able to go through many hoops, and provide JSON response. Simple and easy [0].

On the other hand it was always an arms race. It will be. Eventually every content will be protected via walled gardens, there is no going around it.

Search engines affect me less, and less every day. I have my own small "index" / "bookmarks" with many domains, github projects, youtube channels [1].

Since the database is so big, the most used by me places is extracted into simple and fast web page using SQLite table [2]. Scraping done right is not a problem.

[0] https://github.com/rumca-js/crawler-buddy

[1] https://github.com/rumca-js/Internet-Places-Database

[2] https://rumca-js.github.io/search

u/renegat0x0

KarmaCake day1035October 25, 2022View Original