it's not about the content, but topic. i am tired of reading about AI stuff. i am not interested in it whatsoever and it has been polluting HN for a long time time.
lol strongly agree it is just cherry on top. In big tech they also copy but just copy in a smart way so I don't believe that's the reason they got removed.
Err, yeah, you should neither do any web scraping without respecting robots.txt, nor use ad blockers when using Google. When working with a business, never use Google Docs without paying them. Nah, that's not how the world works and at least not in the software industry.
It can save tons of tokens compared with screen capture or reading the HTML directly. The downside is that we are still not able to handle complex JavaScript and CAPTCHA checks.
ICIC. Indeed, our next step after completing all the JS rendering would be to pretend to be human, i.e., pass the CAPTCHA. Of course, we have to respect robots.txt."
Just wanted to add some clarifications to your list:
> 1. I use the MIT license, so you don't have to pay. Lightpanda requires payment if you use it for business. :)
Lightpanda uses the AGPL license, you can use it for business for free. Your only obligation is to distribute any modified version of Lightpanda's code + the license to your users.
> 3. Lightpanda isn't in Markdown format; it's more like a curl format.
I'm not sure what you mean, but Lightpanda can dump a rendered page in Markdown format via the CLI, CDP (using the custom LP domain), and the native MCP. This is a feature added recently.
BTW, Pardus looks nice, congrats! I'll follow your progress.
And I agree, it's great to see more players in this space!
Bro you are like saying "OH LLM can't do X within 10 days which few people spend over decades" Live a life bro applause and change the title to "it can do xyz" instead of adding the "critical and critical" ...
reply