More

rafram · 2026-01-22T21:41:01 1769118061

> This project is far from perfect, but without generative models, it couldn’t exist. There’s simply no way to do this much work on your own

100 people built this in 1964: https://queensmuseum.org/exhibition/panorama-of-the-city-of-...

One person built this in the 21st century: https://gothamist.com/arts-entertainment/truckers-viral-scal...

AI certainly let you do it much faster, but it’s wrong to write off doing something like this by hand as impossible when it has actually been done before. And the models built by hand are the product of genuine human creativity and ingenuity; this is a pixelated satellite image. It’s still a very cool site to play around with, but the framing is terrible.

rafram · 2026-01-19T17:15:48 1768842948

> To many HTML elements can slow the page to a crawl.

You can read the entirety of War and Peace in a single HTML file: https://standardebooks.org/ebooks/leo-tolstoy/war-and-peace/...

A marketing page, SaaS app landing, etc., will not even begin to approach that size, whether or not you add an extra wrapper around your <a>s.

atoav · 2026-01-19T18:24:31 1768847071

This is a wonderful example how people live in the inverse-world.

Marketing is in the end a way of trying to get people to listen, even if you have nothing substantial to say (or if you have something to say, potentially multiply the effect of that message). That means you have to invent a lot of packaging and fluff surrounding the thing you want to sell to change peoples impression independent of the actual substance they will encounter.

This to me is entirely backwards. If you want people to listen focus on your content, then make sure it is presented in a way that serves that content. And if we are talking about text, that is really, really small in terms of data and people will be happy if they can access it quickly and without 10 popups in their face.

Not that I accuse any person in this thread of towing that line, but the web as of today seems to be 99% of unneeded crap, with a tiny sprinkle of irrelevant content.

j45 · 2026-01-19T20:23:48 1768854228

The experience also depends on the desired outcome, and who's outcome that is. The marketers? or the readers? Which comes first? How far should it go?

shimman · 2026-01-19T18:11:05 1768846265

Almost 15,000 elements! I do agree that too many elements can slow a page but from my experience that starts to happen a few hundred thousand elements, at least that's what we'd run into making data visualizations for network topologies (often millions of nodes + edges) but the trick for that was to just render in canvas.

azangru · 2026-01-20T12:33:03 1768912383

The HTML spec page[0] is the proper War and Peace of the web. It is 2,125MB of text gzipped, twice as large as War and Peace. It still makes some browsers weep, as was discussed in an episode of HTTP 203 podcast[1].

[0] - https://html.spec.whatwg.org/

[1] - https://www.youtube.com/watch?v=FFA-v-CIxJQ

Etheryte · 2026-01-19T21:35:12 1768858512

This is true, yet I've seen plenty of poorly built webapps that manage to run slowly even on a top tier development machine. Never mind what all the regular users will get in that case.

wnevets · 2026-01-19T20:46:34 1768855594

Thank you for this example. I'm going to keep it in mind the next time I asked myself if there are too many elements or not.

croisillon · 2026-01-19T18:59:56 1768849196

nice, Firefox Reader Mode tells me i need 2968 to 3774 minutes

rafram · 2026-01-19T13:12:19 1768828339

(She passed away two days ago.)

rafram · 2026-01-19T04:56:57 1768798617

> For one it had to originate from app.opencode.com

No, that was the initial mitigation! Before the vulnerability was reported, the server was accessible to the entire world with a wide-open CORS policy.

https://github.com/anomalyco/opencode/commit/7d2d87fa2c44e32...

ofrzeta · 2026-01-19T06:00:31 1768802431

How is it wide open? Does everything go through a localhost proxy?

rafram · 2026-01-19T12:40:25 1768826425

Not sure what you mean by that, but before they implemented any mitigations, it had a CORS policy that allowed requests from any origin. As far as I know, Chromium is the only browser platform that has blocked sites from connecting to localhost, so users of other browsers would be vulnerable, and so would Chrome users if they could be convinced to allow a localhost connection.

rafram · 2026-01-19T00:14:16 1768781656

Have you actually accounted for all the services you’re receiving from the government? Road construction and maintenance, schools, availability of clean water, safe aviation, trustworthy financial markets, public universities, funding for research that improves your health and happiness, and so on? I don’t think you can even really put a price on most of those, because they simply could not exist without a centralized system funded by taxes.

rafram · 2026-01-16T13:39:10 1768570750

Google Fonts is not a tracker.

https://developers.google.com/fonts/faq/privacy

> For clarity, Google does not use any information collected by Google Fonts to create profiles of end users or for targeted advertising.

bflesch · 2026-01-16T15:36:56 1768577816

Google has carte blanche to lie to foreigners for national security purposes, it's not even illegal for them. The data is fed into the mass surveillance systems.

IP, user agent, language headers and network timings are enough to fingerprint and associate you with any other accounts at US tech companies. The visited website is linked via Referer / Origin headers to your browsing history.

All of this tracking is passive and there is no way to check for an independent observer.

Yet here you are defending the most privacy invasive company on the planet.

sillyblob67 · 2026-01-16T20:34:47 1768595687

By default, loading Google Fonts from Google’s servers exposes user data to Google (e.g., IP Address, User agent, Referrer, Timestamps, Cache identifiers).

It's passive tracking, but it's tracking.

pwdisswordfishy · 2026-01-16T18:26:34 1768587994

Well, if Google said it, it must be true.

rafram · 2026-01-15T19:10:50 1768504250

Not to mention most interactive content from the New York Times (which is what Rich Harris originally developed it for).

rafram · 2026-01-15T13:33:42 1768484022

It’s difficult to prosecute online harassment, and “hate sites and photoshopped images” aren’t illegal. There’s a right to freedom of speech in the US.

bawolff · 2026-01-15T21:50:32 1768513832

Slander is illegal, and while not exactly the same thing, there is a lot of overlap.

Depending on the nature of the site, there also might be rules around unfair competition that might be implicated.

rafram · 2026-01-15T23:51:44 1768521104

There is a very high bar for slander in the US, especially for public figures.

asdfasvea · 2026-01-15T14:40:21 1768488021

Further the 'hate-' prefix in an accusation makes it one I immediately tune out.

It's just a manipulative word that lets you incite people while not actually revealing what was said.

rafram · 2026-01-15T05:10:31 1768453831

Remember when Archive.is/today used to send Cloudflare DNS users into an endless captcha loop because the creator had some kind of philosophical disagreement with Cloudflare? Not the first time they’ve done something petty like this.

stavros · 2026-01-15T09:09:53 1768468193

It wasn't a philosophical disagreement, they needed some geo info from the DNS server to route requests so they could prevent spam and Cloudflare wasn't providing it citing privacy reasons. The admin decided to block Cloudflare rather than deal with the spam.

arcfour · 2026-01-15T15:51:17 1768492277

Had nothing to do with spam, the argument by archive.today that they needed EDNS client subnet info made no sense, they aren't anycasting with edge servers in every ISP PoP.

ventegus · 2026-01-15T16:02:15 1768492935

They use EDNS for regional compliance, not for bandwidth optimization.

josephcsible · 2026-01-15T23:33:15 1768519995

What specific part of regional compliance actually needs this, and why does no other website seem to need it?

ventegus · 2026-01-16T11:10:37 1768561837

e.g. currently most media snapshots contain wartime propaganda forbidden at least somewhere.

RT content verboten in Germany, DW content verboten in Russia, not to mention another dozen of hot spots.

"Other websites" are completely inaccessible in certain regions. The Archive has stuff from all of them, so there’s basically no place on Earth where it could work without tricks like the EDNS one.

josephcsible · 2026-01-17T00:31:51 1768609911

> The Archive has stuff from all of them, so there’s basically no place on Earth where it could work without tricks like the EDNS one.

Isn't that true of archive.org as well? Why doesn't it need EDNS then?

ventegus · 2026-01-17T16:06:40 1768666000

Actually, I'm not entirely sure on how archive.org achieves its resiliency.

It's a rather interesting question for archive.org, if one were to interview them, that is.

Unlike archive.today, they don't appear to have any issues with e.g. child pornography content, despite certainly hosting a hundred times more material.

They have some strong magic which makes the cheap tricks needless.

arcfour · 2026-01-16T14:21:33 1768573293

That makes zero sense. You're aware that they get the client's actual IP upon connection?

You're saying they have groups of servers with every possible permutation of censorship that they direct clients to through DNS? Absurd.

ventegus · 2026-01-16T14:26:22 1768573582

They always direct clients to a server abroad. The task is exactly opposite to what CDNs do

AndroTux · 2026-01-15T08:28:58 1768465738

That's still a thing. Happens to me as we speak.

wolvoleo · 2026-01-15T08:36:51 1768466211

For me it just doesn't resolve at all on Cloudflare dns. So annoying.

rafram · 2026-01-14T21:52:04 1768427524

- They already do this. Every chat-based LLM system that I know of has separate system and user roles, and internally they're represented in the token stream using special markup (like <|system|>). It isn’t good enough.

- LLMs are pretty good at following instructions, but they are inherently nondeterministic. The LLM could stop paying attention to those instructions if you stuff enough information or even just random gibberish into the user data.