I agree. Here is my thinking.
What if LLM providers made short answers the default (for example, up to 200 tokens, unless the user explicitly enables “verbose mode”)? Add prompt caching on top and route simple queries to smaller models.
Result: a 70%+ reduction in energy consumption without loss of quality.
Current cost: 3–5 Wh per request. At ChatGPT scale, this is $50–100 million per year in electricity (at U.S. rates).
In short mode: 0.3–0.5 Wh per request. That is $5–10 million per year — savings of up to 90%, or 10–15 TWh globally with mass adoption. This is equivalent to the power supply of an entire country — without the risk of blackouts.
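A rough back-of-envelope check of those figures, as a sketch; the request volume and electricity rate below are my own assumptions, not published numbers:

```java
// Back-of-envelope estimate of annual inference electricity cost.
// All inputs are illustrative assumptions, not measured figures.
public class EnergyEstimate {
    public static void main(String[] args) {
        double requestsPerDay   = 1_000_000_000.0; // assumed request volume
        double whPerRequestLong = 4.0;             // midpoint of 3-5 Wh per verbose answer
        double whPerRequestShort = 0.4;            // midpoint of 0.3-0.5 Wh per short answer
        double usdPerKwh = 0.07;                   // assumed U.S. industrial rate

        double annualKwhLong  = requestsPerDay * 365 * whPerRequestLong  / 1000.0;
        double annualKwhShort = requestsPerDay * 365 * whPerRequestShort / 1000.0;

        System.out.printf("Verbose: %.2f TWh/yr, ~$%.0fM/yr%n",
                annualKwhLong / 1e9, annualKwhLong * usdPerKwh / 1e6);
        System.out.printf("Short:   %.2f TWh/yr, ~$%.0fM/yr%n",
                annualKwhShort / 1e9, annualKwhShort * usdPerKwh / 1e6);
    }
}
```

With those assumptions it lands at roughly 1.5 TWh and ~$100M per year for verbose answers versus ~0.15 TWh and ~$10M for short ones, which is in the same ballpark as the figures above.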
This is not rocket science: just a toggle in the interface and, I believe, minor changes to the system prompt. It increases margins, reduces emissions, and frees up resources for real innovation.
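As a sketch of how small the client-side change could be: an OpenAI-style chat request with a brevity system prompt and a hard cap on output tokens. The endpoint, model name, and exact limit are placeholders, not a real provider's API.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Sketch of a "concise mode" request: a brevity system prompt plus a hard
// output-token cap. Endpoint, model name, and limits are placeholders.
public class ConciseModeRequest {
    public static void main(String[] args) throws Exception {
        String body = """
            {
              "model": "example-model",
              "max_tokens": 200,
              "messages": [
                {"role": "system",
                 "content": "Answer as briefly as possible. Do not restate the question or add caveats unless asked."},
                {"role": "user", "content": "What is the capital of France?"}
              ]
            }
            """;

        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create("https://api.example.com/v1/chat/completions"))
                .header("Content-Type", "application/json")
                .header("Authorization", "Bearer " + System.getenv("API_KEY"))
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();

        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.body());
    }
}
```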
And what if the EU or California mandated such a mode? That would have a major impact on data-center economics.
I used to be a Dell customer; all my family members had Dell Latitude laptops. But I agree, they got worse: I had a fan break, a key fall out, and Bluetooth issues on some of them, so when it was time to upgrade, I moved to an Apple MacBook. It took some time to learn my way around macOS, but I am happy now.
It’s per-core licensing with core factors; the core factor ranges from 0.25 to 1. I have not checked how it works now, but that is how it was 5–10 years ago.
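For anyone unfamiliar with the scheme: licensable cores = physical cores × core factor, then multiply by the per-core list price. A quick illustration; the core counts, factors, and price below are made-up examples:

```java
// Illustration of per-core licensing with core factors.
// Core counts, factors, and list price are made-up example numbers.
public class CoreFactorExample {
    static double licensesNeeded(int physicalCores, double coreFactor) {
        return physicalCores * coreFactor; // licensable cores
    }

    public static void main(String[] args) {
        double pricePerLicense = 40_000.0; // hypothetical per-core list price

        double x86  = licensesNeeded(32, 0.5); // e.g. 32-core x86 box, factor 0.5 -> 16 licenses
        double risc = licensesNeeded(16, 1.0); // e.g. 16-core chip, factor 1.0    -> 16 licenses

        System.out.printf("x86 box:  %.0f licenses, ~$%,.0f%n", x86, x86 * pricePerLicense);
        System.out.printf("RISC box: %.0f licenses, ~$%,.0f%n", risc, risc * pricePerLicense);
    }
}
```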
Nowadays, when I post one of our openings, I get thousands of applicants, far more than 2–3 years ago, and over 90% are irrelevant. To limit the applicant count, I ask for references. For example, just the other day a candidate reached out directly; I asked if he could provide references and then heard nothing back. It’s the quickest way to filter out irrelevant candidates. Now I am employing the same tactic in an ATS I am building, where references make you stand out.
I am using Neo4j to build an equipment database, and I use MySQL in the same project to store transactional data. It took some time to figure out the right syntax for a Spring Boot/Neo4j Cypher query, but now it works fine. Why did I choose Neo4j? Because I wanted to play with it :). I can say it is more flexible than a relational database. I would like to keep using it, but you can't create multiple databases within a single instance; I guess it is possible by running separate installations, but I have not tried that yet.
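For reference, the kind of mapping I mean looks roughly like this in Spring Data Neo4j; the Equipment node, its properties, and the INSTALLED_AT relationship are simplified stand-ins for my actual model:

```java
import java.time.LocalDate;
import java.util.List;

import org.springframework.data.neo4j.core.schema.GeneratedValue;
import org.springframework.data.neo4j.core.schema.Id;
import org.springframework.data.neo4j.core.schema.Node;
import org.springframework.data.neo4j.repository.Neo4jRepository;
import org.springframework.data.neo4j.repository.query.Query;
import org.springframework.data.repository.query.Param;

// Simplified sketch of a Spring Data Neo4j mapping; the Equipment node and
// the INSTALLED_AT relationship are stand-ins for the real model.
@Node("Equipment")
class Equipment {
    @Id @GeneratedValue
    Long id;
    String serialNumber;
    LocalDate installedOn;
}

interface EquipmentRepository extends Neo4jRepository<Equipment, Long> {

    // Custom Cypher: all equipment installed at a given site, newest first.
    @Query("""
           MATCH (e:Equipment)-[:INSTALLED_AT]->(s:Site {name: $siteName})
           RETURN e
           ORDER BY e.installedOn DESC
           """)
    List<Equipment> findBySite(@Param("siteName") String siteName);
}
```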
But its formal syntax and formal semantics are underspecified. People might not be able to agree on the interpretation of natural language sentences in general, or grammaticality judgments, or how to handle sentences that stop in
I live in such a country: the local currency is volatile and the laws keep changing. There is no long-term planning; projects should span 1–2 years, and if they bring income, you are lucky. In the long term, the situation is not good.