ntfy.sh v2.18.0 was written by AI

ueiqkkwhuwjw@lemmy.world · 1 month ago

ntfy.sh v2.18.0 was written by AI

d15d@feddit.org · 1 month ago

They are not even trusting it themselves. This is from the release notes

I’ll not instantly switch ntfy.sh over. Instead, I’m kindly asking the community to test the Postgres support and report back to me if things are working

Fuck that.

Mirror Giraffe@piefed.social · 1 month ago

Classic “test in production” strategy, very solid!

Natanox@discuss.tchncs.de · 1 month ago

Yeah, this is now inherently untrustworthy. Better to switch to an alternative.

exu@feditown.com · 1 month ago

Do you know any? I’ve never really looked beyond ntfy.sh until now

november@piefed.blahaj.zone · 1 month ago

There’s SunUp on F-droid, but I don’t know anything about them.

poVoq@slrpnk.net · 1 month ago

That’s from Mozilla, another AI company…

november@piefed.blahaj.zone · 1 month ago

Ugh, seriously? Great…

(Edit) I don’t think this is true? They use Mozilla’s push services, but nothing about their Codeberg repo (yes, it’s on Codeberg, not Github) indicates they’re part of Mozilla.

Kilgore Trout@feddit.it · 1 month ago

Read the README

ѕєχυαℓ ρσℓутσρє@lemmy.sdf.org · 1 month ago

Damn, I guess I’ll stick to the older release for now. Hopefully a viable alternative/fork comes around.

henfredemars@infosec.pub · edit-2 1 month ago

Definitely share your initial concern. Without strong review processes to ensure that every line of code follows the intent of the human developer, there’s no way of knowing what exactly is in there and the implications for the human users. And I’m not just talking about bugs.

They say it’s reviewed, but the temptation to blindly trust is there. In this case, developer appears to have taken some care.

The code was written by Cursor and Claude, but reviewed and heavily tested over 2-3 weeks by me. I created comparison documents, went through all queries multiple times and reviewed the logic over and over again. I also did load tests and manual regression tests, which took lots of evenings.

Let us hope so. Handle with care to ensure responsibility is not offloaded to a machine instead of a person.

patrick@lemmy.bestiver.se · 1 month ago

It looks like that tool is more or less built by a single developer (you already trust their judgment anyways!), and even though the code came through in a single PR it was a merge from a branch that had 79 separate commits: https://github.com/binwiederhier/ntfy/pull/1619

Also glancing through it a bit, huge portions of that are straightforward refactors or even just formatting changes caused by adding a new backend option.

I’m not going to say it’s fine, but they didn’t just throw Claude at a problem and let it rewrite 25k lines of code unnecessarily.

mudkip@lemdro.id · 1 month ago

Any AI usage immediately discredits the software for me, because it calls into question all of their past and future work.

blarg_dunsen@sh.itjust.works · 1 month ago

Oh boy, do I have bad news about 90% of the internet for you…

mudkip@lemdro.id · 1 month ago

Linus sent an email recently to the Kernel Mailing List trashing AI slop and rejecting AI generated patches. The fact that he used it to play around with a script doesn’t invalidate the fact that he distrusts code written by LLMs when it actually matters.

Possibly linux@lemmy.zip · 1 month ago

I’d run for the hills

There are so many issues with AI

NoFun4You@lemmy.world · 1 month ago

Like ppl thinking skilled engineers cannot vet AI output. AI is pretty good for programming.

Ohi@lemmy.world · 30 days ago

You’re absolutely right, and the vast majority of people on this platform seem to get offended by anything AI related. Software engineers have been reviewing code made by other people since the dawn of the craft. Guess what y’all, AI generated code looks exactly the same, if not better on the first pass at creating a thing.

Down vote me all you want homies. You’re living in a fantasy if you think all AI is slop. Sure, I can see how it’s ruining some content on the Internet, but for code related tasks, its going to dramatically change the world for the better.

MerryJaneDoe@lemmy.world · 6 days ago

I think you would need to first make the case that software is making the world a better place. So far, it’s got a spotty record…

Ohi@lemmy.world · 5 days ago

The same thing happened to music when GarageBand and similar tools lowered the effort required to produce quality tracks. It took power away from the old gatekeepers and gave it to people with ideas but not traditional access. AI is doing that to software now.

notabot@piefed.social · 1 month ago

I’m assuming this is some sort of canary message to indicate that the code base has been compromised, the author can’t talk about it, and everyone should immediately stop using the service. Surely no-one would be unwise enough to commit this otherwise?

Even ignoring the huge red LLM flag, a 25kLOC delta in a single PR should be cause for instant rejection as there’s no way to fully understand or test it, let alone in 2-3 weeks.

Nalivai@lemmy.world · 1 month ago

This doesn’t make me uneasy. It makes me resentful, a little angry, and a lot tired. Thanks for bringing it to attention, I will make sure that nothing of that project or from that author will ever cross my ecosystem again.

NoFun4You@lemmy.world · 1 month ago

You’re gonna have a lot of hate in your blood if you go around acting like the most skilled engineers aren’t using AI to write code.

mic_check_one_two@lemmy.dbzer0.com · 1 month ago

There’s a massive difference between “using AI to write code” and refactoring almost 15k lines in a single push.

The “best” uses of AI in coding are for small blocks. You don’t just tell it “I need a program that does X, Y, and Z” because that will (at best) result in horrible code. Instead, it’s best practice to use it for small blocks of code, where you tell it something more akin to “I need a function that takes {a} as a variable, does {thing}, and outputs {x}.” That way you’re not using it to generate giant swaths of code all at once, you’re just using it to generate individual functions that you can then use as needed.

But it also means that the “most skilled” (as you put it) programmers are basically putting themselves in a permanent debugging seat instead of working as a developer. And in many cases, debugging code can be just as (or more) difficult than writing the initial code. It’s also why senior devs exist to audit code from junior devs, because it’s assumed that junior devs will inevitably make mistakes that need debugging, or will make code that clashes with code from other junior devs. And it’s the senior dev’s job to ensure that the code is both functional and integrated properly.

And this “adding 15k lines of code and ripping out 10k lines” push smells a lot like the former “write me a program to do {thing}” usage.

Nalivai@lemmy.world · 30 days ago

Most skilled engineers, and even mildly skilled engineers don’t use slopgenerators to write code. Some of them use it sometimes to do some menial tasks, although I’m not convinced it actually saves them time. It sure doesn’t every time we measure it.
There is however a plague of low skilled people who convinced themselves that they’ve found a shortcut to being an engineer. Those people are producing bad things at a fast pace, and the only reason we’re not in an unsolvable crisis yet is that their slop isn’t hitting prod very often on account of being bad.

ntfy.sh v2.18.0 was written by AI

ntfy.sh v2.18.0 was written by AI

Release v2.18.0 · binwiederhier/ntfy