Skip Navigation

stravanasu

@ pglpm @lemmy.ca

Posts

150
Comments

596
Joined

3 yr. ago

Moderating

Typography & fonts lemmy.ca

English usage and grammar lemmy.ca

Software alternatives for Linux lemmy.ca

Buster Keaton lemmy.ca

satori lemmy.ca

Bayesian Theory mander.xyz

General Relativity mander.xyz

1d ago

Does traning AI/ML-models on AI-generated content causes collapse on the quality of the output?
Jump
2

stravanasu @lemmy.ca 1d ago

It is actually not so difficult to see this for yourself in a much simplified setting. One can easily build a "Small Language Model" that extracts correlations between only three consecutive words. On the web there's plenty of short scripts that do this; here and here is one example. The output created by such a SLM can have remarkably long sentences with grammatical meaning (see the examples in the links above); this is remarkable since all it learned was correlations between triplets of words.
Now you can take a large amount of output from such a SLM, and use it to train a second, identical or even better SLM, then check the output generated by this second one. You'll see that the new output is less coherent than the one from the first SLM. Give the output of the second SLM to a third, and you'll see even less coherent text coming out. And so on.

1d ago

Does traning AI/ML-models on AI-generated content causes collapse on the quality of the output?

3

stravanasu @lemmy.ca 1d ago

They aren't out of context, and you have just said the same thing. Data processing can help in removing noise, but it can't help in creating information or extracting information that wasn't there in the first place. In fact – again as you said – it can end up destroying part of the original information.

LLMs extract word correlations from textual data. Already in this process they are losing information, since they can't extract correlations beyond a certain (yet large) length, and don't extract correlations at shorter lengths. And in creating output they insert spurious correlations that replace (destroy) some of the original ones. This output will contain even less information than the original training data. So a new LLM trained with such an output will give back even less.

1d ago

"Iron Lung" movie: does anyone know more about the "self-release"?

stravanasu @lemmy.ca 1d ago

What a job! Thanks for the info!

1d ago

Does traning AI/ML-models on AI-generated content causes collapse on the quality of the output?

5

stravanasu @lemmy.ca 1d ago

Yes it does. Indeed it is a mathematical theorem from Information Theory, called the data-processing inequality. Quoting from two good textbooks on Information Theory:

“No clever manipulation of the data can improve the inferences that can be made from the data” (Cover & Thomas, Elements of Information Theory §2.8).

“Data processing can only destroy information” (MacKay, Information Theory, Inference, and Learning Algorithms exercise 8.9).

3d ago

"Iron Lung" movie: does anyone know more about the "self-release"?

stravanasu @lemmy.ca 3d ago

Thanks for the info! 🚀

Very nice what he's doing, although I'll prefer to get it from other sources than YouTube (which I suppose will get a commission).

3d ago

"Iron Lung" movie: does anyone know more about the "self-release"?

stravanasu @lemmy.ca 3d ago

My post obviously begged for this. Iron lung and lotion – what a combination!

Movies @lemmy.world

stravanasu @lemmy.ca

3d ago

"Iron Lung" movie: does anyone know more about the "self-release"?

thetvdb.com /movies/iron-lung

movies @piefed.social

stravanasu @lemmy.ca

3d ago

"Iron Lung" movie: does anyone know more about the "self-release"?

thetvdb.com /movies/iron-lung

3d ago

Ubuntu 26.02 working like it’s too new

stravanasu @lemmy.ca 3d ago

You've read the stances of all different people. I agree with most and I'm a bit more conservative: I switch to a LTS (even-numbered) release only when its main non-LTS (odd-numbered) upgrade is out; and skip all non-LTS.

5d ago

I2P on laptop: How to properly shut down the router?

stravanasu @lemmy.ca 5d ago

Thanks for the additional tip! Not using docker at the moment, but I'll keep this in mind if I do :)

5d ago

The locking-down of Android: why I had to 'hack' a banking app just to get it running

stravanasu @lemmy.ca 5d ago

I agree. I'll actually contact the national Consumer Policies department and ask if this is at all legal.

5d ago

Oh No! Now A Federal Bill Wants OS-Level Age Verification for Everyone in the USA

stravanasu @lemmy.ca 5d ago

The fundamental problem is that age verification is bullshit. So let's not normalize it. It must be fought, on all fronts, including the FOSS front.

6d ago

The locking-down of Android: why I had to 'hack' a banking app just to get it running

stravanasu @lemmy.ca 6d ago

Indeed I wonder if that kind of keyboard check is even legal - personally I feel it as a breach of my privacy, none of their fucking business what kind of input method I use. (If anyone here is knowledgeable about such matters, please let me know!)

7d ago

The locking-down of Android: why I had to 'hack' a banking app just to get it running

7

stravanasu @lemmy.ca 7d ago

I've been having similar turd-kind encounters with bank apps even within Android. I use the egregious Heliboard from F-droid, and my bank app refused to start because I use an "untrusted keyboard" – funny as it's way more trustworthy that Gboard or Microslop board apps. Turns out the apps of all banks in my country are like that. So now I simply access the bank via the browser instead. Fuck their apps.

But I understand that the browser solution may not work for everyone :(

Partly this problem comes from incompetence of the app's developers, partly for shifting responsibility: it seems to me that they let Play Store do the checks, so if any hacking happens they can blame Play Store. And there's also the modern motto: "if you want to make an app secure, make it unusable". Even better I'd then say "don't make it at all"! – there, security-problem fully solved.

Put pressure on banks would be best. Possibly one could also play a "disability" card: I must use such-and-such app or OS owing to visual impairment, say. Or collect signatures for a petition... but I imagine we're a very small minority.

As a protest in my case I changed bank a couple of times.

But thank you for the USB-ADB tip! I'll use it when I switch to GrapheneOS.

1w ago

I2P on laptop: How to properly shut down the router?

stravanasu @lemmy.ca 1w ago

Thank you, this is exactly what was unclear to me. I also prefer to shut it down from the command line. I'll use the graceful command as well!

1w ago

I2P on laptop: How to properly shut down the router?

2

stravanasu @lemmy.ca 1w ago

Thank you for the information and advice. I'll set aside 15 min for shutting it down gracefully then.

1w ago

I2P on laptop: How to properly shut down the router?

stravanasu @lemmy.ca 1w ago

Thank you for the reply and explanations! Now I understand. I'll also use i2prouter graceful then.

Regarding the config, I'll investigate. It occurs only from time to time, but it may have happened because I was shutting the router down the wrong way.

Thanks again!

The Invisible Internet Project @lemmy.world

stravanasu @lemmy.ca

1w ago

I2P on laptop: How to properly shut down the router?

1w ago

PeerBox, the first fully P2P secure email system

2

stravanasu @lemmy.ca 1w ago

I understand. Be aware that this can be quite a limiting factor, more than you think. The need to think about home servers starts to clash with the statement that

It was built from day one to be usable by anyone, with zero tech background required.

1w ago

Oh No! Now A Federal Bill Wants OS-Level Age Verification for Everyone in the USA

1

stravanasu @lemmy.ca 1w ago

Possibly. That's up to your distro. However, consider that EU as well is starting to speak about age verification. It's quite clear that the whole "West" aspires to be more like Russia and China.

1w ago

PeerBox, the first fully P2P secure email system

4

stravanasu @lemmy.ca 1w ago

Thank you for the explanation. But I don't understand how it can work if:

I send a message while my contact is offline,
then I go offline,
my contact comes back online while I'm still offline.

The message needs to be somewhere in between. This is a situation that occurs quite often when you message with people in very different time zones.

1w ago

PeerBox, the first fully P2P secure email system

13

stravanasu @lemmy.ca 1w ago

Nobody in the middle. No server storing anything. No company analyzing anything
[...]
In deferred mode, it works just like regular email. Meaning your contact doesn’t need to be online when you send the message. Your contact will get it automatically once they come online.

So I can't send a message while my contact is offline, then go offline myself, and expect that my contact will receive it when they go online? This is quite limiting.

How is PeerBox different from Delta Chat?

1w ago

Oh No! Now A Federal Bill Wants OS-Level Age Verification for Everyone in the USA

stravanasu @lemmy.ca 1w ago

I wish, but I'm not so sure. Look at what happened with the Californian age-verification laws and Systemd for example. Some (arsehole, in my personal opinion) FOSS developers hurried up and bent over backwards to start complying. We'll probably end up having "Linux" distros that will comply, and Linux distros, probably distributed via secret channels, that won't.

Linux @lemmy.world

stravanasu @lemmy.ca

2w ago

Oh No! Now A Federal Bill Wants OS-Level Age Verification for Everyone in the USA

itsfoss.com /news/os-level-age-verification-across-us/

Linux @programming.dev

stravanasu @lemmy.ca

2w ago

Oh No! Now A Federal Bill Wants OS-Level Age Verification for Everyone in the USA

itsfoss.com /news/os-level-age-verification-across-us/

The Invisible Internet Project @lemmy.world

stravanasu @lemmy.ca

2w ago

I2P and I2Pd: any more information?

One Punch Man - OPM @ani.social

stravanasu @lemmy.ca

3w ago

One Punch Man (webcomic) chapter 159

mangafire.to /read/one-punch-man-webcomicoriginall.jjn/en/chapter-159

One Punch Man - OPM @lemmy.ml

stravanasu @lemmy.ca

3w ago

One Punch Man (webcomic) chapter 159

mangafire.to /read/one-punch-man-webcomicoriginall.jjn/en/chapter-159

privacy @lemmy.ca

stravanasu @lemmy.ca

4w ago

Denial Takes Hold as Teens Circumvent Australian Age Verification

www.freezenet.ca /denial-takes-hold-as-teens-circumvent-australian-age-verification/

Privacy @lemmy.world

stravanasu @lemmy.ca

4w ago

Denial Takes Hold as Teens Circumvent Australian Age Verification

www.freezenet.ca /denial-takes-hold-as-teens-circumvent-australian-age-verification/

Privacy @lemmy.dbzer0.com

stravanasu @lemmy.ca

4w ago

Denial Takes Hold as Teens Circumvent Australian Age Verification

www.freezenet.ca /denial-takes-hold-as-teens-circumvent-australian-age-verification/

Privacy @lemmy.dbzer0.com

stravanasu @lemmy.ca

4w ago

"FOSS" and "GNU Linux" do not automatically mean "for the community" or "for human rights"

Privacy @lemmy.world

stravanasu @lemmy.ca

4w ago

"FOSS" and "GNU Linux" do not automatically mean "for the community" or "for human rights"

Linux @lemmy.world

stravanasu @lemmy.ca

4w ago

"FOSS" and "GNU Linux" do not automatically mean "for the community" or "for human rights"

Linux @programming.dev

stravanasu @lemmy.ca

4w ago

"FOSS" and "GNU Linux" do not automatically mean "for the community" or "for human rights"

Linux @lemmy.world

stravanasu @lemmy.ca

1mo ago

A polite open letter to KDE developers and maintainers, which got blocked by a moderator.

KDE & Plasma users @lemmy.ml

stravanasu @lemmy.ca

1mo ago

A polite open letter to KDE developers and maintainers, which got blocked by a moderator.

KDE @lemmy.kde.social

stravanasu @lemmy.ca

1mo ago

A polite open letter to KDE developers and maintainers, which got blocked by a moderator.

privacy @lemmy.ca

stravanasu @lemmy.ca

1mo ago

Will we have to choose between privacy-friendly Linux distros vs legal Linux distros?

Linux @lemmy.world

stravanasu @lemmy.ca

1mo ago

Will we have to choose between privacy-friendly Linux distros vs legal Linux distros?