Isn't Wikipedia where AI gets like half of its information from anyway?
Wikipedia editors adopt a policy giving admins the authority to quickly delete AI-generated articles that meet certain criteria, like incorrect citations
a year ago by Pro to c/technology
reddit allows GOOGLE to scrape it for its AI, because google allows them to use thier v3captcha for thier moderation and banning purposes.
Do you think these people surreptitiously submitting articles written by AI are gonna be capable of validating what they're submitting is even true? Particularly if the (presumably effective) Wikipedia defense for this is detecting made up citations?
This kind of thing makes something valuable to everyone, like Wikipedia, ultimately a less valuable resource, and should be resisted and rejected by anyone with their head screwed on
Why would wikipedia of all things be your go to for that?
Wikipedia is the most accurate encyclopedia to date; its perceived unreliability as to its correctness is largely a misunderstanding that arose from misconceptions as to why one can’t (or shouldn’t, depending on case) cite it in academia. People think that it can’t be cited because of its unreliability but in reality it’s simply because it’s a third hand source; i.e. a resource.
Wikipedia is built near-purely on second hand sources, which is how all encyclopedias are intended to be constructed. As long as one ensures the validity of the second hand source used, encyclopedias are great resources.
Wikipedia is the most accurate encyclopedia to date
How did you determine that?
Wikipedia is built near-purely on second hand sources, which is how all encyclopedias are intended to be constructed. As long as one ensures the validity of the second hand source used, encyclopedias are great resources.
True, but basically nobody does check that the sources are valid, and they often aren't.
How do you know they often aren't? I'm an academic and regularly use wikipedia to find citations for sources. I've have yet to come across any citations that were wrong.
For anything that is not politically contentious, it’s very good. Even the politically contentious stuff tries to give the most “balanced”/“mainstream” interpretation usually.
There are communities of people which hyperfixate on certain topics. Think dinosaurs and trains. If a serious Dino-head sees a mistake about the length of Diplodocus, they are going to drop everything and fix it immediately.
I routinely check wiki sources - I’ve taught a lot of college kids that as a way to quickly find sources for papers. Most of the time, topics I know a lot about from my own educational background match what I see on wiki and cite the same kinds of sources I would use.
It’s not perfect - there’s the infamous story of an American teenager writing all of Scots Wikipedia without knowing any Scots - but you have to respect the fact that there are a lot of people who are obsessed with certain topics and will watch their pet articles like a hawk.
This guy is a troll and he's going to keep asking questions as long as people keep answering them.
I'm just going to block him and move on; got no time to suffer fools like this any more.
Man, you people really loath anyone who doesn't just shut up and agree.
NATOpedia is a great resource if you go in with an assumption of a pro-western bias, but a source of truth lmao.
Someone is mad their sources got removed for not being credible.
What a shock that someone who pretends to be an anarchist would go to bat to defend the reliablity of far right western propaganda outlets like Radio Free Asia, the Victims of Communism Memorial Foundation, and the Australian Strategic Policy Institute. Remember, if it doesn't' have the Western Neo-liberal seal of approval, it's not credible and should be removed, that's the anarchist way!
A lot of western liberals really do treat it like the Holy Scripture. Any intelligence agencies would just have to pay a few admins and higher some people to sculpt the list of "reliable sources" that Wikipedia uses and they can basically fully control what hundreds of millions of neoliberals believe.
And they have.
You're just salty that the russian and chinese propaganda edits are thrown out as soon as they pop up lol
Well you're free to submit sources that are credible and challenge that old ones aren't.
The headline reflects a sensible move by Wikipedia to protect content quality. AI-generated articles often include errors or fake citations, so giving admins the authority to quickly delete such content helps maintain accuracy and credibility. While there's some risk of overreach, the policy targets misuse, not responsible AI-assisted editing, and aligns with Wikipedia’s existing standards for removing low-quality material.
Ha, fair question! But no irony here—I actually wrote it myself. That said, it's kind of funny how quickly we've reached the point where any well-written, balanced take sounds like it could be AI-generated. Maybe that's part of the problem we're trying to solve!
Either LLM or quality trolling
It always feels weird when people write an essay as if this is their final quarter project for high school. Too neat, thoughts too organized, much flowery proses.
If anyone has specific questions about this, let me know, and I can probably answer them. Hopefully I can be to Lemmy and Wikimedia what Unidan was to Reddit and ecology before he crashed out over jackdaws and got exposed for vote fraud.
Well now I want to know about jackdaws and voter fraud
what about the jackdaws thing?
Is there a danger that unscrupulous actors will try and build out a Wikipedia edit history with this and try to mass skew articles with propaganda using their "trusted" accounts?
Or what might be the goal here? Is it just stupid and bored people?
So Wikipedia has three methods for deleting an article:
This new criterion has nothing to do with preempting the kind of trust building you described. The editor who made it will not be treated any differently than without this criterion. It's there so editors don't have to deal with the bullshit asymmetry principle and comb through everything to make sure it's verifiable. Sometimes editors will make these LLM-generated articles because they think they're helping but don't know how to do it themselves, sometimes it's for some bizarre agenda (e.g. there's a sockpuppet editor who's been occasionally popping up trying to push articles generated by an LLM about the Afghan–Mughal Wars), but whatever the reason, it just does nothing but waste other editors' time and can be effectively considered unverified. All this criterion does is expedite the process of purging their bullshit.
I'd argue meticulously building trust to push an agenda isn't a prevalent problem on Wikipedia, but that's a very different discussion.
Thank you for your answer, I really feel happy that Wikipedia is safe then. Stuff happening nowadays makes me always think of the worst.
Do you think your problem is similar to open-source developers fighting AI pull requests? There it was theorised that some people try to train their models by making them submit code changes and abuse the maintainers' time and effort to get training data.
Is it possible that this is an effort to steal work from Wikipedia editors to get you to train their AI models?
Is it possible that this is an effort to steal work from Wikipedia editors to get you to train their AI models?
I can't definitively say "no", but I've seen no evidence of this at all.
How frequently are images generated/modified by diffusion models uploaded to Wikimedia Commons? I can wrap my head around evaluating cited sources for notability, but I don't know where to start determining the repute of photographs. So many images Wikipedia articles use are taken by seemingly random people not associated with any organization.
So far, I haven't seen all that many, and the ones that are are very obvious like a very glossy crab at the beach wearing a Santa Claus hat. I definitely have yet to see one that's undisclosed, let alone actively disguising itself. I also have yet to see someone try using an AI-generated image on Wikipedia. The process of disclaiming generative AI usage is trivialized in the upload process with an obvious checkbox, so the only incentive not to is straight-up lying.
I can't say how much this will be an issue in the future or what good steps are to finding and eliminating it should it become one.
I'm going to write this from the perspective of the English Wikipedia, but most specifics should have some analog in other Wikipedias. By "contribute to new articles", do you mean create new articles, contribute to articles which are new that you come across, or contribute to articles which you haven't before (thus "new to you")? Asking because the first one has a very different – much more complicated – answer from the other two.
The short answer is that I really, really suggest you try other things before trying to create your first article. This isn't just me; every experienced editor will tell you that creating a new article is one of the hardest things any editor can do, let alone a newer one. It's why the task center lists it as being appropriate for "advanced editors". Finding an existing article which interests you and then polishing and expanding it is almost always more rewarding, more useful, easier, and less stressful than creating an article from scratch. And if creating articles sounds appealing, expanding existing stub articles is great experience for that.
The long answer is "you can", but it's really hard:
Some of these apply to normal editing too, but working within an article others have worked on and might be willing to help with is vastly easier than building one from scratch. If you want specific help in picking out, say, an article to try editing and are on the English Wikipedia, I have no problem acting like bowling bumpers if you're afraid your edits won't meet standards.
Unidan was a legend, he will be missed.
Why wouldn’t they default to not accepting any AI generated content
If you can accurately detect what content is AI generated, you'll have a company worth billions overnight
Manual approval process would kill the site I think, there's just so much content on it that gets updated constantly it would just grind it all to a halt
Right, and by manual approval it just would be the absolute lowest priority. Kind of like the automated message "we're expecting higher than normal call volumes" as companies gently tell us their margins are more important than their customers.
common wikipedia w
This has since been corrected, but there is a great video by BadEmpanada about the state of the Holodomor’s page:
Wikipedia has a giant article regurgitating the false claims from the extremist Falun Gong cult that China is stealing their organs.
Here you go, Would you like me to cut your food for you too?
Link or gtfo, let people check the sources themselves
Here you go, Would you like me to cut your food for you too?
not interested in doing work for others.
There have been plenty instances of manipulation over the years and shady practices in the organisation itself.
Unbelievable there are still so many gullible people still thinking it's a reputable source.
if you love it so much for some reason then keep using it.
garbage in, garbage out
@lemmy.world
go to feed...
@lemmy.world
go to feed...
Oh for fuck's sake...
I'd not considered this was happening (people submitting AI wiki articles)
save