Shortcuts: WD:PC, WD:CHAT, WD:?
Wikidata:Project chat
Wikidata project chat A place to discuss any and all aspects of Wikidata: the project itself, policy and proposals, individual data items, technical issues, etc.
Please use
|
- Afrikaans
- العربية
- беларуская
- беларуская (тарашкевіца)
- български
- Banjar
- বাংলা
- brezhoneg
- bosanski
- català
- کوردی
- čeština
- словѣньскъ / ⰔⰎⰑⰂⰡⰐⰠⰔⰍⰟ
- dansk
- Deutsch
- Zazaki
- dolnoserbski
- Ελληνικά
- English
- Esperanto
- español
- eesti
- فارسی
- suomi
- føroyskt
- français
- Nordfriisk
- galego
- Alemannisch
- ગુજરાતી
- עברית
- हिन्दी
- hrvatski
- hornjoserbsce
- magyar
- հայերեն
- Bahasa Indonesia
- interlingua
- Ilokano
- íslenska
- italiano
- 日本語
- Jawa
- ქართული
- қазақша
- ಕನ್ನಡ
- 한국어
- kurdî
- Latina
- lietuvių
- latviešu
- Malagasy
- Minangkabau
- македонски
- മലയാളം
- मराठी
- Bahasa Melayu
- Mirandés
- مازِرونی
- Nedersaksies
- नेपाली
- Nederlands
- norsk bokmål
- norsk nynorsk
- occitan
- ଓଡ଼ିଆ
- ਪੰਜਾਬੀ
- polski
- پنجابی
- português
- Runa Simi
- română
- русский
- Scots
- davvisámegiella
- srpskohrvatski / српскохрватски
- සිංහල
- Simple English
- slovenčina
- slovenščina
- shqip
- српски / srpski
- svenska
- ślůnski
- தமிழ்
- తెలుగు
- ไทย
- Tagalog
- Türkçe
- українська
- اردو
- oʻzbekcha / ўзбекча
- Tiếng Việt
- Yorùbá
- 中文
On this page, old discussions are archived after 7 days. An overview of all archives can be found at this page's archive index. The current archive is located at 2024/07. |
750 V DC conductor
[edit]I have temporarely added Q21855034 (600 Volt) as the closest to a '750 V DC conductor' electrification as a P930 property to Q918235. I hesitade to create a new electrification item, as there are several type third rail type (upper, side and underneath contact) and maybe the type of electric contact is a seperate property from P930. Having a third rail shoe is a quite essential part of a train type.Smiley.toerist (talk) 08:20, 29 June 2024 (UTC)
- Q25857994 can be used; the differences between 600 V DC railway electrification (Q21855034) and 750 V DC railway electrification (Q25857994) are the use of direct current (Q159241) in Q21855034 but not Q25857994 (not significant as both are DC railway electrification (Q11581821)) and the voltage; they don't specify the type of contact or whether it is third rail or overhead. Q838484#P930 has third rail as a qualifier. Peter James (talk) 13:44, 29 June 2024 (UTC)
- Thanks, Smiley.toerist (talk) 09:42, 30 June 2024 (UTC)
Named after
[edit]we have the property "named after" which ban have the value John Smith. Is there a property to use at John Smith to show "things named after this person"? RAN (talk) 16:31, 29 June 2024 (UTC)
- If such a property would exist you would see it listed at named after (P138). We generally avoid inverse properties and I can't think of a good reason to have the property "things named after this person". ChristianKl ❪✉❫ 16:47, 29 June 2024 (UTC)
- Not sure if this would be useful for you but there is a gadget that you can enable in preferences called "relateditems" which "Adds a button to the bottom of item pages to display inverse statements." So would show all things named after the person as well any other properties that link to the item Piecesofuk (talk) 17:01, 29 June 2024 (UTC)
- We have lots: "Father:Child"; "Owner_of (P1830):Owned_by (P127)"; "Occupant (P466):Residence (P551)"; "member of the crew of (P5096):crew member(s) (P1029)" those just ones I can think of without searching. --RAN (talk) 17:32, 29 June 2024 (UTC)
- The latest proposal was Wikidata:Property proposal/Namesakes. GZWDer (talk) 17:49, 29 June 2024 (UTC)
- All those examples are properties created early, grandfathered in when the issues with inverses became clearer. ArthurPSmith (talk) 16:36, 1 July 2024 (UTC)
Merging Q117208646 (exercise & fitness product) into Q352222 (exercise equipment)?
[edit]The former seems to be generated from Google's product taxonomy, but overall seems to refer to the same concept. The subgraphs of both terms are overlapping but not identical, so perhaps a clean-up would be welcome. Any thoughts? Alcinos (talk) 22:34, 29 June 2024 (UTC)
- Is there any fitness products that are not exercise equipment? Trade (talk) 13:05, 1 July 2024 (UTC)
- Fitbits and other activity tracker (Q16001686) perhaps. Not my area, but all of https://www.wikidata.org/wiki/Special:WhatLinksHere/Q117208646 look like they could fit in the other category Vicarage (talk) 13:36, 1 July 2024 (UTC)
- Activity trackers are a good example, one could argue that they are indeed fitness products but not really exercise equipment (although the link to either concept is currently missing in the page you linked). Other elements that could be in the same case: fitness app (Q25104632), smart scale (Q116454756), perhaps also massage gun (Q110997596).
- In the light of this, here is a refined proposal:
- - rename exercise & fitness product (Q117208646) to "fitness product", and make exercise equipment (Q352222) a subclass of it
- - move all current sub-classes of exercise & fitness product (Q117208646) to be sub-classes of exercise equipment (Q352222) (as Vicarage noted, currently all of them seem to be appropriate sub-classes
- - add links for the remaining "fitness products" that are not "exercise equipment", such as activity tracker and the other listed above
- How does that sound? Alcinos (talk) 15:51, 1 July 2024 (UTC)
- Other proposed "fitness products" that are not "exercise equipment":
- - heart rate monitor (Q925303) although that one is currently listed as an exercise equipment, not sure if I agree
- - yoga pants (Q8054336) (perhaps it would be nice to have a "workout clothes" class? I can't seem to find one currently) Alcinos (talk) 15:58, 1 July 2024 (UTC)
- sportswear (Q645292) includes exercise in description Vicarage (talk) 16:37, 1 July 2024 (UTC)
- It may be too broad to be a subclass of "fitness product". Eg sports jersey (Q2623418) is a sportswear but likely wouldn't be a good (indirect) subclass of "fitness product" Alcinos (talk) 17:09, 1 July 2024 (UTC)
- sportswear (Q645292) includes exercise in description Vicarage (talk) 16:37, 1 July 2024 (UTC)
- Fitbits and other activity tracker (Q16001686) perhaps. Not my area, but all of https://www.wikidata.org/wiki/Special:WhatLinksHere/Q117208646 look like they could fit in the other category Vicarage (talk) 13:36, 1 July 2024 (UTC)
Bogus disease English aliases prefixed with "obsolete" - cleanup needed
[edit]A large number (thousands?) of pages for diseases and classes of diseases currently have bogus aliases in English "obsolete X", where X is usually the main English label. For example, hemophilia (Q134003) has the alias "obsolete hemophilia". Likewise, rinderpest (Q157008) has alias "obsolete rinderpest" (though in a sense it actually is obsolete!). Some have variations, e.g. chronic pancreatitis (Q1996053) has alias "obsolete relapsing pancreatitis".
These seem to have been added by a bot trying to import an external taxonomy in 2020. Example of a bad revision: https://www.wikidata.org/w/index.php?title=Q194435&oldid=1313119769
How should these be cleaned up? Can a bulk query be used to find them all?
73.223.72.200 05:00, 1 July 2024 (UTC)
Wikidata weekly summary #634
[edit]This is the Wikidata summary of the week before 2024-07-01. Please help Translate.
Discussions
- New requests for permissions/Bot:
- DifoolBot 4 Task(s) - Split single references containing multiple reference URLs into multiple references.
- Bot Bozze Task(s) - Add sitelinks to itwiki draft articles after they've been moved to the main namespace.
- New request for comments: Spelling convention for labels and descriptions in English - RfC started 2024-06-25. This RfC requests feedback and input for finding consistency in spelling convention as English has multiple regional variations.
- Past: The Lexicodays 2024 was an online event designed to offer a discussion space for the Wikidata community about Lexicographical Data. An archive of some of the slides and session recordings are here c:Category:Lexicodays 2024. More will be added as they become available.
- Upcoming:
- The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 10th July 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
- Talk to the Search Platform / Query Service Team—July 3, 2024
- Botany-focused Wikidata online workshop online as part of the #IBC2024. Date: Tuesday 9th July at 9pm NZST (GMT+12) / 11 am central Europe. Register here!
Press, articles, blog posts, videos
- Blogs
- Querying for audio on Wikidata - This blog post discusses using SPARQL queries on Wikidata to find audio recordings, focusing on musical compositions and their associated genres.
- Stories from the anti-disinformation repository: Why Wikimedia is an antidote to disinformation - The blog post highlights how Wikidata, as a central storage repository, plays a crucial role in countering disinformation by providing reliable, structured data for Wikimedia projects and beyond.
- Diff Blog: Imagining a Wikidata future for librarians together - the sixth and final blog post from the LD42023 conference. Silvia Gutiérrez (WMF) and Giovanna Fontenelle (WMF) document the results of the collaborative session on building a bridge between the Library-Wikidata community and WMF.
- Census IDs are Now Wikidata External Identifiers
- Library Knowledge as Linked Data: A Wikidata Approach: Contributing to a shared data commons. David Erlandson describes the experiences of using Wikidata for the pilot Program for Cooperative Cataloging to "accelerate the movement towards ubiquitous identifier creation and identity management at the network level".
- Papers
- Mapping the Past: Geographically Linking an Early 20th Century Swedish Encyclopedia with Wikidata - This paper describes the extraction of location entries from a prominent Swedish encyclopedia and sheds light on selection and representation of geographic information in the Nordisk Familjebok. By A. Ahlin, A. Myrne & P. Nugues.
- Papers from the just-ended Wiki Workshop 2024
- Do LOD Conventions Impede the Representation of Diversity? The Case of Disabled Actors in DBpedia and Wikidata?
- SPARQL for LIS Analytics: Exploring Gender Representation amongst PCC Wikidata Pilot Participant
- Wikidata Vandalism Detection with Graph-Linguistic Fusion
- Wikidata Quality Toolkit: Entity Schema Generator Demonstration (tool demonstration)
- Videos
- Wikidata Knowledge Graph to Enable Equitable and Validated Generative AI - Wikimedia Deutschland's Jonathan Fraine and Lydia Pintscher show how Wikidata can be used to provide well-cited information and how semantic search can augment generative AI inference. Presented at the Open Source GenAI & ML Summit.
- Wikidata Editing LIVE at Lexico Days 2024 - User:Abbe98 and User:JanAinali are back for another session of live-editing, focused on lexicographical data, during the Lexico Days 2024 event that took place this last weekend, June 28 - June 30.
- Get more out of Wikidata with Resonator - Rachel Hendrick and Gary Price of LibTech Tools walk through Resonator and point out the best ways to use it. Resonator is available on ToolForge.
- Knowledge Integrity: Reliability- Wikidata Vandalism Detection with Graph Linguistic Fusion - Diego (WMF) and Mykola Trokhmovych showcase their work on building a model to help Wikidata editors identify edits that require patrolling, as part of the [Wiki Workshop 2024.
- Inclusion of Communities: Using Wikibase to Leverage Community Sourced Data Initiatives - Erin Yunes talks about their work in using Wikibase Cloud as part of the Compel project (COmputer Music Preservation Electronic Library).
- (es) ¿Cómo fortalecer el dominio público con Wikidata? - This Wikitools Workshop hosted by Jorge Gemetto is on Paulina, a tool for exploring and accessing public domain information on authors and their works.
Tool of the week
- Automatic Structuring of text for Wikidata - User:BrokenSegue introduces their new tool.
- User:Zvpunry/CreateNewItem - This is a User script to easily add a new Item while editing a Statement and noticing that the desired Item is missing.
Other Noteworthy Stuff
- The second iteration of the Wikidata:Open Online Course has begun. Class will continue until August 11. Whether you're a beginner taking your first steps, an individual in need of a refresher on Wikidata concepts, or a seasoned trainer looking to level up your skills - this course is right for you.
Newest properties and property proposals to review
- Newest General datatypes:
- showrunner (person who is responsible for the day-to-day operation of a television show)
- music mood (qualifier carrying an emotion (mood) relevant to a musical audio recording)
- coin edge (image or images that show the edge of a coin)
- ozone depletion potential (relative amount of degradation to the ozone layer relative to CFC-11)
- Newest External identifiers: Locomotive Yaroslavl HC player ID, Orthoptera Species File taxon ID (new), Flown From the Nest person ID, Online Swahili - English Dictionary ID, A Dictionary of Plant Sciences ID, A Dictionary of Zoology ID, A Dictionary of Contemporary Icelandic ID, Seret film ID, Limited Liability Partnership Identification Number, Paleobiology database reference ID, PNG School Code, Téarma ID, IMAIOS entity ID, Naturalis Repository ID, English-Spanish Dictionary ID, Vikidia article ID, thisisbasketball.be player ID, poblesdecatalunya.cat ID, Oqaasersiorfik ID, MNAHA person ID, Greenlandic-English Dictionary ID, Te Aka Māori Dictionary ID, Tropicos person ID, He Pātaka Kupu ID, vehicle keeper marking (VKM), AllGame style ID, FC Metz player ID, MoFo ID, itch.io numeric ID, filmas.lv film ID, filmas.lv person ID, filmas.lv studio ID, Cockroach Species File taxon ID (new), Lygaeoidea Species File taxon ID (new), Phasmida Species File taxon ID (new), Psocodea Species File taxon ID (new), Spanish-English Dictionary ID, Norwegian National Museum producer ID, Burgenwelt ID, Irish-English Dictionary ID, Tesoro della Lingua Italiana delle Origini ID, Tommaseo-Bellini Online ID, danskfodbold.com player ID, DAKA Danish-Greenlandic Dictionary ID, DAKA Greenlandic-Danish Dictionary ID, Canadian Great War Project person ID, English-Irish Dictionary ID, PMC journal ID, Census ID, Douban personage ID, Avibase person ID, Brezhoneg21 ID, European Education Thesaurus ID, Cineuropa distributor ID, Cineuropa production company ID, OpenCitations Meta ID, IGN franchise ID, Federal Reserve Subject Taxonomy ID, Farhang-i forsī ba rusī ID, Devri ID, Cambridge University Press ID, Canadian Virtual War Memorial ID, Personnel Records of the First World War ID, Fowler’s Concise Dictionary ID, NooSFere publisher ID, Plex person key, BHMPI OBJ ID, Index Fungorum person ID, stiga.trefik.cz player ID, UNIBO professor ID, Cineuropa international sales agent ID, Mapes de Patrimoni Cultural ID
- New General datatypes property proposals to review:
- number of local branches (number of branches of this organization at the lowest (local) level)
- KANAL inventory ID (inventory number of a creative work assigned by KANAL)
- Tüik mahalle id (Identifier of neighborhoods <small>({{q|Q17051044}})</small> in Turkey in TÜİK <small>({{q|Q1375058}})</small> database)
- New External identifier property proposals to review: Pocket Oxford-Hachette French Dictionary: English-French ID, Biodiversity Information System for Europe ID, Elonet company ID, Numista issuer ID, Overcast episode ID, Metamath statement label, Pocket Oxford German Dictionary: English-German ID, Pocket Oxford Italian Dictionary: English-Italian ID, Il Nuovo DOP ID, FEI horse ID, Google Play author ID, identifiant d'une personne sur Archelec, Standard Ebooks ID, Lojas com História ID, RGALI person ID, RGALI organization ID, Hebrew Academy term ID, milononline.net entry ID, KANAL identifier, LAGL author ID, Alle Burgen, FC Krasnodar player id, Pocket Oxford Italian Dictionary: Italian-English ID, Pocket Oxford German Dictionary: German-English ID, Pocket Oxford-Hachette French Dictionary: French-English ID, Manhom Arabic Profile ID, GOArt databas, ArchWiki article, Star Wars.com, identifikátor filmu ve Filmové databázi (FDb)
You can comment on all open property proposals!
Did you know?
- Query examples:
- Newest WikiProjects: Inuktitut - This is the space to organize work to assure that the sum of all knowledge and the supporting infrastructure for necessary services are available in Inuktitut (ᐃᓄᒃᑎᑐᑦ, Inuktitut).
- Newest database reports: Merge candidates based on same pattern
- Showcase Items: Montblanc (Q761735) - town in the province of Tarragona, Catalonia
- Showcase Lexemes: gbuɣi (L725113) - Dagbanli verb, translates to "vomiting" and "sprouting"
Development
- EntitySchemas:
- We worked around an issue where EntitySchema pages were no longer considered “content” and had become unsearchable (phab:T368010)
- We prepared for the release of the new datatype on July 2nd.
- mul language code: We are working on the last remaining blocker before rolling out the first stage to Wikidata (phab:T362917)
- Wikibase REST API: We are continuing to rework API errors (phab:T366911, phab:T366239)
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
Weekly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Contribute to the showcase Item and Lexeme above.
- Participate in this week's Lexeme challenge:
- Govdirectory weekly focus country: Argentina
- Summarize your WikiProject's ongoing activities in one or two sentences.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
The wiki is now in read-only mode
[edit]"Failed to save due to an error." and "The wiki is now in read-only mode." pop up. Why? Eurohunter (talk) 05:26, 2 July 2024 (UTC)
- Apparently there were some brief spikes of replication lag around the time you posted that message; when this happens, the wiki may automatically put itself into read-only mode temporarily until the database has caught up again. Lucas Werkmeister (WMDE) (talk) 09:25, 2 July 2024 (UTC)
Why If I add subclass of (P279) with for example history of Berlin (Q679741) then value-requires-statement constraint (Q21510864) pop up? For example, it pop up at history of trams in Berlin (Q1514212) while it not pop up in history of trams in Barcelona (Q11925955). Eurohunter (talk) 06:18, 2 July 2024 (UTC)
- @Eurohunter You have to make sure there is a complete hierarchy of classes. In the example you have given, Q1514212 has class Q679741, but Q679741 needs to have some class too... I suggest Q122131 be added there as P279. Vojtěch Dostál (talk) 13:26, 2 July 2024 (UTC)
- @Vojtěch Dostál: Thanks. Eurohunter (talk) 12:14, 6 July 2024 (UTC)
Implementing Orphanet Data into Wikipedia
[edit]Orphanet is an important reference within wikipedia with over 1000 refs. Recently, they changed their data structure, thus the former Template:Orphaned does no longer work. I got a file with relevant changes I would like to be implemented. Zieger M (talk) 07:38, 2 July 2024 (UTC)
- @Zieger M Hi, can you share the file publicly, so that I (or others) can have a look and decide if we're able to implement the change? Vojtěch Dostál (talk) 07:42, 2 July 2024 (UTC)
- Yes, how can I share it? Zieger M (talk) 07:44, 2 July 2024 (UTC)
- @Zieger M If it is a table file, maybe you can upload somewhere and share a link? Ideally, with properly labelled columns so that we understand what changes to what :-). Vojtěch Dostál (talk) 07:46, 2 July 2024 (UTC)
- "upload somewhere"? Never done, don't know where to. Sorry Zieger M (talk) 07:51, 2 July 2024 (UTC)
- https://www.mediafire.com/file/uimhjnvs9g4uf49/Linkliste+Orphanet_Original.xlsx/file Zieger M (talk) 12:01, 2 July 2024 (UTC)
- @Zieger M Hi, I checked the file and I think I now better understand what you mean. In fact, the change does not have anything to do with Wikidata - you just want to properly format its links to Orphanet. I think that you only need to replace the URL string "https://www.orpha.net/consor/cgi-bin/Disease_Search.php?lng=DE&data_id=" at de:Template:Orphanet with "https://www.orpha.net/en/disease/detail/". Isn't that right? You can do it locally in Dewiki. Vojtěch Dostál (talk) 13:16, 2 July 2024 (UTC)
- https://www.mediafire.com/file/uimhjnvs9g4uf49/Linkliste+Orphanet_Original.xlsx/file Zieger M (talk) 12:01, 2 July 2024 (UTC)
- "upload somewhere"? Never done, don't know where to. Sorry Zieger M (talk) 07:51, 2 July 2024 (UTC)
- @Zieger M If it is a table file, maybe you can upload somewhere and share a link? Ideally, with properly labelled columns so that we understand what changes to what :-). Vojtěch Dostál (talk) 07:46, 2 July 2024 (UTC)
- Yes, how can I share it? Zieger M (talk) 07:44, 2 July 2024 (UTC)
Wikidata Question
[edit]Hi Wikipedia, I have two concerns regarding data for Blic, daily newspaper from Serbia. I have tried entering publication interval and for some reason it does not let me publish it. Also, I have tried editing their social media information and it did not let me. For both of them, it does not let me publish changes. Can you tell me why ? Боки ✉ 18:21, 2 July 2024 (UTC)
- What does it say? Ymblanter (talk) 18:42, 2 July 2024 (UTC)
- @Ymblanter it doesnt say anything.
- Basically, when I try and change it, publish button is blanked so I cant click on it. Боки ✉ 18:50, 2 July 2024 (UTC)
- If you enter say "1 week" in the field for publication interval then the check-mark can't be clicked. Unit goes into a separate field. Infrastruktur (talk) 19:03, 2 July 2024 (UTC)
iodine in medicine (Q28196266) used for many medicine-related topics on iodine, instead of iodine (Q1103) or perhaps another (new) item
[edit]I noticed that this item is linked not only as an antiseptic but also for many other medical topics. Its description only mentioned "antiseptic" and I've added the prevention and treatment of iodine deficiency, based on its page linked from WikiProjectMed. The mistake may arise from the fact that it's disambiguated in the English- (and several other) language Wikipedia(s) as "iodine (medical use)". I think all other medical uses (e.g. radioactive iodine therapy (Q13233408)) should link to either to iodine as an element (iodine (Q1103)), or to a new item created for this purpose, but the antiseptic (and possibly the deficiency-preventing) use shouldn't be conflated with the radioactive or other medical means of using it. Adam78 (talk) 21:08, 2 July 2024 (UTC)
- @Adam78 Is iodine as antiseptic in any way chemically different from the iodine element? If not, all such links should point to iodine (Q1103) and Q28196266 should instead be facet of (P1269) of iodine (Q1103) or something of that sort. A similar example is calcium in biology (Q60097). Vojtěch Dostál (talk) 11:25, 4 July 2024 (UTC)
Railway junctions: Q24045957 vs Q336764
[edit]I'd be grateful if anyone could help me distinguish railway junction (Q24045957) and junction (Q336764) -- both used specifically for railway junctions, and distinct from railroad switch (Q82818) and the more general junction (Q1777515).
There seem to be two different concepts here, at least in German, but I'm not entirely seeing how they should be named in English to express the difference, or whether articles in the various different language wikis are all connected to the correct item.
Which would be most appropriate for a location where one linear ELR railway line section (Q113990375) of track (perhaps 50 km long, double-track) meets another such section? Jheald (talk) 22:10, 2 July 2024 (UTC)
- Notified participants of WikiProject Railways. (I did ask on the talk page there a couple of years ago, but it didn't get any responses.) Jheald (talk) 22:14, 2 July 2024 (UTC)
- I can explain these from the Czech point of view, but the explanation is similar for all countries in the central Europe (Poland, Germany, Slovakia etc.). At thirst railway junction (Q24045957) is very big (hundreds of switches) and junction (Q336764) is very small (sometimes only one switch, but usually not more than four switches). railway junction (Q24045957) express connection of lot of railway lines usually in one town/city. E.g. železniční uzel Praha (Prague junction) consists of all railway station in Prague (Q1085), in which all railway lines leading to this big city are connected. junction (Q336764) is usually a place where one railway line splits into two railway lines and it is not railway station (Q55488), so if the railway lines are with one track, then one switch can be enough. In Czechia and Poland it is also a place on the double track line between two stations, where are 4 switches to go from the left to the right track and vice versa (the same place is Slovakia (till 2000 also in Czechia) is classified as passing loop (Q784159)). But I have no idea how to name these different places in English. When I translate it, I usually use "junction" for both, although they have completely different meanings. --Cmelak770 (talk) 06:54, 3 July 2024 (UTC)
- In Germany we strongly distinguish between free track (Q1302250) which roughly means track which is not part of a railway station (Q55488) and tracks that are part of a railway station (Q55488). junction (Q336764) is a junction, that is not part of a railway station (Q55488). As far as I know, (most?) english speaking countries don't have this concept free track (Q1302250), so this may not be easy to translate. --Trockennasenaffe (talk) 06:09, 4 July 2024 (UTC)
- Asked at en:Wikipedia_talk:WikiProject_UK_Railways whether anyone there can suggest better English-language labels / descriptions Jheald (talk) 12:56, 7 July 2024 (UTC)
Ingest of SEC EDGAR data into Wikidata?
[edit]I have recently noticed that many company infoboxes on Wikipedia are frequently out of date, even though they draw from Wikidata for many values like yearly results. All of this data is available online through the SEC's EDGAR system, at least for publicly traded companies in the US, so I was wondering whether it would be worthwhile to write a bot that would read SEC data and update Wikidata with it?
Botlord (talk) 19:18, 3 July 2024 (UTC)
- @Botlord: that sounds like a great idea - if you are proposing to do it yourself, the general procedure is to write the code, test it on a small number of items, and then ask for bot status approval for a bot account to regularly run it on at Wikidata:Requests for permissions/Bot. If you are hoping somebody else will do it then Wikidata:Bot requests is the place to start. ArthurPSmith (talk) 20:25, 3 July 2024 (UTC)
Conventions for Knowledge Graph aligning
[edit]Dear Wikidata Community,
We're looking to build a Aerospace Engineering Knowledge Graph, and linking (all) entries to wikidata. For some, like Q3319996, that's easy, for others like conceptual modelling not so much. Others, like CPACS, are not even in Wikidata yet, or Wikipedia for that matter. Given that context, I have the following cases and questions:
- If a perfect match exists, no questions.
- If a match exists that does look correct, but seems to be lacking relations, should we populate this entry as we see fit? (assumed answer: yes, see en:WP:BOLD)
- If a match exists that does look somewhat corect, but does not have the right type, should we split it into two different entities?
- e.g. Q377960 not being a Q3249551, but an Q166142 - should we create a new process instance with the same label?
- what about instances such as Q2623243, which specifically lists conceptual model (an object) and conceptual modelling (a process)? Does the existence of this entry mean differentiation is not desired?
- If no match exists, I assume we should create one. I've taken a look at Wikidata:Notability:
- "It refers to an instance of a clearly identifiable conceptual or material entity that can be described using serious and publicly available references."
- All instances would fall under this category, since all are derived from a systematic literature review and we can link to the respective papers where they are discussed.
- All our instances would be instances of Q10843872, Q7397, Q235557 or similar. Examples: https://github.com/DLR-SC/tixi, https://dlr-sl.github.io/cpacs-website/
- "It refers to an instance of a clearly identifiable conceptual or material entity that can be described using serious and publicly available references."
Furthermore, I have some SPARQL / Database questions, which I'll add to a separate topic to not overflow this one.
Thanks, TimBorgNetzWerk (talk) 11:00, 4 July 2024 (UTC)
API / Pyton / SPARQL access questions
[edit]Hi everyone,
please see Wikidata:Project chat#Conventions for Knowledge Graph aligning for context.
TL;DR, we're looking to check if a wikidata instance exists for ~500 entries we have in our database. We also don't want to overburden the Wikidata API, hence:
What can we do to most efficiently query the wikidata database?
What currently do is:
query = f"""
SELECT ?item ?itemLabel (GROUP_CONCAT(DISTINCT ?altLabel; separator = ", ") AS ?altLabels)
(SAMPLE(?description) AS ?description) WHERE {{
{selection[select]}
OPTIONAL {{?item skos:altLabel ?altLabel FILTER(LANG(?altLabel) = "en")}}
OPTIONAL {{?item schema:description ?description FILTER(LANG(?description) = "en")}}
SERVICE wikibase:label {{bd:serviceParam wikibase:language "en".}}
}}
GROUP BY ?item ?itemLabel
LIMIT {limit}
"""
, wherin we limit the results to 20 at most, and select based on:
selection = {
'label' : f'?item rdfs:label "{label}"@en.',
'altLabel' : f'?item skos:altLabel "{label}"@en.'
}
Then, per label, we check if:
- entries with that label are available (e.g. "STEP file" to Q3509055
- if these entries do not sum up to our limit (20), then we also check if entries with that label as altLabel exist (e.g. ".stp" to Q3509055),
- if these entries do not sum up to our limit (20) then we try 1. and 2. again with (if != label):
- label.lower(), so "STEP" -> "step",
- label.capitalize(), so "STEP" -> "Step",
- label.upper(), so "STEP" -> "STEP" -> not done, since == label
Then we store all queries and results so we run no query twice, and can just check our local "copy" for the result.
Given all this, our Question:
- Is there a better way?
Better as in "easier on wikidata / time" as well as "better results", since currently we have about 40% match rate. Likely, many ouf our instances do, in fact, have no match, but others (like Q2117885 "Systems Modeling Language" or "SysML") are currently just not catched. We have seen advise to run some preprocessing on the labels, to lower all wikidata labels in a filter, but that seemed unfathomably taxing on all parties involved.
There is also the general advice to use a data dump. We have checked Wikidata:Database download and https://dumps.wikimedia.org/wikidatawiki/entities/, and not found a dump that contains all labels AND is relatively small. The lexemes do not seem to contain all labels, presumably only Q111352 instances. All the aformentioned entries, e.g. .p21 and .stp, are not mentioned therein.
I really appreciate your help, and am open to suggestions, improvements, hints or anything, really :)
Best, TimBorgNetzWerk (talk) 11:30, 4 July 2024 (UTC)
- Have you considered using a tool like OpenRefine to help reconcile your data with Wikidata's? M2Ys4U (talk) 16:26, 4 July 2024 (UTC)
- Haven't heard about it yet (I think), will be looking into it, thanks! TimBorgNetzWerk (talk) 09:50, 5 July 2024 (UTC)
- OpenRefine is nice if you intend to import data into Wikidata. Last time I checked the reconciliation it uses yielded less than ideal results. Is this a publicly available graph? If your graph had it's own identifier registered on Wikidata you could use Mix'n'match to do a preliminary matching of the dataset and then let you verify each match manually. Asking for a new identifier can be done at WD:PP.
- In any case freetext search may be what WDQS is worst at. Unsurprisingly the built-in search does a much better job, see [1] for Wikidata specific functionality. You won't tax the API as long as you make calls sequentially and support maxlag. There are libraries available that makes this easier. Infrastruktur (talk) 16:36, 5 July 2024 (UTC)
- Haven't heard about it yet (I think), will be looking into it, thanks! TimBorgNetzWerk (talk) 09:50, 5 July 2024 (UTC)
Very widely used property no longer works
[edit]See Property talk:P5380#No longer works BhamBoi (talk) 22:25, 4 July 2024 (UTC)
We need to put an end to this
[edit]For months, items like
and likely more others have been target of constant edit warring, having English and Russian description changed back and forth by various IP addresses and few-edits-accounts. Could anyone have a look, say what is going on and suggest how administrators should deal with it? --Matěj Suchánek (talk) 08:57, 5 July 2024 (UTC)
- Chechen-Ingush wars. All items should be protected at a random version. May be we should block the warriors as well. Ymblanter (talk) 19:41, 5 July 2024 (UTC)
- Though may be things like this would help before protection, but then I need to go manually through the list. I can do it, but very slowly. Ymblanter (talk) 19:44, 5 July 2024 (UTC)
- I see no good reason to protect them to be only edited by admins. Semiprotections should be good enough. ChristianKl ❪✉❫ 21:09, 6 July 2024 (UTC)
"agency" property?
[edit]I'm getting "{{cite journal}}: |author= has generic name (help)" from:
- CNN Newsource (24 February 2021). "Urban League of Greater Kansas City unveils social justice bus". KMIZ. Wikidata Q126365824.
in Wikipedia:Gwendolyn Grant (activist)#References.
In a section on "work with template:Cite Q?" on the talk page associated with Wikipedia:Template:Sfn, Wikipedia:User:ActivelyDisinterested said, "CNN News Source is not a valid author name ... . The correct field in this case would be |agency= but [that is not] supported by Wikidata / Cite Q." I've experimented with assigning "CNN Newsource" to different properties, so far without finding one that makes this complaint disappear.
Can someone help me find a property to which to assign "CNN Newsource" (Q5013147) so this complaint in Wikipedia disappears? Thanks, DavidMCEddy (talk) 00:15, 6 July 2024 (UTC)
Jon DeVries
[edit]Q104346704 (duplicate: Q111549344) RIMOLA (talk) 09:54, 6 July 2024 (UTC)
- Merged Done
I think that this discussion is resolved and can be archived. If you disagree, don't hesitate to replace this template with your comment. RVA2869 (talk) 11:54, 6 July 2024 (UTC) |
constraint on instance or subclass of
[edit]ISFDB award ID (P11395) has constraint
subject type constraint: class - type of award relation - instance or subclass of
So why is Ditmar Award (Q906455) which is a subclass of (P279) of science fiction award (Q107581015), an instance of (P31) of type of award (Q107467117) OK
While William Atheling Jr. Award (Q8004646) which is an instance of (P31) of literary award (Q378427), an instance of (P31) of type of award (Q107467117) reports a violation? Vicarage (talk) 14:07, 7 July 2024 (UTC)
Removing unreferenced religions and ethnicities
[edit]@Nikkimaria: Was a decision made to remove all unreferenced religions and ethnicities at some point? If so I missed that discussion. If the decision was made they should be deleted by a bot, not one-by-one by any individual. Doing it that way will lead to selection bias. I noticed some disappearing and traced the deletions to Special:Contributions/Nikkimaria RAN (talk) 16:21, 7 July 2024 (UTC)
- These are mostly reverts of a particular problematic IP editor who pops up periodically in Special:AbuseFilter/95. If there is a preference to revert such edits by bot I have no objection. Nikkimaria (talk) 16:37, 7 July 2024 (UTC)