Shortcuts: WD:PC, WD:CHAT, WD:?

Wikidata:Project chat

Wikidata project chat
A place to discuss any and all aspects of Wikidata: the project itself, policy and proposals, individual data items, technical issues, etc.

Please use {{Q}} or {{P}} the first time you mention an item or property, respectively.
Other places to find help

For realtime chat rooms about Wikidata, see Wikidata:IRC.
On this page, old discussions are archived after 7 days. An overview of all archives can be found at this page's archive index. The current archive is located at 2024/07.

750 V DC conductor

I have temporarily added Q21855034 (600 Volt), as the closest match to a '750 V DC conductor' electrification, as the P930 value on Q918235. I hesitate to create a new electrification item, as there are several types of third rail (upper, side and underneath contact), and maybe the type of electric contact is a separate property from P930. Having a third rail shoe is quite an essential part of a train type. Smiley.toerist (talk) 08:20, 29 June 2024 (UTC)

Q25857994 can be used; the differences between 600 V DC railway electrification (Q21855034) and 750 V DC railway electrification (Q25857994) are the use of direct current (Q159241) in Q21855034 but not Q25857994 (not significant as both are DC railway electrification (Q11581821)) and the voltage; they don't specify the type of contact or whether it is third rail or overhead. Q838484#P930 has third rail as a qualifier. Peter James (talk) 13:44, 29 June 2024 (UTC)[reply]
Thanks, Smiley.toerist (talk) 09:42, 30 June 2024 (UTC)[reply]

Named after

We have the property "named after", which can have the value John Smith. Is there a property to use at John Smith to show "things named after this person"? RAN (talk) 16:31, 29 June 2024 (UTC)

If such a property existed, you would see it listed at named after (P138). We generally avoid inverse properties, and I can't think of a good reason to have the property "things named after this person". ChristianKl 16:47, 29 June 2024 (UTC)
Not sure if this would be useful for you, but there is a gadget you can enable in preferences called "relateditems", which "Adds a button to the bottom of item pages to display inverse statements." It would show all things named after the person, as well as any other properties that link to the item. Piecesofuk (talk) 17:01, 29 June 2024 (UTC)
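For anyone who prefers a query to the gadget, the same inverse lookup can be sketched for the Wikidata Query Service. This is only a minimal sketch: the helper name `named_after_query` is made up for illustration, it assumes the standard WDQS prefixes (`wd:`, `wdt:`), and `Q42` is used purely as a placeholder item ID, not as John Smith's item.

```python
import re


def named_after_query(person_qid: str, limit: int = 100) -> str:
    """Build a SPARQL query listing items whose named after (P138) value is person_qid."""
    # Reject anything that is not a plain item ID before putting it into the query text.
    if not re.fullmatch(r"Q[1-9]\d*", person_qid):
        raise ValueError(f"not a valid item ID: {person_qid!r}")
    return (
        "SELECT ?item ?itemLabel WHERE {\n"
        f"  ?item wdt:P138 wd:{person_qid} .\n"
        '  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }\n'
        "}\n"
        f"LIMIT {limit}"
    )


# Placeholder ID for illustration only; the printed text can be pasted into query.wikidata.org.
print(named_after_query("Q42"))
```

Because the query reads the existing P138 statements in reverse, no separate inverse property needs to be stored on the person's item.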

Merging Q117208646 (exercise & fitness product) into Q352222 (exercise equipment)?

The former seems to be generated from Google's product taxonomy, but overall seems to refer to the same concept. The subgraphs of both terms are overlapping but not identical, so perhaps a clean-up would be welcome. Any thoughts? Alcinos (talk) 22:34, 29 June 2024 (UTC)[reply]

Are there any fitness products that are not exercise equipment? Trade (talk) 13:05, 1 July 2024 (UTC)
Fitbits and other activity tracker (Q16001686) perhaps. Not my area, but all of https://www.wikidata.org/wiki/Special:WhatLinksHere/Q117208646 look like they could fit in the other category Vicarage (talk) 13:36, 1 July 2024 (UTC)[reply]
Activity trackers are a good example, one could argue that they are indeed fitness products but not really exercise equipment (although the link to either concept is currently missing in the page you linked). Other elements that could be in the same case: fitness app (Q25104632), smart scale (Q116454756), perhaps also massage gun (Q110997596).
In the light of this, here is a refined proposal:
- rename exercise & fitness product (Q117208646) to "fitness product", and make exercise equipment (Q352222) a subclass of it
- move all current sub-classes of exercise & fitness product (Q117208646) to be sub-classes of exercise equipment (Q352222) (as Vicarage noted, all of them currently seem to be appropriate sub-classes)
- add links for the remaining "fitness products" that are not "exercise equipment", such as activity tracker and the others listed above
How does that sound? Alcinos (talk) 15:51, 1 July 2024 (UTC)[reply]
Other proposed "fitness products" that are not "exercise equipment":
- heart rate monitor (Q925303) although that one is currently listed as an exercise equipment, not sure if I agree
- yoga pants (Q8054336) (perhaps it would be nice to have a "workout clothes" class? I can't seem to find one currently) Alcinos (talk) 15:58, 1 July 2024 (UTC)[reply]
sportswear (Q645292) includes exercise in description Vicarage (talk) 16:37, 1 July 2024 (UTC)[reply]
It may be too broad to be a subclass of "fitness product". E.g. sports jersey (Q2623418) is sportswear but likely wouldn't be a good (indirect) subclass of "fitness product". Alcinos (talk) 17:09, 1 July 2024 (UTC)

Bogus disease English aliases prefixed with "obsolete" - cleanup needed

A large number (thousands?) of pages for diseases and classes of diseases currently have bogus aliases in English "obsolete X", where X is usually the main English label. For example, hemophilia (Q134003) has the alias "obsolete hemophilia". Likewise, rinderpest (Q157008) has alias "obsolete rinderpest" (though in a sense it actually is obsolete!). Some have variations, e.g. chronic pancreatitis (Q1996053) has alias "obsolete relapsing pancreatitis".

These seem to have been added by a bot trying to import an external taxonomy in 2020. Example of a bad revision: https://www.wikidata.org/w/index.php?title=Q194435&oldid=1313119769

How should these be cleaned up? Can a bulk query be used to find them all?

73.223.72.200 05:00, 1 July 2024 (UTC)[reply]
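One possible shape for such a bulk query is sketched below. It is not a tested cleanup procedure: the helper names are hypothetical, `Q12136` is assumed to be the item for "disease", and matching only direct instance of / subclass of statements will miss items deeper in the hierarchy (a transitive path may time out on the query service). The second helper flags only aliases that are exactly "obsolete " plus the main label; variations such as "obsolete relapsing pancreatitis" would still need manual review.

```python
def obsolete_alias_query(class_qid: str = "Q12136", limit: int = 500) -> str:
    """SPARQL for items of a class that have an English alias starting with 'obsolete '."""
    # class_qid defaults to Q12136, assumed here to be "disease".
    return (
        "SELECT ?item ?itemLabel ?alias WHERE {\n"
        f"  ?item wdt:P31|wdt:P279 wd:{class_qid} ;\n"
        "        skos:altLabel ?alias .\n"
        '  FILTER(LANG(?alias) = "en" && STRSTARTS(LCASE(STR(?alias)), "obsolete "))\n'
        '  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }\n'
        "}\n"
        f"LIMIT {limit}"
    )


def bogus_aliases(label: str, aliases: list[str]) -> list[str]:
    """Return the aliases that are exactly 'obsolete ' + the main label, ignoring case."""
    target = f"obsolete {label}".casefold()
    return [alias for alias in aliases if alias.casefold() == target]


print(obsolete_alias_query())
print(bogus_aliases("hemophilia", ["obsolete hemophilia", "haemophilia"]))
```

Splitting the work this way keeps the query broad (anything starting with "obsolete ") while automatic removal would stay limited to the unambiguous case.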

Wikidata weekly summary #634

The wiki is now in read-only mode

"Failed to save due to an error." and "The wiki is now in read-only mode." pop up. Why? Eurohunter (talk) 05:26, 2 July 2024 (UTC)[reply]

Apparently there were some brief spikes of replication lag around the time you posted that message; when this happens, the wiki may automatically put itself into read-only mode temporarily until the database has caught up again. Lucas Werkmeister (WMDE) (talk) 09:25, 2 July 2024 (UTC)[reply]

Why does value-requires-statement constraint (Q21510864) pop up if I add subclass of (P279) with, for example, history of Berlin (Q679741)? For example, it pops up at history of trams in Berlin (Q1514212), while it does not pop up at history of trams in Barcelona (Q11925955). Eurohunter (talk) 06:18, 2 July 2024 (UTC)

@Eurohunter You have to make sure there is a complete hierarchy of classes. In the example you have given, Q1514212 has class Q679741, but Q679741 needs to have some class too... I suggest Q122131 be added there as P279. Vojtěch Dostál (talk) 13:26, 2 July 2024 (UTC)[reply]
@Vojtěch Dostál: Thanks. Eurohunter (talk) 12:14, 6 July 2024 (UTC)[reply]

Implementing Orphanet Data into Wikipedia

Orphanet is an important reference within Wikipedia, with over 1000 refs. Recently, they changed their data structure, so the former Template:Orphaned no longer works. I have a file with the relevant changes that I would like to see implemented. Zieger M (talk) 07:38, 2 July 2024 (UTC)

@Zieger M Hi, can you share the file publicly, so that I (or others) can have a look and decide if we're able to implement the change? Vojtěch Dostál (talk) 07:42, 2 July 2024 (UTC)[reply]
Yes, how can I share it? Zieger M (talk) 07:44, 2 July 2024 (UTC)[reply]
@Zieger M If it is a table file, maybe you can upload somewhere and share a link? Ideally, with properly labelled columns so that we understand what changes to what :-). Vojtěch Dostál (talk) 07:46, 2 July 2024 (UTC)[reply]
"upload somewhere"? Never done, don't know where to. Sorry Zieger M (talk) 07:51, 2 July 2024 (UTC)[reply]
https://www.mediafire.com/file/uimhjnvs9g4uf49/Linkliste+Orphanet_Original.xlsx/file Zieger M (talk) 12:01, 2 July 2024 (UTC)[reply]
@Zieger M Hi, I checked the file and I think I now better understand what you mean. In fact, the change does not have anything to do with Wikidata - you just want to properly format its links to Orphanet. I think that you only need to replace the URL string "https://www.orpha.net/consor/cgi-bin/Disease_Search.php?lng=DE&data_id=" at de:Template:Orphanet with "https://www.orpha.net/en/disease/detail/". Isn't that right? You can do it locally in Dewiki. Vojtěch Dostál (talk) 13:16, 2 July 2024 (UTC)[reply]
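The prefix swap suggested above amounts to a simple string replacement, which could be sketched as follows. The two prefixes are the ones quoted in the comment; the function name is hypothetical, and the sketch assumes that the identifier following the old prefix is still valid under the new URL scheme, which the thread does not confirm.

```python
OLD_PREFIX = "https://www.orpha.net/consor/cgi-bin/Disease_Search.php?lng=DE&data_id="
NEW_PREFIX = "https://www.orpha.net/en/disease/detail/"


def rewrite_orphanet_url(url: str) -> str:
    """Swap the old Orphanet URL prefix for the new one; leave other URLs untouched."""
    if url.startswith(OLD_PREFIX):
        # Assumption: the trailing identifier carries over unchanged.
        return NEW_PREFIX + url[len(OLD_PREFIX):]
    return url


# "123" is a made-up identifier used only to show the transformation.
print(rewrite_orphanet_url(OLD_PREFIX + "123"))
```

If the spreadsheet shows that identifiers changed as well, a lookup table from old to new identifiers would be needed instead of a plain prefix swap.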

Wikidata Question

Hi Wikipedia, I have two concerns regarding the data for Blic, a daily newspaper from Serbia. I have tried entering the publication interval, and for some reason it does not let me publish it. I have also tried editing their social media information, and it did not let me. In both cases, it does not let me publish the changes. Can you tell me why? Боки 18:21, 2 July 2024 (UTC)

What does it say? Ymblanter (talk) 18:42, 2 July 2024 (UTC)[reply]
@Ymblanter it doesn't say anything.
Basically, when I try to change it, the publish button is blanked out so I can't click on it. Боки 18:50, 2 July 2024 (UTC)
If you enter say "1 week" in the field for publication interval then the check-mark can't be clicked. Unit goes into a separate field. Infrastruktur (talk) 19:03, 2 July 2024 (UTC)[reply]

I noticed that this item is linked not only as an antiseptic but also for many other medical topics. Its description only mentioned "antiseptic", and I've added the prevention and treatment of iodine deficiency, based on its page linked from WikiProjectMed. The mistake may arise from the fact that it's disambiguated in the English- (and several other) language Wikipedia(s) as "iodine (medical use)". I think all other medical uses (e.g. radioactive iodine therapy (Q13233408)) should link either to iodine as an element (iodine (Q1103)) or to a new item created for this purpose, but the antiseptic (and possibly the deficiency-preventing) use shouldn't be conflated with the radioactive or other medical means of using it. Adam78 (talk) 21:08, 2 July 2024 (UTC)

@Adam78 Is iodine as antiseptic in any way chemically different from the iodine element? If not, all such links should point to iodine (Q1103) and Q28196266 should instead be facet of (P1269) of iodine (Q1103) or something of that sort. A similar example is calcium in biology (Q60097). Vojtěch Dostál (talk) 11:25, 4 July 2024 (UTC)[reply]

Railway junctions: Q24045957 vs Q336764

I'd be grateful if anyone could help me distinguish railway junction (Q24045957) and junction (Q336764) -- both used specifically for railway junctions, and distinct from railroad switch (Q82818) and the more general junction (Q1777515).

There seem to be two different concepts here, at least in German, but I'm not entirely seeing how they should be named in English to express the difference, or whether articles in the various different language wikis are all connected to the correct item.

Which would be most appropriate for a location where one linear ELR railway line section (Q113990375) of track (perhaps 50 km long, double-track) meets another such section? Jheald (talk) 22:10, 2 July 2024 (UTC)[reply]

Notified participants of WikiProject Railways. (I did ask on the talk page there a couple of years ago, but it didn't get any responses.) Jheald (talk) 22:14, 2 July 2024 (UTC)
I can explain these from the Czech point of view, but the explanation is similar for all countries in Central Europe (Poland, Germany, Slovakia etc.). First, railway junction (Q24045957) is very big (hundreds of switches), while junction (Q336764) is very small (sometimes only one switch, and usually no more than four). railway junction (Q24045957) expresses the connection of many railway lines, usually in one town or city. E.g. železniční uzel Praha (Prague junction) consists of all the railway stations in Prague (Q1085), where all the railway lines leading to this big city are connected. junction (Q336764) is usually a place where one railway line splits into two and which is not a railway station (Q55488), so if the lines are single-track, one switch can be enough. In Czechia and Poland it is also a place on a double-track line between two stations where there are four switches for crossing from the left track to the right and vice versa (the same kind of place is classified as passing loop (Q784159) in Slovakia, and was in Czechia until 2000). But I have no idea how to name these different places in English. When I translate, I usually use "junction" for both, although they have completely different meanings. --Cmelak770 (talk) 06:54, 3 July 2024 (UTC)
In Germany we strongly distinguish between free track (Q1302250), which roughly means track that is not part of a railway station (Q55488), and tracks that are part of a railway station (Q55488). junction (Q336764) is a junction that is not part of a railway station (Q55488). As far as I know, (most?) English-speaking countries don't have this concept of free track (Q1302250), so this may not be easy to translate. --Trockennasenaffe (talk) 06:09, 4 July 2024 (UTC)
Asked at en:Wikipedia_talk:WikiProject_UK_Railways whether anyone there can suggest better English-language labels / descriptions Jheald (talk) 12:56, 7 July 2024 (UTC) [reply]

Ingest of SEC EDGAR data into Wikidata?

I have recently noticed that many company infoboxes on Wikipedia are frequently out of date, even though they draw from Wikidata for many values like yearly results. All of this data is available online through the SEC's EDGAR system, at least for publicly traded companies in the US, so I was wondering whether it would be worthwhile to write a bot that would read SEC data and update Wikidata with it?

Botlord (talk) 19:18, 3 July 2024 (UTC)[reply]
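As a rough idea of what the read side of such a bot might look like, the sketch below only builds a request for one company's XBRL facts. It assumes the `data.sec.gov` "companyfacts" JSON endpoint with a zero-padded ten-digit CIK and the SEC's requirement of a descriptive User-Agent header; the function names, the example CIK and the contact string are illustrative, and nothing here writes to Wikidata.

```python
import urllib.request


def companyfacts_url(cik: int) -> str:
    """URL of the EDGAR company-facts JSON for a Central Index Key (zero-padded to 10 digits)."""
    return f"https://data.sec.gov/api/xbrl/companyfacts/CIK{cik:010d}.json"


def build_request(cik: int, contact: str) -> urllib.request.Request:
    """Prepare (but do not send) a request that identifies the bot operator to the SEC."""
    return urllib.request.Request(companyfacts_url(cik), headers={"User-Agent": contact})


# Example values only; replace the CIK and contact details with real ones before sending.
request = build_request(320193, "ExampleBot/0.1 (operator@example.org)")
print(request.full_url)
```

Mapping the returned figures onto Wikidata statements, with references and point-in-time qualifiers, would be the larger part of the work and is not sketched here.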

Conventions for Knowledge Graph aligning

Dear Wikidata Community,

We're looking to build an Aerospace Engineering Knowledge Graph and to link (all) entries to Wikidata. For some, like Q3319996, that's easy; for others, like conceptual modelling, not so much. Others, like CPACS, are not even in Wikidata yet, or in Wikipedia for that matter. Given that context, I have the following cases and questions:

  1. If a perfect match exists, no questions.
  2. If a match exists that does look correct, but seems to be lacking relations, should we populate this entry as we see fit? (assumed answer: yes, see en:WP:BOLD)
  3. If a match exists that does look somewhat correct, but does not have the right type, should we split it into two different entities?
    1. e.g. Q377960 not being a Q3249551, but an Q166142 - should we create a new process instance with the same label?
    2. what about instances such as Q2623243, which specifically lists conceptual model (an object) and conceptual modelling (a process)? Does the existence of this entry mean differentiation is not desired?
  4. If no match exists, I assume we should create one. I've taken a look at Wikidata:Notability:
    1. "It refers to an instance of a clearly identifiable conceptual or material entity that can be described using serious and publicly available references."
      1. All instances would fall under this category, since all are derived from a systematic literature review and we can link to the respective papers where they are discussed.
    2. All our instances would be instances of Q10843872, Q7397, Q235557 or similar. Examples: https://github.com/DLR-SC/tixi, https://dlr-sl.github.io/cpacs-website/

Furthermore, I have some SPARQL / Database questions, which I'll add to a separate topic to not overflow this one.

Thanks, TimBorgNetzWerk (talk) 11:00, 4 July 2024 (UTC)[reply]

API / Python / SPARQL access questions

Hi everyone,

please see Wikidata:Project chat#Conventions for Knowledge Graph aligning for context.


TL;DR, we're looking to check whether a Wikidata item exists for each of the ~500 entries we have in our database. We also don't want to overburden the Wikidata API, hence:

What can we do to query Wikidata most efficiently?


What we currently do is:

# Build the lookup query. Doubled braces ({{ }}) are literal braces inside the f-string.
# `selection`, `select` and `limit` must already be defined when this line runs.
query = f"""
SELECT ?item ?itemLabel
       (GROUP_CONCAT(DISTINCT ?altLabel; separator = ", ") AS ?altLabels)
       (SAMPLE(?description) AS ?description) WHERE {{
  {selection[select]}
  OPTIONAL {{ ?item skos:altLabel ?altLabel FILTER(LANG(?altLabel) = "en") }}
  OPTIONAL {{ ?item schema:description ?description FILTER(LANG(?description) = "en") }}
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en". }}
}}
GROUP BY ?item ?itemLabel
LIMIT {limit}
"""

, wherein we limit the results to 20 at most, and select based on:

selection = {
    'label' : f'?item rdfs:label "{label}"@en.',
    'altLabel' : f'?item skos:altLabel "{label}"@en.'
}

Then, per label, we check if:

  1. entries with that label are available (e.g. "STEP file" to Q3509055),
  2. if these entries do not sum up to our limit (20), then we also check if entries with that label as altLabel exist (e.g. ".stp" to Q3509055),
  3. if these entries do not sum up to our limit (20) then we try 1. and 2. again with (if != label):
    1. label.lower(), so "STEP" -> "step",
    2. label.capitalize(), so "STEP" -> "Step",
    3. label.upper(), so "STEP" -> "STEP" -> not done, since == label


Then we store all queries and results so we run no query twice, and can just check our local "copy" for the result.
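For readers following along, the procedure described above can be sketched as below. This is a reconstruction from the description, not the poster's actual code: `label_variants`, `lookup` and the `run_query` callable (which stands in for whatever sends the SPARQL query and returns a list of item IDs) are hypothetical names.

```python
def label_variants(label: str) -> list[str]:
    """Original label first, then the lower/capitalize/upper forms that differ from it."""
    variants = [label]
    for candidate in (label.lower(), label.capitalize(), label.upper()):
        if candidate not in variants:
            variants.append(candidate)
    return variants


def lookup(label, run_query, cache, limit=20):
    """Try each variant as label, then as altLabel, until `limit` results are collected."""
    results = []
    for variant in label_variants(label):
        for select in ("label", "altLabel"):
            key = (select, variant)
            if key not in cache:
                # Only hit the endpoint for combinations that were never queried before.
                cache[key] = run_query(select, variant, limit)
            for item in cache[key]:
                if item not in results:
                    results.append(item)
            if len(results) >= limit:
                return results[:limit]
    return results


print(label_variants("STEP"))
```

Keying the cache on the pair of selection mode and variant means that a repeated label, or two labels sharing a case variant, costs no additional request.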


Given all this, our Question:

  1. Is there a better way?

Better as in "easier on Wikidata / time" as well as "better results", since we currently have a match rate of about 40%. Likely, many of our instances do, in fact, have no match, but others (like Q2117885 "Systems Modeling Language" or "SysML") are currently just not caught. We have seen advice to preprocess the labels by lowercasing all Wikidata labels in a filter, but that seemed unfathomably taxing on all parties involved.

There is also the general advice to use a data dump. We have checked Wikidata:Database download and https://dumps.wikimedia.org/wikidatawiki/entities/, and have not found a dump that contains all labels AND is relatively small. The lexemes do not seem to contain all labels, presumably only Q111352 instances. None of the aforementioned entries, e.g. .p21 and .stp, are mentioned therein.


I really appreciate your help, and am open to suggestions, improvements, hints or anything, really :)


Best, TimBorgNetzWerk (talk) 11:30, 4 July 2024 (UTC)[reply]

Have you considered using a tool like OpenRefine to help reconcile your data with Wikidata's? M2Ys4U (talk) 16:26, 4 July 2024 (UTC)[reply]
Haven't heard about it yet (I think), will be looking into it, thanks! TimBorgNetzWerk (talk) 09:50, 5 July 2024 (UTC)[reply]
OpenRefine is nice if you intend to import data into Wikidata. Last time I checked, the reconciliation it uses yielded less than ideal results. Is this a publicly available graph? If your graph had its own identifier registered on Wikidata, you could use Mix'n'match to do a preliminary matching of the dataset and then verify each match manually. Asking for a new identifier can be done at WD:PP.
In any case, free-text search may be what WDQS is worst at. Unsurprisingly, the built-in search does a much better job; see [1] for Wikidata-specific functionality. You won't tax the API as long as you make calls sequentially and support maxlag. There are libraries available that make this easier. Infrastruktur (talk) 16:36, 5 July 2024 (UTC)
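One reading of "the built-in search" is the `wbsearchentities` module of the MediaWiki API, which matches labels and aliases; the thread does not name a specific endpoint, so treat that choice as an assumption. The sketch below only builds the request URL, including the `maxlag` parameter mentioned above; the helper name and default values are illustrative.

```python
from urllib.parse import urlencode

API = "https://www.wikidata.org/w/api.php"


def search_url(term: str, language: str = "en", limit: int = 20, maxlag: int = 5) -> str:
    """Build a wbsearchentities request URL for items matching `term`."""
    params = {
        "action": "wbsearchentities",
        "search": term,
        "language": language,
        "type": "item",
        "limit": limit,
        "maxlag": maxlag,  # asks the server to refuse the request when replication lag is high
        "format": "json",
    }
    return f"{API}?{urlencode(params)}"


print(search_url("STEP file"))
```

Sending these requests one after another, and waiting before retrying whenever the server reports a maxlag error, follows the advice given in the comment above.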

Very widely used property no longer works

See Property talk:P5380#No longer works BhamBoi (talk) 22:25, 4 July 2024 (UTC)[reply]

We need to put an end to this

For months, items like

and likely others have been the target of constant edit warring, with English and Russian descriptions changed back and forth by various IP addresses and accounts with few edits. Could anyone have a look, say what is going on and suggest how administrators should deal with it? --Matěj Suchánek (talk) 08:57, 5 July 2024 (UTC)

Chechen-Ingush wars. All items should be protected at a random version. Maybe we should block the warriors as well. Ymblanter (talk) 19:41, 5 July 2024 (UTC)
Though maybe things like this would help before protection, but then I need to go manually through the list. I can do it, but very slowly. Ymblanter (talk) 19:44, 5 July 2024 (UTC)
I see no good reason to protect them so that only admins can edit them. Semi-protection should be good enough. ChristianKl 21:09, 6 July 2024 (UTC)

"agency" property?

I'm getting "{{cite journal}}: |author= has generic name (help)" from:

  • CNN Newsource (24 February 2021). "Urban League of Greater Kansas City unveils social justice bus". KMIZ. Wikidata Q126365824.

in Wikipedia:Gwendolyn Grant (activist)#References.

In a section on "work with template:Cite Q?" on the talk page associated with Wikipedia:Template:Sfn, Wikipedia:User:ActivelyDisinterested said, "CNN News Source is not a valid author name ... . The correct field in this case would be |agency= but [that is not] supported by Wikidata / Cite Q." I've experimented with assigning "CNN Newsource" to different properties, so far without finding one that makes this complaint disappear.

Can someone help me find a property to which to assign "CNN Newsource" (Q5013147) so this complaint in Wikipedia disappears? Thanks, DavidMCEddy (talk) 00:15, 6 July 2024 (UTC)[reply]

Jon DeVries

Q104346704 (duplicate: Q111549344) RIMOLA (talk) 09:54, 6 July 2024 (UTC)[reply]

Merged. Done.
I think that this discussion is resolved and can be archived. If you disagree, don't hesitate to replace this template with your comment. RVA2869 (talk) 11:54, 6 July 2024 (UTC)[reply]

constraint on instance or subclass of

ISFDB award ID (P11395) has constraint

subject type constraint:
class - type of award 
relation - instance or subclass of

So why is Ditmar Award (Q906455), which is a subclass of (P279) science fiction award (Q107581015), itself an instance of (P31) type of award (Q107467117), OK,

while William Atheling Jr. Award (Q8004646), which is an instance of (P31) literary award (Q378427), itself an instance of (P31) type of award (Q107467117), reports a violation? Vicarage (talk) 14:07, 7 July 2024 (UTC)

Removing unreferenced religions and ethnicities

@Nikkimaria: Was a decision made to remove all unreferenced religions and ethnicities at some point? If so I missed that discussion. If the decision was made they should be deleted by a bot, not one-by-one by any individual. Doing it that way will lead to selection bias. I noticed some disappearing and traced the deletions to Special:Contributions/Nikkimaria RAN (talk) 16:21, 7 July 2024 (UTC)[reply]

These are mostly reverts of a particular problematic IP editor who pops up periodically in Special:AbuseFilter/95. If there is a preference to revert such edits by bot I have no objection. Nikkimaria (talk) 16:37, 7 July 2024 (UTC)[reply]