<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://librarybase.org/w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Harej</id>
	<title>Librarybase - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://librarybase.org/w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Harej"/>
	<link rel="alternate" type="text/html" href="https://librarybase.org/wiki/Special:Contributions/Harej"/>
	<updated>2026-05-25T04:01:11Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.39.1</generator>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/WCD&amp;diff=248120</id>
		<title>Project:Analytics/WCD</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/WCD&amp;diff=248120"/>
		<updated>2026-05-20T19:42:16Z</updated>

		<summary type="html">&lt;p&gt;Harej: Removing notes from abandoned build attempts&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Introduction ==&lt;br /&gt;
&#039;&#039;&#039;Wikipedia Citations Database&#039;&#039;&#039; is an effort supported by Internet Archive to build a comprehensive, historical database of each citation to appear on Wikipedia.&lt;br /&gt;
&lt;br /&gt;
While methodologies of similar projects have focused on extracting standardized identifiers or easy-to-extract references, this project instead seeks to build a &#039;&#039;complete&#039;&#039; database, based on analyzing the structure.&lt;br /&gt;
&lt;br /&gt;
== Generations ==&lt;br /&gt;
&lt;br /&gt;
* WCD Generation 1: Wikibase instance on Wikibase Cloud. Importing data was painfully slow, and it was decided Wikibase was not a good format for the data we were trying to store. The dataset produced is lost and was not very interesting or useful.&lt;br /&gt;
* WCD Generation 2: [https://github.com/internetarchive/iari IARI]; Postgres-based, but was extremely difficult to work with. Parts of it are used to support the Internet Archive Reference Explorer.&lt;br /&gt;
* WCD Generation 3: The latest attempt, using [https://github.com/internetarchive/wiki-references-extractor wiki-references-extractor] and [https://github.com/internetarchive/wiki-references-db wiki-references-db] (WRDB). WRDB is a core component of the broader WCD project.&lt;br /&gt;
** Version 1: https://wikipediacitations.scatter.red&lt;br /&gt;
*** Partial build of English Wikipedia&lt;br /&gt;
*** Has issues dealing with broken wikitext&lt;br /&gt;
** Version 2: &lt;br /&gt;
*** Rather than build an entire database upfront, have it be possible to analyze a &#039;&#039;&#039;page URL&#039;&#039;&#039; at a &#039;&#039;&#039;point in time&#039;&#039;&#039;. This will allow us to prove the extraction mechanisms are working without resorting to building an entire database first&lt;br /&gt;
**** Available as [https://wikipediacitations-now.scatter.red Wikipedia Citations Now]&lt;br /&gt;
*** Be able to tell what part of the article a reference is from (in-line vs. endnote, etc.)&lt;br /&gt;
*** JSON representation of template parameters so you don&#039;t have to parse them out of the template&lt;br /&gt;
*** Use LLM to parse the article in general. Maybe have an option to select between classical parsing and LLM-based extraction.&lt;br /&gt;
**** Use basic mwparserfromhell extraction as first step. If a user comes across an entry and it looks broken, offer option to re-generate report with AI. User can then accept alternative.&lt;br /&gt;
*** In data model, associate references with revision id; then, associate revision IDs with timestamps in another table&lt;br /&gt;
&lt;br /&gt;
== Longer term challenges ==&lt;br /&gt;
&lt;br /&gt;
* Sometimes, Wikipedia includes statements from Wikidata, and those statements from Wikidata have citations, but they would not show up in revision text.&lt;br /&gt;
** Would this require cross-referencing with rendered HTML output?&lt;br /&gt;
** Or a &amp;quot;citations database plus&amp;quot; that includes the Wikidata item? (Wikipedia editors would probably like this for quickly comparing citations between Wikipedia article and Wikidata item.)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248119</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248119"/>
		<updated>2026-05-20T17:25:21Z</updated>

		<summary type="html">&lt;p&gt;Harej: one-third milestone&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01 Rebuild attempt ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild attempt ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;br /&gt;
* 2026-04-02: Begin munge process&lt;br /&gt;
* 2026-04-04 15:23 (UTC): Begin loading of &#039;&#039;&#039;4971&#039;&#039;&#039; bundles&lt;br /&gt;
* 2026-04-09 22:28 (UTC): Completed 1657 (one-third of) files&lt;br /&gt;
* 2026-04-20 03:27 (UTC): Completed 3314 (two-thirds of) files&lt;br /&gt;
* Canceled at 5:52 PM local time on 2026-04-21 since it was not going to complete and more urgent hardware matters needed to be attended to&lt;br /&gt;
&lt;br /&gt;
== 2026-05 Rebuild attempt ==&lt;br /&gt;
This time on the workstation again.&lt;br /&gt;
&lt;br /&gt;
* 2026-05-16 at 10:55 AM PDT:&lt;br /&gt;
** docker compose exec wdqs /wdqs/munge.sh -c 25000 -f /sources/wikidata-&#039;&#039;&#039;20260511&#039;&#039;&#039;-all-BETA.ttl.gz -d /sources&lt;br /&gt;
* 2026-05-17 at 01:04 PM PDT (across 5,007 shards):&lt;br /&gt;
** /wdqs/loadData.sh -n wdq -d /sources &amp;amp;&amp;amp; /wdqs/runUpdate.sh&lt;br /&gt;
** 2026-05-20 at 07:35:39 AM PDT: 1669 files (one-third) loaded&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248118</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248118"/>
		<updated>2026-05-17T20:04:47Z</updated>

		<summary type="html">&lt;p&gt;Harej: Update&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01 Rebuild attempt ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild attempt ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;br /&gt;
* 2026-04-02: Begin munge process&lt;br /&gt;
* 2026-04-04 15:23 (UTC): Begin loading of &#039;&#039;&#039;4971&#039;&#039;&#039; bundles&lt;br /&gt;
* 2026-04-09 22:28 (UTC): Completed 1657 (one-third of) files&lt;br /&gt;
* 2026-04-20 03:27 (UTC): Completed 3314 (two-thirds of) files&lt;br /&gt;
* Canceled at 5:52 PM local time on 2026-04-21 since it was not going to complete and more urgent hardware matters needed to be attended to&lt;br /&gt;
&lt;br /&gt;
== 2026-05 Rebuild attempt ==&lt;br /&gt;
This time on the workstation again.&lt;br /&gt;
&lt;br /&gt;
* 2026-05-16 at 10:55 AM PDT:&lt;br /&gt;
** docker compose exec wdqs /wdqs/munge.sh -c 25000 -f /sources/wikidata-&#039;&#039;&#039;20260511&#039;&#039;&#039;-all-BETA.ttl.gz -d /sources&lt;br /&gt;
* 2026-05-17 at 01:04 PM PDT (over 5,007 shards):&lt;br /&gt;
** /wdqs/loadData.sh -n wdq -d /sources &amp;amp;&amp;amp; /wdqs/runUpdate.sh&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248117</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248117"/>
		<updated>2026-05-16T17:57:08Z</updated>

		<summary type="html">&lt;p&gt;Harej: 2026-05 rebuild attempt&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01 Rebuild attempt ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild attempt ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;br /&gt;
* 2026-04-02: Begin munge process&lt;br /&gt;
* 2026-04-04 15:23 (UTC): Begin loading of &#039;&#039;&#039;4971&#039;&#039;&#039; bundles&lt;br /&gt;
* 2026-04-09 22:28 (UTC): Completed 1657 (one-third of) files&lt;br /&gt;
* 2026-04-20 03:27 (UTC): Completed 3314 (two-thirds of) files&lt;br /&gt;
* Canceled at 5:52 PM local time on 2026-04-21 since it was not going to complete and more urgent hardware matters needed to be attended to&lt;br /&gt;
&lt;br /&gt;
== 2026-05 Rebuild attempt ==&lt;br /&gt;
This time on the workstation again.&lt;br /&gt;
&lt;br /&gt;
* 2026-05-16 at 10:55 AM PDT: docker compose exec wdqs /wdqs/munge.sh -c 25000 -f /sources/wikidata-20260511-all-BETA.ttl.gz -d /sources&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248116</id>
		<title>Project:Analytics/Wikibases</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248116"/>
		<updated>2026-05-11T17:53:17Z</updated>

		<summary type="html">&lt;p&gt;Harej: Migration complete&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The &amp;quot;Cool Wikibase Network&amp;quot; includes Wikibases that James Hare created. Wikibases that were created by other people may be admitted into the network in the future.&lt;br /&gt;
&lt;br /&gt;
These wikis are:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;s&amp;gt;Open Miscellaneous Wikibase&amp;lt;/s&amp;gt; (January 2023–January 2026)&lt;br /&gt;
* [https://librarybase.org Librarybase] (May 2023–present)&lt;br /&gt;
* [https://internetdomains.wiki Internet Domains Wikibase] (May 2023–present)&lt;br /&gt;
* [https://wikibase.world Wikibase World] (June 2023–present)&lt;br /&gt;
* [https://communities.miraheze.org Miraheze Communities] (September 2025–present)&lt;br /&gt;
Membership in the Cool Wikibase Network mainly means being included in the [[Template:Cwbn]] footer featured on the member wikis, a callback to the days of web rings. It otherwise confers no material advantage or special privilege.&lt;br /&gt;
&lt;br /&gt;
==2026-01-16 Migration away from Wikibase.Cloud==&lt;br /&gt;
Wikibase Cloud is proposing new hosting requirements that would subject tenants to evaluation against arbitrary, shifting standards. To reduce operational uncertainty, I am moving critical systems off Wikibase.Cloud in favor of self-hosting.&lt;br /&gt;
* 2026-01-16: I migrated the majority of Open Miscellaneous Wikibase to Librarybase and then deleted the old wiki.&lt;br /&gt;
** InternetArchiveBot-related template tracking ([[Item:Q246948|example item]])&lt;br /&gt;
** [[Item:Q246951|MIT License modeling experiment]]&lt;br /&gt;
* 2026-01-18:&lt;br /&gt;
** Deleted the &amp;quot;credbot-daily&amp;quot; job on &amp;quot;vaccinesafety&amp;quot; Toolforge project in anticipation of migration.&lt;br /&gt;
** Posted [https://domains.wikibase.cloud/wiki/MediaWiki:Sitenotice Sitenotice on Internet Domains]&lt;br /&gt;
** Registered internetdomains.wiki&lt;br /&gt;
** [https://phabricator.wikimedia.org/T414893 Requested dump] of Internet Domains&lt;br /&gt;
*2026-04-28:&lt;br /&gt;
**Set up a VM on my server to turn import SQL dump and export XML dump for import into Miraheze; TTL dump for ingest into a query service&lt;br /&gt;
**Created &amp;quot;internetdomains.wiki&amp;quot; item to commemorate new wiki (and so that the query service updater doesn&#039;t complain about the data being too old)&lt;br /&gt;
**Ran commands:&lt;br /&gt;
***&amp;lt;code&amp;gt;php maintenance/run.php dumpBackup --full &amp;gt; /tmp/internetdomains-2026-03.xml&amp;lt;/code&amp;gt;&lt;br /&gt;
***&amp;lt;code&amp;gt;php extensions/Wikibase/repo/maintenance/dumpRdf.php --format=ttl --server &amp;lt;nowiki&amp;gt;https://internetdomains.wiki&amp;lt;/nowiki&amp;gt; &amp;gt; /tmp/internetdomains-2026-03.ttl&amp;lt;/code&amp;gt;&lt;br /&gt;
Migration to Miraheze completed on May 10, 2026&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Template:Cwbn&amp;diff=248115</id>
		<title>Template:Cwbn</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Template:Cwbn&amp;diff=248115"/>
		<updated>2026-05-11T00:30:41Z</updated>

		<summary type="html">&lt;p&gt;Harej: Internet Domains new address&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;div style=&amp;quot;margin-top:2em&amp;quot;&amp;gt;&amp;lt;hr style=&amp;quot;width:5em; margin-bottom:0.75em;&amp;quot; /&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1.5em&amp;quot;&amp;gt;&#039;&#039;&#039;&#039;&#039;Cool Wikibase Network&#039;&#039;&#039;&#039;&#039;&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://wikibase.world Wikibase World]&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://librarybase.org Librarybase]&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://internetdomains.wiki Internet Domains]&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://communities.miraheze.org Miraheze Communities]&amp;lt;/span&amp;gt;&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248113</id>
		<title>Project:Analytics/Wikibases</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248113"/>
		<updated>2026-04-29T05:36:24Z</updated>

		<summary type="html">&lt;p&gt;Harej: Update&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The &amp;quot;Cool Wikibase Network&amp;quot; includes Wikibases that James Hare created. Wikibases that were created by other people may be admitted into the network in the future.&lt;br /&gt;
&lt;br /&gt;
These wikis are:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;s&amp;gt;Open Miscellaneous Wikibase&amp;lt;/s&amp;gt; (January 2023–January 2026)&lt;br /&gt;
* [https://librarybase.org Librarybase] (May 2023–present)&lt;br /&gt;
* [https://internetdomains.wiki Internet Domains Wikibase] (May 2023–present)&lt;br /&gt;
* [https://wikibase.world Wikibase World] (June 2023–present)&lt;br /&gt;
* [https://communities.miraheze.org Miraheze Communities] (September 2025–present)&lt;br /&gt;
Membership in the Cool Wikibase Network mainly means being included in the [[Template:Cwbn]] footer featured on the member wikis, a callback to the days of web rings. It otherwise confers no material advantage or special privilege.&lt;br /&gt;
&lt;br /&gt;
==2026-01-16 Migration away from Wikibase.Cloud==&lt;br /&gt;
Wikibase Cloud is proposing new hosting requirements that would subject tenants to evaluation against arbitrary, shifting standards. To reduce operational uncertainty, I am moving critical systems off Wikibase.Cloud in favor of self-hosting.&lt;br /&gt;
* 2026-01-16: I migrated the majority of Open Miscellaneous Wikibase to Librarybase and then deleted the old wiki.&lt;br /&gt;
** InternetArchiveBot-related template tracking ([[Item:Q246948|example item]])&lt;br /&gt;
** [[Item:Q246951|MIT License modeling experiment]]&lt;br /&gt;
* 2026-01-18:&lt;br /&gt;
** Deleted the &amp;quot;credbot-daily&amp;quot; job on &amp;quot;vaccinesafety&amp;quot; Toolforge project in anticipation of migration.&lt;br /&gt;
** Posted [https://domains.wikibase.cloud/wiki/MediaWiki:Sitenotice Sitenotice on Internet Domains]&lt;br /&gt;
** Registered internetdomains.wiki&lt;br /&gt;
** [https://phabricator.wikimedia.org/T414893 Requested dump] of Internet Domains&lt;br /&gt;
*2026-04-28:&lt;br /&gt;
**Set up a VM on my server to turn import SQL dump and export XML dump for import into Miraheze; TTL dump for ingest into a query service&lt;br /&gt;
**Created &amp;quot;internetdomains.wiki&amp;quot; item to commemorate new wiki (and so that the query service updater doesn&#039;t complain about the data being too old)&lt;br /&gt;
**Ran commands:&lt;br /&gt;
***&amp;lt;code&amp;gt;php maintenance/run.php dumpBackup --full &amp;gt; /tmp/internetdomains-2026-03.xml&amp;lt;/code&amp;gt;&lt;br /&gt;
***&amp;lt;code&amp;gt;php extensions/Wikibase/repo/maintenance/dumpRdf.php --format=ttl --server &amp;lt;nowiki&amp;gt;https://internetdomains.wiki&amp;lt;/nowiki&amp;gt; &amp;gt; /tmp/internetdomains-2026-03.ttl&amp;lt;/code&amp;gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248109</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248109"/>
		<updated>2026-04-26T06:58:39Z</updated>

		<summary type="html">&lt;p&gt;Harej: Unfortunate update&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01-22 Rebuild ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;br /&gt;
* 2026-04-02: Begin munge process&lt;br /&gt;
* 2026-04-04 15:23 (UTC): Begin loading of &#039;&#039;&#039;4971&#039;&#039;&#039; bundles&lt;br /&gt;
* 2026-04-09 22:28 (UTC): Completed 1657 (one-third of) files&lt;br /&gt;
* 2026-04-20 03:27 (UTC): Completed 3314 (two-thirds of) files&lt;br /&gt;
* Canceled at 5:52 PM local time on 2026-04-21 since it was not going to complete and more urgent hardware matters needed to be attended to&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248108</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248108"/>
		<updated>2026-04-09T22:46:06Z</updated>

		<summary type="html">&lt;p&gt;Harej: One-third milestone&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01-22 Rebuild ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;br /&gt;
* 2026-04-02: Begin munge process&lt;br /&gt;
* 2026-04-04 15:23 (UTC): Begin loading of &#039;&#039;&#039;4971&#039;&#039;&#039; bundles&lt;br /&gt;
* 2026-04-09 22:28 (UTC): Completed 1657 (one-third of) files&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Home&amp;diff=248107</id>
		<title>Project:Home</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Home&amp;diff=248107"/>
		<updated>2026-04-08T16:20:54Z</updated>

		<summary type="html">&lt;p&gt;Harej: Links to basic documentation pages&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Welcome to &#039;&#039;&#039;Librarybase, the collaborative card catalog of everything&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
Librarybase is a project to organize, refine, and distribute metadata, as a complement to other projects like [[Item:Q2|Wikidata]] or [[Item:Q253|OpenAlex]]. For more information about Librarybase:&lt;br /&gt;
&lt;br /&gt;
* [[Project:Principles]]: The motivations behind the project&lt;br /&gt;
* [[Project:Scope]]: What is accepted for inclusion on this wiki&lt;br /&gt;
* [[Project:Consensus]]: How decisions are made here&lt;br /&gt;
{{cwbn}}&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=User:Harej&amp;diff=248106</id>
		<title>User:Harej</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=User:Harej&amp;diff=248106"/>
		<updated>2026-04-08T16:11:52Z</updated>

		<summary type="html">&lt;p&gt;Harej: IABot deployment status&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Founder of Librarybase&lt;br /&gt;
&lt;br /&gt;
Links:&lt;br /&gt;
* [https://meta.wikimedia.org/wiki/User:Harej Wikimedia user page]&lt;br /&gt;
* [https://scatter.red Scatter]&lt;br /&gt;
* [https://wikis.world/@hare @hare @wikis.world]&lt;br /&gt;
* [https://harej.co Personal website]&lt;br /&gt;
&lt;br /&gt;
== Project ideas ==&lt;br /&gt;
&lt;br /&gt;
* Mirror of WikiApiary, creating an item for each MediaWiki deployment and extension and service used by that deployment&lt;br /&gt;
* InternetArchiveBot deployment status&lt;br /&gt;
* An item for each Wikipedia article in each language, as part of Pageset&lt;br /&gt;
* Something with [https://obsoletemedia.org Museum of Obsolete Media]&lt;br /&gt;
* Historic patents&lt;br /&gt;
* Books written in the Odia language. Data pending import; [[Item:Q458|demonstration item]]&lt;br /&gt;
* Metadata items were created on works that may or may not be in the public domain, facilitating identification of these works. [[Item:Q509|Example item]]&lt;br /&gt;
&lt;br /&gt;
== Classes and roles ==&lt;br /&gt;
Items are organized by class using [[Property:P19]], with [[Item:Q410]] as the root class item. See [https://query.librarybase.org/#prefix%20lb%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fentity%2F%3E%0Aprefix%20lbt%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fprop%2Fdirect%2F%3E%0Aselect%20%3Fitem%20%3FitemLabel%20where%20%7B%20%3Fitem%20lbt%3AP19%20lb%3AQ410%20.%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%20%7D current list of class items on Librarybase Query Service]. This is meant to bring Librarybase closer in alignment with databases like Fatcat and OpenAlex.&lt;br /&gt;
&lt;br /&gt;
This differs from Wikidata&#039;s &amp;quot;instance of&amp;quot; and &amp;quot;subclass of&amp;quot; properties in that items are assigned classes according to a controlled vocabulary. This can be used instead of or in conjunction with those properties. One major departure from Wikidata is the concept of the [[Item:Q413|single-edition work]], used to avoid creating separate work and edition items when there is no real reason to.&lt;br /&gt;
&lt;br /&gt;
One subclass is &amp;quot;[[Item:Q422|role]],&amp;quot; which a member of a class can have. See [https://query.librarybase.org/#prefix%20lb%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fentity%2F%3E%0Aprefix%20lbt%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fprop%2Fdirect%2F%3E%0Aselect%20%3Fitem%20%3FitemLabel%20where%20%7B%20%3Fitem%20lbt%3AP19%20lb%3AQ422%20.%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%20%7D current list of role-type items on Librarybase Query Service].&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Home&amp;diff=248105</id>
		<title>Project:Home</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Home&amp;diff=248105"/>
		<updated>2026-04-08T03:27:27Z</updated>

		<summary type="html">&lt;p&gt;Harej: Clearing projects, pending something else&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Welcome to &#039;&#039;&#039;Librarybase, the collaborative card catalog of everything&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
Librarybase is a project to organize, refine, and distribute metadata, as a complement to other projects like [[Item:Q2|Wikidata]] or [[Item:Q253|OpenAlex]]. The motivations behind the project are further described at [[Project:Principles]].&lt;br /&gt;
&lt;br /&gt;
{{cwbn}}&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=User:Harej&amp;diff=248104</id>
		<title>User:Harej</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=User:Harej&amp;diff=248104"/>
		<updated>2026-04-08T03:27:19Z</updated>

		<summary type="html">&lt;p&gt;Harej: Moving from main page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Founder of Librarybase&lt;br /&gt;
&lt;br /&gt;
Links:&lt;br /&gt;
* [https://meta.wikimedia.org/wiki/User:Harej Wikimedia user page]&lt;br /&gt;
* [https://scatter.red Scatter]&lt;br /&gt;
* [https://wikis.world/@hare @hare @wikis.world]&lt;br /&gt;
* [https://harej.co Personal website]&lt;br /&gt;
&lt;br /&gt;
== Project ideas ==&lt;br /&gt;
&lt;br /&gt;
* Fork of WikiApiary, creating an item for each MediaWiki deployment and extension and service used by that deployment&lt;br /&gt;
* An item for each Wikipedia article in each language, as part of Pageset&lt;br /&gt;
* Something with [https://obsoletemedia.org Museum of Obsolete Media]&lt;br /&gt;
* Historic patents&lt;br /&gt;
* Books written in the Odia language. Data pending import; [[Item:Q458|demonstration item]]&lt;br /&gt;
* Metadata items were created on works that may or may not be in the public domain, facilitating identification of these works. [[Item:Q509|Example item]]&lt;br /&gt;
&lt;br /&gt;
== Classes and roles ==&lt;br /&gt;
Items are organized by class using [[Property:P19]], with [[Item:Q410]] as the root class item. See [https://query.librarybase.org/#prefix%20lb%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fentity%2F%3E%0Aprefix%20lbt%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fprop%2Fdirect%2F%3E%0Aselect%20%3Fitem%20%3FitemLabel%20where%20%7B%20%3Fitem%20lbt%3AP19%20lb%3AQ410%20.%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%20%7D current list of class items on Librarybase Query Service]. This is meant to bring Librarybase closer in alignment with databases like Fatcat and OpenAlex.&lt;br /&gt;
&lt;br /&gt;
This differs from Wikidata&#039;s &amp;quot;instance of&amp;quot; and &amp;quot;subclass of&amp;quot; properties in that items are assigned classes according to a controlled vocabulary. This can be used instead of or in conjunction with those properties. One major departure from Wikidata is the concept of the [[Item:Q413|single-edition work]], used to avoid creating separate work and edition items when there is no real reason to.&lt;br /&gt;
&lt;br /&gt;
One subclass is &amp;quot;[[Item:Q422|role]],&amp;quot; which a member of a class can have. See [https://query.librarybase.org/#prefix%20lb%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fentity%2F%3E%0Aprefix%20lbt%3A%20%3Chttps%3A%2F%2Flibrarybase.org%2Fprop%2Fdirect%2F%3E%0Aselect%20%3Fitem%20%3FitemLabel%20where%20%7B%20%3Fitem%20lbt%3AP19%20lb%3AQ422%20.%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%20%7D current list of role-type items on Librarybase Query Service].&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248103</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248103"/>
		<updated>2026-04-04T15:23:08Z</updated>

		<summary type="html">&lt;p&gt;Harej: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01-22 Rebuild ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;br /&gt;
* 2026-04-02: Begin munge process&lt;br /&gt;
* 15:23, 4 April 2026 (UTC): Begin loading of &#039;&#039;&#039;4971&#039;&#039;&#039; bundles&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248102</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248102"/>
		<updated>2026-04-03T15:25:21Z</updated>

		<summary type="html">&lt;p&gt;Harej: Munge&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01-22 Rebuild ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;br /&gt;
* 2026-04-02: Begin munge process&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248101</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248101"/>
		<updated>2026-04-03T01:49:08Z</updated>

		<summary type="html">&lt;p&gt;Harej: 2026-04 rebuild&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01-22 Rebuild ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-04 Rebuild ==&lt;br /&gt;
The last rebuild attempt, on my workstation, failed because it took too long to build the database. By the time it was finished, the data was more than 30 days old and could not be synced with recent changes. Trying again, directly on the production server (station1001), with wikidata-&#039;&#039;&#039;20260330&#039;&#039;&#039;-all-BETA.ttl.gz, only three days old as of writing.&lt;br /&gt;
&lt;br /&gt;
* 2026-04-02: Direct rsync from WMF dump server to orb-wdqs&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Reviews/Wikipedia_citation_extraction&amp;diff=248100</id>
		<title>Project:Reviews/Wikipedia citation extraction</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Reviews/Wikipedia_citation_extraction&amp;diff=248100"/>
		<updated>2026-03-13T00:38:49Z</updated>

		<summary type="html">&lt;p&gt;Harej: Start&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== OpenAlex search: wikipedia citations ==&lt;br /&gt;
&lt;br /&gt;
* Rahimi and Danesh 2023: [https://openalex.org/works/w4384659478 Analyzing Persian Wikipedia&#039;s citations to discover the effectiveness of Persian scientific papers: applied web mining techniques]&lt;br /&gt;
* Kokash, Natallia, Colavizza, Giovanni 2023: [https://openalex.org/works/w4393797324 A Comprehensive Dataset of Citations with Identifiers from English Wikipedia (2023)]&lt;br /&gt;
** [https://openalex.org/works/w4394028614 Same thing?]&lt;br /&gt;
* Baigutanova, Myung, Saez-Trumper, Chou, et al. 2023: [https://openalex.org/works/w4323925750 Longitudinal Assessment of Reference Quality on Wikipedia]&lt;br /&gt;
* Xiang and Chaoqun 2023: [https://openalex.org/works/w4321055481 Gender and Country Biases in Wikipedia Citations to Scholarly Publications]&lt;br /&gt;
** [https://openalex.org/works/w4321055481 Same thing?]&lt;br /&gt;
* Wight 2023: [https://openalex.org/works/w4394113280 Reference and map usage across Wikimedia wiki pages]&lt;br /&gt;
* Dunne, Halpin, Humphreys, McCormack et al. 2023: [https://openalex.org/works/w4367667587 Academia by Speculation: Debunking the Flawed Science behind the Claim that Wikipedia Influences Judges]&lt;br /&gt;
* Thompson, Luo, McKenzie, Richardson et al. 2023: [https://openalex.org/works/w4318612963 User-Generated Content Shapes Judicial Reasoning: Evidence From a Randomized Control Trial on Wikipedia]&lt;br /&gt;
* Zheng, Chen, Yan, Ni 2022: [https://openalex.org/works/w4308500239 Gender and country biases in Wikipedia citations to scholarly publications]&lt;br /&gt;
* Przybyła, Borkowski, Kaczyński 2022: [https://openalex.org/works/w4393821726 Wikipedia Complete Citation Corpus]&lt;br /&gt;
* Yang and Colavizza 2022: [https://openalex.org/works/w3210253694 A Map of Science in Wikipedia]&lt;br /&gt;
** [https://openalex.org/works/w4293024133 Same thing?]&lt;br /&gt;
* Benjakob, Aviram, Sobel 2023: [https://openalex.org/works/w4226283917 Citation needed? Wikipedia bibliometrics during the first wave of the COVID-19 pandemic]&lt;br /&gt;
* Czygan, Holzmann, Newbold 2021: [https://openalex.org/works/w3206003844 Refcat: The Internet Archive Scholar Citation Graph]&lt;br /&gt;
* Zhao and Strotmann 2021: [https://openalex.org/works/w3158340182 Mapping knowledge domains on Wikipedia: an author bibliographic coupling analysis of traditional Chinese medicine]&lt;br /&gt;
* Leva 2021: [https://openalex.org/works/w4393513762 English Wikipedia citations with possible SemanticScholar URLs as found by Unpaywall and OAbot]&lt;br /&gt;
** [https://openalex.org/works/w4393425517 Same thing?]&lt;br /&gt;
* Banasik-Jemielniak, Jemielniak, Wilamowski 2021: [https://openalex.org/works/w3130944461 Psychology and Wikipedia: Measuring Psychology Journals’ Impact by Wikipedia Citations]&lt;br /&gt;
* Sobel, Benjakob, Aviram 2021: [https://openalex.org/works/w3209525497 A meta analysis of Wikipedia&#039;s coronavirus sources during the COVID-19 pandemic]&lt;br /&gt;
** [https://openalex.org/works/w4393415590 Same thing?]&lt;br /&gt;
* &#039;&#039;&#039;Singh, West, Colavizza 2021: [https://openalex.org/works/w3113998794 Wikipedia citations: A comprehensive data set of citations with identifiers extracted from English Wikipedia]&#039;&#039;&#039;&lt;br /&gt;
* Piccardi, West, Redi, Colavizza 2020: [https://openalex.org/works/w3111097238 Quantifying Engagement with Citations on Wikipedia. (Part 2) (The translation and original text of the article are presented)]&lt;br /&gt;
* Piccardi, West, Redi, Colavizza 2020: [https://openalex.org/works/w3096844803 Quantifying Engagement with Citations on Wikipedia. (Part 1)]&lt;br /&gt;
* Colavizza 2020: [https://openalex.org/works/w3044875688 COVID-19 research in Wikipedia]&lt;br /&gt;
* &#039;&#039;&#039;Singh, West, Colavizza 2020: [https://openalex.org/works/w4393774949 Wikipedia Citations: A comprehensive dataset of citations with identifiers extracted from English Wikipedia]&#039;&#039;&#039;&lt;br /&gt;
* Agarwal, Redi, Sastry, Wood et al. 2020: [https://openalex.org/works/w3037750177 Wikipedia and Westminster: Quality and Dynamics of Wikipedia Pages about UK Politicians]&lt;br /&gt;
* Leva 2020: [https://openalex.org/works/w4393800038 Yearly pageviews of English Wikipedia articles with potential links to green open access scholarly articles]&lt;br /&gt;
* Piccardi, Redi, Colavizza, West 2020: [https://openalex.org/works/w3002073420 Quantifying Engagement with Citations on Wikipedia]&lt;br /&gt;
* Nicholson, Uppala, Sieber, Grabitz et al. 2020: [https://openalex.org/works/w3015681655 Measuring the quality of scientific references in Wikipedia: an analysis of more than 115M citations to over 800,000 scientific articles]&lt;br /&gt;
* Arroyo-Machado, Torres-Salinas, Herrera-Viedma, Romero-Frías 2020: [https://openalex.org/works/w4287868685 Science through Wikipedia: A novel representation of open knowledge through co-citation networks]&lt;br /&gt;
* &#039;&#039;&#039;Singh, West, Colavizza 2020: [https://openalex.org/works/w4394037276 A Comprehensive Dataset of Citations with Identifiers from English Wikipedia]&#039;&#039;&#039;&lt;br /&gt;
* Piccardi, Redi, Colavizza, West 2020: [https://openalex.org/works/w3021826561 Quantifying Engagement with Citations on Wikipedia]&lt;br /&gt;
* Joorabchi, Doherty, Dawson 2019: [https://openalex.org/works/w2998658522 ‘WP2Cochrane’, a tool linking Wikipedia to the Cochrane Library: Results of a bibliometric analysis evaluating article quality and importance]&lt;br /&gt;
* Redi, Fetahu, Morgan, Taraborelli 2019: [https://openalex.org/works/w2911641747 Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia&#039;s Verifiability]&lt;br /&gt;
* Nazarian and Bassiliades 2018: [https://openalex.org/works/w2787826775 Towards Linking DBpedia’s Bibliographic References to Bibliographic Repositories]&lt;br /&gt;
* Lewoniewski, Härting, Węcel, Reichstein et al. 2018: [https://openalex.org/works/w2889055695 Application of SEO Metrics to Determine the Quality of Wikipedia Articles and Their Sources]&lt;br /&gt;
* Redi and Taraborelli 2018: [https://openalex.org/works/w4394434007 Accessibility and topics of citations with identifiers in Wikipedia]&lt;br /&gt;
** [https://openalex.org/works/w4394371398 Same thing?]&lt;br /&gt;
* Maggio, Willinsky, Steinberg, Mietchen et al.: [https://openalex.org/works/w2952239097 Wikipedia as a gateway to biomedical research: The relative distribution and use of citations in the English Wikipedia]&lt;br /&gt;
** [https://openalex.org/works/w2738894751 Same thing?]&lt;br /&gt;
* Leva 2017: [https://openalex.org/works/w3210630434 DOIs linked by the English Wikipedia which could be made available in green Open Access]&lt;br /&gt;
* Pooladian and Borrego 2017: [https://openalex.org/works/w2741737307 Methodological issues in measuring citations in Wikipedia: a case study in Library and Information Science]&lt;br /&gt;
* Serrano-López, Ingwersen, Sanz-Casado 2017: [https://openalex.org/works/w2724499206 Wind power research in Wikipedia: Does Wikipedia demonstrate direct influence of research publications and can it be used as adequate source in research evaluation?]&lt;br /&gt;
* Kousha and Thelwall 2017: [https://openalex.org/works/w2530523999 News stories as evidence for research? BBC citations from articles, Books, and Wikipedia]&lt;br /&gt;
* Kousha and Thelwall 2017: [https://openalex.org/works/w3035760182 Are wikipedia citations important evidence of the impact of scholarly articles and books]&lt;br /&gt;
* Halfaker, Kim, Forte, Taraborelli 2017: [https://openalex.org/works/w2772040480 Citations with contexts in Wikipedia]&lt;br /&gt;
** [https://openalex.org/works/w4394385802 Same thing?]&lt;br /&gt;
* Halfaker and Taraborelli 2017: [https://openalex.org/works/w4394533018 Scholarly article citations in Wikipedia]&lt;br /&gt;
* Deng and Liu 2017: [https://openalex.org/works/w2757374563 Evaluating the Citations of Information Systems Journals in Wikipedia]&lt;br /&gt;
*&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Reviews&amp;diff=248099</id>
		<title>Project:Reviews</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Reviews&amp;diff=248099"/>
		<updated>2026-03-12T23:16:27Z</updated>

		<summary type="html">&lt;p&gt;Harej: The first literature review on Librarybase&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;&#039;Literature reviews!&#039;&#039;&#039; Use Librarybase to catalog and synthesize works. Build metadata records while doing so.&lt;br /&gt;
&lt;br /&gt;
By default, any given literature review is &#039;&#039;&#039;open for all&#039;&#039;&#039; to participate. If, for some reason, you do not want this, please clearly state so at the beginning of the page.&lt;br /&gt;
&lt;br /&gt;
== Ongoing reviews ==&lt;br /&gt;
&lt;br /&gt;
* [[Project:Reviews/Wikipedia citation extraction]]&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248097</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248097"/>
		<updated>2026-01-24T23:25:51Z</updated>

		<summary type="html">&lt;p&gt;Harej: Now ingesting&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01-22 Rebuild ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;br /&gt;
&lt;br /&gt;
Munge process produced 4940 bundles; now ingesting into Blazegraph. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 23:25, 24 January 2026 (UTC)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248096</id>
		<title>Project:Analytics/Orb</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Orb&amp;diff=248096"/>
		<updated>2026-01-22T22:33:09Z</updated>

		<summary type="html">&lt;p&gt;Harej: Rebuild&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025-10-25 Orb Blended ==&lt;br /&gt;
The virtual machine &amp;quot;Orb Blended&amp;quot; in my workstation cluster was created to experiment with blending the Wikidata Query Service with other datasets in a single Blazegraph instance. Work started earlier, but this is the first journal.&lt;br /&gt;
&lt;br /&gt;
For the last few weeks it has been building the Wikidata Query Service. I have paused it at file 4470 since I am taking the system down. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 18:25, 25 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
* Resuming build. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:49, 28 October 2025 (UTC)&lt;br /&gt;
* 2025-11-08: Rebuild took too long; have to start again. Downloading dump from November 3. If I can&#039;t complete the build by December 3, I am not sure Blazegraph is still viable.&lt;br /&gt;
* 2025-12-06: Rebuild of dataset is complete; local up-to-date copy of WDQS on dev1001.&lt;br /&gt;
* Started to gzip data.jnl in preparation to upload to fileserver and as a backup before experimenting with adding more data sources. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 02:00, 7 December 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
== 2026-01-22 Rebuild ==&lt;br /&gt;
Updater is running into numerous issues on both my workstation and the server, so I am downloading wikidata-20260112-all-BETA.ttl.gz to my workstation for a rebuild. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 22:33, 22 January 2026 (UTC)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/WCD&amp;diff=248095</id>
		<title>Project:Analytics/WCD</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/WCD&amp;diff=248095"/>
		<updated>2026-01-22T03:50:11Z</updated>

		<summary type="html">&lt;p&gt;Harej: Wikipedia Citations Now; longer term challenges&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Introduction ==&lt;br /&gt;
&#039;&#039;&#039;Wikipedia Citations Database&#039;&#039;&#039; is an effort supported by Internet Archive to build a comprehensive, historical database of each citation to appear on Wikipedia.&lt;br /&gt;
&lt;br /&gt;
While methodologies of similar projects have focused on extracting standardized identifiers or easy-to-extract references, this project instead seeks to build a &#039;&#039;complete&#039;&#039; database, based on analyzing the structure.&lt;br /&gt;
&lt;br /&gt;
== Generations ==&lt;br /&gt;
&lt;br /&gt;
* WCD Generation 1: Wikibase instance on Wikibase Cloud. Importing data was painfully slow, and it was decided Wikibase was not a good format for the data we were trying to store. The dataset produced is lost and was not very interesting or useful.&lt;br /&gt;
* WCD Generation 2: [https://github.com/internetarchive/iari IARI]; Postgres-based, but was extremely difficult to work with. Parts of it are used to support the Internet Archive Reference Explorer.&lt;br /&gt;
* WCD Generation 3: The latest attempt, using [https://github.com/internetarchive/wiki-references-extractor wiki-references-extractor] and [https://github.com/internetarchive/wiki-references-db wiki-references-db] (WRDB). WRDB is a core component of the broader WCD project.&lt;br /&gt;
** Version 1: https://wikipediacitations.scatter.red&lt;br /&gt;
*** Partial build of English Wikipedia&lt;br /&gt;
*** Has issues dealing with broken wikitext&lt;br /&gt;
** Version 2: &lt;br /&gt;
*** Rather than build an entire database upfront, have it be possible to analyze a &#039;&#039;&#039;page URL&#039;&#039;&#039; at a &#039;&#039;&#039;point in time&#039;&#039;&#039;. This will allow us to prove the extraction mechanisms are working without resorting to building an entire database first&lt;br /&gt;
**** Available as [https://wikipediacitations-now.scatter.red Wikipedia Citations Now]&lt;br /&gt;
*** Be able to tell what part of the article a reference is from (in-line vs. endnote, etc.)&lt;br /&gt;
*** JSON representation of template parameters so you don&#039;t have to parse them out of the template&lt;br /&gt;
*** Use LLM to parse the article in general. Maybe have an option to select between classical parsing and LLM-based extraction.&lt;br /&gt;
**** Use basic mwparserfromhell extraction as first step. If a user comes across an entry and it looks broken, offer option to re-generate report with AI. User can then accept alternative.&lt;br /&gt;
*** In data model, associate references with revision id; then, associate revision IDs with timestamps in another table&lt;br /&gt;
&lt;br /&gt;
== Longer term challenges ==&lt;br /&gt;
&lt;br /&gt;
* Sometimes, Wikipedia includes statements from Wikidata, and those statements from Wikidata have citations, but they would not show up in revision text.&lt;br /&gt;
** Would this require cross-referencing with rendered HTML output?&lt;br /&gt;
** Or a &amp;quot;citations database plus&amp;quot; that includes the Wikidata item? (Wikipedia editors would probably like this for quickly comparing citations between Wikipedia article and Wikidata item.)&lt;br /&gt;
&lt;br /&gt;
== 2025-10-16 WRDB migration ==&lt;br /&gt;
WRDB is the Postgres database of citation strings at the core of the WCD effort.&lt;br /&gt;
&lt;br /&gt;
Migration away from station1001 is underway. This started sooner but I am documenting my progress as of today.&lt;br /&gt;
&lt;br /&gt;
Successfully transferred between station1001 and the new VM, wrdb-gen3v1, and just now, verified via md5sum.&lt;br /&gt;
----&lt;br /&gt;
 jh@wrdb-gen3v1:/bulk/downloads$ md5sum 2024-10-wrdb.sql.gz&lt;br /&gt;
 &#039;&#039;&#039;1819&#039;&#039;&#039;ec7f76887e854903550ce1d&#039;&#039;&#039;30ac6&#039;&#039;&#039;  2024-10-wrdb.sql.gz&lt;br /&gt;
&lt;br /&gt;
 jh@station1001:/opt/librarybase/wikibase$ md5sum ~/2024-10-wrdb.sql.gz&lt;br /&gt;
 &#039;&#039;&#039;1819&#039;&#039;&#039;ec7f76887e854903550ce1d&#039;&#039;&#039;30ac6&#039;&#039;&#039;  /home/jh/2024-10-wrdb.sql.gz&lt;br /&gt;
----Downloading the English Wikipedia dump to my home workstation is going painfully slowly. I do not know why the download is only ~1.5MB/s. If I want to do a rebuild, my Plan B may be to instead set up a wrdb-gen3v2 on station1001 (rebuilt with Proxmox) and dedicate it to the rebuild. If I have a pre-process step that turns the dumps into a neat bundle of diffs, it should not require as much RAM to process (since I will no longer have to deal with XML in memory). I still want to do the WRDB rebuild on my workstation, but pre-processing on station1001 may give me a smaller file I can download. Once the pre-process is done I should have enough resources for a secondary copy of WDQS.&lt;br /&gt;
&lt;br /&gt;
In the meantime, now that I have a database dump of WRDB on the wrdb-gen3v1 virtual machine, the next steps are:&lt;br /&gt;
&lt;br /&gt;
# Re-import into Postgres&lt;br /&gt;
# Re-start web service&lt;br /&gt;
# Change proxy to point to new web service&lt;br /&gt;
# Create process for on-demand update:&lt;br /&gt;
## User requests data for a certain article. There is a notice saying to check back later if the data is not up to date.&lt;br /&gt;
## Article is added to the update queue&lt;br /&gt;
## Process works through the update queue, getting all revisions since the most recent one in the database&lt;br /&gt;
## Each revision has references extracted and the database is updated&lt;br /&gt;
## To prevent excessive updates, if an article is already in the queue, it can&#039;t be re-added to the queue. And if an article was refreshed in the last hour, it is sent to the back of the queue.&lt;br /&gt;
## Eventually add a low-priority queue that cycles through English Wikipedia in alphabetical order so that there is always passive updating.&lt;br /&gt;
&lt;br /&gt;
On-demand updates for particular articles should tide me over until I have the opportunity to do a full rebuild.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; I can&#039;t take down the current instance on station1001 until the new one is set up. Once the new one is set up, that&#039;s the final service before I can begin setting up Proxmox on station1001.&lt;br /&gt;
&lt;br /&gt;
== 2025-10-25 Downloading dump for Gen3V2 ==&lt;br /&gt;
Downloading paused for re-networking. To resume:&lt;br /&gt;
 rsync -avP -e &amp;quot;ssh -J jh@154.29.79.171&amp;quot; jh@10.0.1.8:/bulk/public/wikimedia/enwiki/20250901/enwiki-20250901-pages-meta-history*.xml-p*.bz2 ./&lt;br /&gt;
&lt;br /&gt;
Downloading resumed. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:47, 28 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
Download came to a stop on the orb-blended VM I used because it filled up. So I created a new VM, wrdb-gen3v2, moved the files to that, and resumed the download there. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 00:16, 3 November 2025 (UTC)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/WCD&amp;diff=248094</id>
		<title>Project:Analytics/WCD</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/WCD&amp;diff=248094"/>
		<updated>2026-01-22T03:27:23Z</updated>

		<summary type="html">&lt;p&gt;Harej: Wikipedia Citations Now&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Introduction ==&lt;br /&gt;
&#039;&#039;&#039;Wikipedia Citations Database&#039;&#039;&#039; is an effort supported by Internet Archive to build a comprehensive, historical database of each citation to appear on Wikipedia.&lt;br /&gt;
&lt;br /&gt;
While methodologies of similar projects have focused on extracting standardized identifiers or easy-to-extract references, this project instead seeks to build a &#039;&#039;complete&#039;&#039; database, based on analyzing the structure.&lt;br /&gt;
&lt;br /&gt;
== Generations ==&lt;br /&gt;
&lt;br /&gt;
* WCD Generation 1: Wikibase instance on Wikibase Cloud. Importing data was painfully slow, and it was decided Wikibase was not a good format for the data we were trying to store. The dataset produced is lost and was not very interesting or useful.&lt;br /&gt;
* WCD Generation 2: [https://github.com/internetarchive/iari IARI]; Postgres-based, but was extremely difficult to work with. Parts of it are used to support the Internet Archive Reference Explorer.&lt;br /&gt;
* WCD Generation 3: The latest attempt, using [https://github.com/internetarchive/wiki-references-extractor wiki-references-extractor] and [https://github.com/internetarchive/wiki-references-db wiki-references-db] (WRDB). WRDB is a core component of the broader WCD project.&lt;br /&gt;
** Version 1: https://wikipediacitations.scatter.red&lt;br /&gt;
*** Partial build of English Wikipedia&lt;br /&gt;
*** Has issues dealing with broken wikitext&lt;br /&gt;
** Version 2: &lt;br /&gt;
*** Rather than build an entire database upfront, have it be possible to analyze a &#039;&#039;&#039;page URL&#039;&#039;&#039; at a &#039;&#039;&#039;point in time&#039;&#039;&#039;. This will allow us to prove the extraction mechanisms are working without resorting to building an entire database first&lt;br /&gt;
**** Available as [https://wikipediacitations-now.scatter.red Wikipedia Citations Now]&lt;br /&gt;
*** Be able to tell what part of the article a reference is from (in-line vs. endnote, etc.)&lt;br /&gt;
*** JSON representation of template parameters so you don&#039;t have to parse them out of the template&lt;br /&gt;
*** Use LLM to parse the article in general. Maybe have an option to select between classical parsing and LLM-based extraction.&lt;br /&gt;
**** Use basic mwparserfromhell extraction as first step. If a user comes across an entry and it looks broken, offer option to re-generate report with AI. User can then accept alternative.&lt;br /&gt;
*** In data model, associate references with revision id; then, associate revision IDs with timestamps in another table&lt;br /&gt;
&lt;br /&gt;
== 2025-10-16 WRDB migration ==&lt;br /&gt;
WRDB is the Postgres database of citation strings at the core of the WCD effort.&lt;br /&gt;
&lt;br /&gt;
Migration away from station1001 is underway. This started sooner but I am documenting my progress as of today.&lt;br /&gt;
&lt;br /&gt;
Successfully transferred between station1001 and the new VM, wrdb-gen3v1, and just now, verified via md5sum.&lt;br /&gt;
----&lt;br /&gt;
 jh@wrdb-gen3v1:/bulk/downloads$ md5sum 2024-10-wrdb.sql.gz&lt;br /&gt;
 &#039;&#039;&#039;1819&#039;&#039;&#039;ec7f76887e854903550ce1d&#039;&#039;&#039;30ac6&#039;&#039;&#039;  2024-10-wrdb.sql.gz&lt;br /&gt;
&lt;br /&gt;
 jh@station1001:/opt/librarybase/wikibase$ md5sum ~/2024-10-wrdb.sql.gz&lt;br /&gt;
 &#039;&#039;&#039;1819&#039;&#039;&#039;ec7f76887e854903550ce1d&#039;&#039;&#039;30ac6&#039;&#039;&#039;  /home/jh/2024-10-wrdb.sql.gz&lt;br /&gt;
----Downloading the English Wikipedia dump to my home workstation is going painfully slowly. I do not know why the download is only ~1.5MB/s. If I want to do a rebuild, my Plan B may be to instead set up a wrdb-gen3v2 on station1001 (rebuilt with Proxmox) and dedicate it to the rebuild. If I have a pre-process step that turns the dumps into a neat bundle of diffs, it should not require as much RAM to process (since I will no longer have to deal with XML in memory). I still want to do the WRDB rebuild on my workstation, but pre-processing on station1001 may give me a smaller file I can download. Once the pre-process is done I should have enough resources for a secondary copy of WDQS.&lt;br /&gt;
&lt;br /&gt;
In the meantime, now that I have a database dump of WRDB on the wrdb-gen3v1 virtual machine, the next steps are:&lt;br /&gt;
&lt;br /&gt;
# Re-import into Postgres&lt;br /&gt;
# Re-start web service&lt;br /&gt;
# Change proxy to point to new web service&lt;br /&gt;
# Create process for on-demand update:&lt;br /&gt;
## User requests data for a certain article. There is a notice saying to check back later if the data is not up to date.&lt;br /&gt;
## Article is added to the update queue&lt;br /&gt;
## Process works through the update queue, getting all revisions since the most recent one in the database&lt;br /&gt;
## Each revision has references extracted and the database is updated&lt;br /&gt;
## To prevent excessive updates, if an article is already in the queue, it can&#039;t be re-added to the queue. And if an article was refreshed in the last hour, it is sent to the back of the queue.&lt;br /&gt;
## Eventually add a low-priority queue that cycles through English Wikipedia in alphabetical order so that there is always passive updating.&lt;br /&gt;
&lt;br /&gt;
On-demand updates for particular articles should tide me over until I have the opportunity to do a full rebuild.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; I can&#039;t take down the current instance on station1001 until the new one is set up. Once the new one is set up, that&#039;s the final service before I can begin setting up Proxmox on station1001.&lt;br /&gt;
&lt;br /&gt;
== 2025-10-25 Downloading dump for Gen3V2 ==&lt;br /&gt;
Downloading paused for re-networking. To resume:&lt;br /&gt;
 rsync -avP -e &amp;quot;ssh -J jh@154.29.79.171&amp;quot; jh@10.0.1.8:/bulk/public/wikimedia/enwiki/20250901/enwiki-20250901-pages-meta-history*.xml-p*.bz2 ./&lt;br /&gt;
&lt;br /&gt;
Downloading resumed. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 03:47, 28 October 2025 (UTC)&lt;br /&gt;
&lt;br /&gt;
Download came to a stop on the orb-blended VM I used because it filled up. So I created a new VM, wrdb-gen3v2, moved the files to that, and resumed the download there. [[User:Harej|Harej]] ([[User talk:Harej|talk]]) 00:16, 3 November 2025 (UTC)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/PubPeer&amp;diff=248093</id>
		<title>Project:Analytics/PubPeer</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/PubPeer&amp;diff=248093"/>
		<updated>2026-01-19T21:09:42Z</updated>

		<summary type="html">&lt;p&gt;Harej: Observation&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==API==&lt;br /&gt;
https://dashboards.pubpeer.com/docs/api#/operations/partner&lt;br /&gt;
&lt;br /&gt;
Relevant parameters:&lt;br /&gt;
* &amp;lt;code&amp;gt;page&amp;lt;/code&amp;gt;: start with &amp;lt;code&amp;gt;1&amp;lt;/code&amp;gt; then iterate based on whether there are more results&lt;br /&gt;
* &amp;lt;code&amp;gt;per_page&amp;lt;/code&amp;gt;: set at maximum value &amp;lt;code&amp;gt;300&amp;lt;/code&amp;gt;&lt;br /&gt;
* &amp;lt;code&amp;gt;sort&amp;lt;/code&amp;gt;: &lt;br /&gt;
* &amp;lt;s&amp;gt;&amp;lt;code&amp;gt;published_at&amp;lt;/code&amp;gt;&amp;lt;/s&amp;gt;: concerns when the document was &#039;&#039;published&#039;&#039;; I only care about &#039;&#039;comments&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
==Resources==&lt;br /&gt;
* Wikimedia Cloud Services&lt;br /&gt;
** Toolforge: project &amp;quot;pubpeer&amp;quot;&lt;br /&gt;
*** Don&#039;t think it&#039;s being used for anything&lt;br /&gt;
** Cloud VPS: project &amp;quot;wikicite&amp;quot;&lt;br /&gt;
*** VM wikicite-refsdb-proc-1.wikicite.eqiad1.wikimedia.cloud&lt;br /&gt;
*** Trove DB: ouqdvgrbzf3.svc.trove.eqiad1.wikimedia.cloud&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
=== Data collection and indexing ===&lt;br /&gt;
; PubPeer Data (index_pubpeer.py)&lt;br /&gt;
* &#039;&#039;&#039;Initial Seed:&#039;&#039;&#039;&lt;br /&gt;
** Starts from 2000-01-01&lt;br /&gt;
** Uses an initial large date window (2000-01-01 to 2014-12-31) followed by 30-day increments&lt;br /&gt;
** API query handles pagination (per_page: 300) and automatically reduces the time window if the result set hits the 10,000-record limit to ensure no data is missed&lt;br /&gt;
* &#039;&#039;&#039;Subsequent Builds:&#039;&#039;&#039;&lt;br /&gt;
** Triggered via &amp;lt;code&amp;gt;python index_pubpeer.py --update&amp;lt;/code&amp;gt;&lt;br /&gt;
** Identifies the latest_comment_date from the local database and starts fetching from that date to the present&lt;br /&gt;
* &#039;&#039;&#039;Database Updates:&#039;&#039;&#039;&lt;br /&gt;
** Updates the pubpeer_articles table&lt;br /&gt;
** Fields: id_pubpeer (URL), id_doi, id_pubmed, id_arxiv, title (truncated to 250 chars), and time_last_comment&lt;br /&gt;
; Wikipedia Citations (index_citations.py)&lt;br /&gt;
* &#039;&#039;&#039;Process:&#039;&#039;&#039;&lt;br /&gt;
** Pulls current external links from Wikimedia Cloud DB Replicas for DOI (org.doi.), PubMed (gov.nih.nlm.ncbi.pubmed.), and arXiv (org.arxiv.)&lt;br /&gt;
** Restricted to Main (0) and Draft (118) namespaces&lt;br /&gt;
** Matches these links against the local pubpeer_articles table&lt;br /&gt;
* &#039;&#039;&#039;Database Updates:&#039;&#039;&#039;&lt;br /&gt;
** &#039;&#039;&#039;wikipedia table:&#039;&#039;&#039; Stores language_code, mw_page_id, mw_page_title, and mw_talk_page_id. Page titles are refreshed during each run&lt;br /&gt;
** &#039;&#039;&#039;citations table:&#039;&#039;&#039; Maps id_pubpeer to id_wiki_page&lt;br /&gt;
** &#039;&#039;&#039;Stale Data:&#039;&#039;&#039; Automatically removes citations from the local database if the link has been removed from Wikipedia&lt;br /&gt;
&lt;br /&gt;
=== Reporting and wiki updates ===&lt;br /&gt;
* &#039;&#039;&#039;Wiki Maintenance:&#039;&#039;&#039;&lt;br /&gt;
** Automatically handles page moves and deletions via sync_wikipedia_titles.py before updating reports&lt;br /&gt;
* &#039;&#039;&#039;Report Generation (report.py):&#039;&#039;&#039;&lt;br /&gt;
** &#039;&#039;&#039;Alerts Report:&#039;&#039;&#039; Lists new citations (time_last_updated_table IS NULL) or existing citations with new comments (time_last_comment &amp;gt; time_last_updated_table)&lt;br /&gt;
** &#039;&#039;&#039;Most Affected Report:&#039;&#039;&#039; Lists Wikipedia articles with the highest number of unique PubPeer-commented citations&lt;br /&gt;
** &#039;&#039;&#039;Article List Reports:&#039;&#039;&#039; Large table and alphabetical subpages (/By article/A, etc.) listing all matched citations&lt;br /&gt;
** &#039;&#039;&#039;Frequency Report:&#039;&#039;&#039; Aggregates by PubPeer article to show which research is most cited across Wikipedia&lt;br /&gt;
* &#039;&#039;&#039;User Interactions (Dismissals):&#039;&#039;&#039; (pending working implementation)&lt;br /&gt;
** The bot reads the current wiki report and compares it to its previous version&lt;br /&gt;
** If an editor removes a row from a wiki table, the bot marks that citation as dismissed = TRUE in the database and stops reporting it&lt;br /&gt;
&lt;br /&gt;
=== Database Schema (schema.sql) ===&lt;br /&gt;
* &#039;&#039;&#039;wikipedia:&#039;&#039;&#039; id, language_code, mw_page_id, mw_page_title, mw_talk_page_id, librarybase_id&lt;br /&gt;
* &#039;&#039;&#039;pubpeer_articles:&#039;&#039;&#039; id_pubpeer, id_doi, id_pubmed, id_arxiv, title, time_last_comment&lt;br /&gt;
* &#039;&#039;&#039;citations:&#039;&#039;&#039;&lt;br /&gt;
** Links articles to wiki pages&lt;br /&gt;
** cited_id_type: 1 (DOI), 2 (PubMed), 3 (arXiv)&lt;br /&gt;
*** 0 is reserved for &amp;quot;other&amp;quot;&lt;br /&gt;
** time_last_updated_table: Tracks when the wiki report last included this citation&lt;br /&gt;
** time_last_talk_page_post: (Reserved for future talk page notifications)&lt;br /&gt;
** dismissed: Boolean flag for editor-driven dismissals&lt;br /&gt;
&lt;br /&gt;
=== Post-Implementation Status ===&lt;br /&gt;
* &#039;&#039;&#039;Talk Page Notifications:&#039;&#039;&#039; Code includes fields for tracking (time_last_talk_page_post), but the active workflow currently focuses on centralized reports (Wikipedia:PubPeer/*) rather than automated talk page posting&lt;br /&gt;
* &#039;&#039;&#039;Frequency of Runs:&#039;&#039;&#039; Designed to be run periodically (e.g., via cron) using the --update flag for index_pubpeer.py&lt;br /&gt;
* &#039;&#039;&#039;Dismissals Not Recognized:&#039;&#039;&#039; The logic to recognize user removals of report entries does not work yet and has been disabled.&lt;br /&gt;
&lt;br /&gt;
== 2026-01-19 Observation ==&lt;br /&gt;
Example edit: https://en.wikipedia.org/w/index.php?title=Wikipedia:PubPeer/By_article/Z&amp;amp;diff=prev&amp;amp;oldid=1333802277&lt;br /&gt;
&lt;br /&gt;
Sometimes, an article will have multiple PubPeer IDs, with different sets of comments on each. This causes the report generation process to be confused. Should probably canonize on a different identifier.&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248092</id>
		<title>Project:Analytics/Wikibases</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248092"/>
		<updated>2026-01-18T23:24:09Z</updated>

		<summary type="html">&lt;p&gt;Harej: What membership entails&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The &amp;quot;Cool Wikibase Network&amp;quot; includes Wikibases that James Hare created. Wikibases that were created by other people may be admitted into the network in the future.&lt;br /&gt;
&lt;br /&gt;
These wikis are:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;s&amp;gt;Open Miscellaneous Wikibase&amp;lt;/s&amp;gt; (January 2023–January 2026)&lt;br /&gt;
* [https://librarybase.org Librarybase] (May 2023–present)&lt;br /&gt;
* [https://internetdomains.wiki Internet Domains Wikibase] (May 2023–present)&lt;br /&gt;
* [https://wikibase.world Wikibase World] (June 2023–present)&lt;br /&gt;
* [https://communities.miraheze.org Miraheze Communities] (September 2025–present)&lt;br /&gt;
Membership in the Cool Wikibase Network mainly means being included in the [[Template:Cwbn]] footer featured on the member wikis, a callback to the days of web rings. It otherwise confers no material advantage or special privilege.&lt;br /&gt;
&lt;br /&gt;
==2026-01-16 Migration away from Wikibase.Cloud==&lt;br /&gt;
Wikibase Cloud is proposing new hosting requirements that would subject tenants to evaluation against arbitrary, shifting standards. To reduce operational uncertainty, I am moving critical systems off Wikibase.Cloud in favor of self-hosting.&lt;br /&gt;
* 2026-01-16: I migrated the majority of Open Miscellaneous Wikibase to Librarybase and then deleted the old wiki.&lt;br /&gt;
** InternetArchiveBot-related template tracking ([[Item:Q246948|example item]])&lt;br /&gt;
** [[Item:Q246951|MIT License modeling experiment]]&lt;br /&gt;
* 2026-01-18:&lt;br /&gt;
** Deleted the &amp;quot;credbot-daily&amp;quot; job on &amp;quot;vaccinesafety&amp;quot; Toolforge project in anticipation of migration.&lt;br /&gt;
** Posted [https://domains.wikibase.cloud/wiki/MediaWiki:Sitenotice Sitenotice on Internet Domains]&lt;br /&gt;
** Registered internetdomains.wiki&lt;br /&gt;
** [https://phabricator.wikimedia.org/T414893 Requested dump] of Internet Domains&lt;br /&gt;
&lt;br /&gt;
[[User:Harej|Harej]] ([[User talk:Harej|talk]]) 20:04, 18 January 2026 (UTC)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248091</id>
		<title>Project:Analytics/Wikibases</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248091"/>
		<updated>2026-01-18T20:12:45Z</updated>

		<summary type="html">&lt;p&gt;Harej: Linking to my Wikibases&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The &amp;quot;Cool Wikibase Network&amp;quot; includes Wikibases that James Hare created. Wikibases that were created by other people may be admitted into the network in the future.&lt;br /&gt;
&lt;br /&gt;
These wikis are:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;s&amp;gt;Open Miscellaneous Wikibase&amp;lt;/s&amp;gt; (January 2023–January 2026)&lt;br /&gt;
* [https://librarybase.org Librarybase] (May 2023–present)&lt;br /&gt;
* [https://internetdomains.wiki Internet Domains Wikibase] (May 2023–present)&lt;br /&gt;
* [https://wikibase.world Wikibase World] (June 2023–present)&lt;br /&gt;
* [https://communities.miraheze.org Miraheze Communities] (September 2025–present)&lt;br /&gt;
&lt;br /&gt;
==2026-01-16 Migration away from Wikibase.Cloud==&lt;br /&gt;
Wikibase Cloud is proposing new hosting requirements that would subject tenants to evaluation against arbitrary, shifting standards. To reduce operational uncertainty, I am moving critical systems off Wikibase.Cloud in favor of self-hosting.&lt;br /&gt;
* 2026-01-16: I migrated the majority of Open Miscellaneous Wikibase to Librarybase and then deleted the old wiki.&lt;br /&gt;
** InternetArchiveBot-related template tracking ([[Item:Q246948|example item]])&lt;br /&gt;
** [[Item:Q246951|MIT License modeling experiment]]&lt;br /&gt;
* 2026-01-18:&lt;br /&gt;
** Deleted the &amp;quot;credbot-daily&amp;quot; job on &amp;quot;vaccinesafety&amp;quot; Toolforge project in anticipation of migration.&lt;br /&gt;
** Posted [https://domains.wikibase.cloud/wiki/MediaWiki:Sitenotice Sitenotice on Internet Domains]&lt;br /&gt;
** Registered internetdomains.wiki&lt;br /&gt;
** [https://phabricator.wikimedia.org/T414893 Requested dump] of Internet Domains&lt;br /&gt;
&lt;br /&gt;
[[User:Harej|Harej]] ([[User talk:Harej|talk]]) 20:04, 18 January 2026 (UTC)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248090</id>
		<title>Project:Analytics/Wikibases</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Project:Analytics/Wikibases&amp;diff=248090"/>
		<updated>2026-01-18T20:04:03Z</updated>

		<summary type="html">&lt;p&gt;Harej: Created page with &amp;quot;==2026-01-16 Migration away from Wikibase.Cloud== Wikibase Cloud is proposing new hosting requirements that would subject tenants to evaluation against arbitrary, shifting standards. To reduce operational uncertainty, I am moving critical systems off Wikibase.Cloud in favor of self-hosting. * On 2026-01-16 I migrated the majority of Open Miscellaneous Wikibase to Librarybase and then deleted the old wiki. ** InternetArchiveBot-related template tracking (Item:Q246948|ex...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==2026-01-16 Migration away from Wikibase.Cloud==&lt;br /&gt;
Wikibase Cloud is proposing new hosting requirements that would subject tenants to evaluation against arbitrary, shifting standards. To reduce operational uncertainty, I am moving critical systems off Wikibase.Cloud in favor of self-hosting.&lt;br /&gt;
* On 2026-01-16 I migrated the majority of Open Miscellaneous Wikibase to Librarybase and then deleted the old wiki.&lt;br /&gt;
** InternetArchiveBot-related template tracking ([[Item:Q246948|example item]])&lt;br /&gt;
** [[Item:Q246951|MIT License modeling experiment]]&lt;br /&gt;
* 2026-01-18: deleted the &amp;quot;credbot-daily&amp;quot; job on &amp;quot;vaccinesafety&amp;quot; Toolforge project in anticipation of migration.&lt;br /&gt;
* Posted [https://domains.wikibase.cloud/wiki/MediaWiki:Sitenotice Sitenotice on Internet Domains]&lt;br /&gt;
* Registered internetdomains.wiki&lt;br /&gt;
* [https://phabricator.wikimedia.org/T414893 Requested dump] of Internet Domains&lt;br /&gt;
&lt;br /&gt;
[[User:Harej|Harej]] ([[User talk:Harej|talk]]) 20:04, 18 January 2026 (UTC)&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Template:Cwbn&amp;diff=248089</id>
		<title>Template:Cwbn</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Template:Cwbn&amp;diff=248089"/>
		<updated>2026-01-16T16:24:38Z</updated>

		<summary type="html">&lt;p&gt;Harej: Removing Open Misc&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;div style=&amp;quot;margin-top:2em&amp;quot;&amp;gt;&amp;lt;hr style=&amp;quot;width:5em; margin-bottom:0.75em;&amp;quot; /&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1.5em&amp;quot;&amp;gt;&#039;&#039;&#039;&#039;&#039;Cool Wikibase Network&#039;&#039;&#039;&#039;&#039;&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://wikibase.world Wikibase World]&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://librarybase.org Librarybase]&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://domains.wikibase.cloud Internet Domains]&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;margin-right:1em;&amp;quot;&amp;gt;[https://communities.miraheze.org Miraheze Communities]&amp;lt;/span&amp;gt;&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248088</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248088"/>
		<updated>2026-01-16T16:14:45Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P44: MIT&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248087</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248087"/>
		<updated>2026-01-16T16:14:19Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P42: Item:Q246968&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248086</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248086"/>
		<updated>2026-01-16T16:13:55Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P42: Item:Q246964&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248085</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248085"/>
		<updated>2026-01-16T16:13:54Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P42: Item:Q246966&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248084</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248084"/>
		<updated>2026-01-16T16:13:52Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P42: Item:Q246967&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248083</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248083"/>
		<updated>2026-01-16T16:13:50Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P42: Item:Q246965&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Property:P44&amp;diff=248082</id>
		<title>Property:P44</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Property:P44&amp;diff=248082"/>
		<updated>2026-01-16T16:11:23Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbeditentity-create:2|en */ SPDX identifier, license identifier&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248081</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248081"/>
		<updated>2026-01-16T16:09:46Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-update:2||1|5 */ Property:P42: Item:Q246953&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248080</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248080"/>
		<updated>2026-01-16T16:08:04Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-update:2||1|2 */ Property:P42: Item:Q246953&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248079</id>
		<title>Item:Q246951</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246951&amp;diff=248079"/>
		<updated>2026-01-16T16:07:08Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P42: Item:Q246953&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246968&amp;diff=248078</id>
		<title>Item:Q246968</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246968&amp;diff=248078"/>
		<updated>2026-01-16T16:05:35Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P19: Item:Q246952&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246968&amp;diff=248077</id>
		<title>Item:Q246968</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246968&amp;diff=248077"/>
		<updated>2026-01-16T16:05:26Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbeditentity-create:2|en */ work is copyrighted, license term&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246967&amp;diff=248076</id>
		<title>Item:Q246967</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246967&amp;diff=248076"/>
		<updated>2026-01-16T16:05:14Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P19: Item:Q246952&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246967&amp;diff=248075</id>
		<title>Item:Q246967</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246967&amp;diff=248075"/>
		<updated>2026-01-16T16:03:59Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbeditentity-create:2|en */ disclaimer of all liability, license term&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246966&amp;diff=248074</id>
		<title>Item:Q246966</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246966&amp;diff=248074"/>
		<updated>2026-01-16T16:03:47Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P19: Item:Q246952&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246966&amp;diff=248073</id>
		<title>Item:Q246966</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246966&amp;diff=248073"/>
		<updated>2026-01-16T16:03:38Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbeditentity-create:2|en */ disclaimer of all warranty, license term&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246965&amp;diff=248072</id>
		<title>Item:Q246965</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246965&amp;diff=248072"/>
		<updated>2026-01-16T16:03:18Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P19: Item:Q246952&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246965&amp;diff=248071</id>
		<title>Item:Q246965</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246965&amp;diff=248071"/>
		<updated>2026-01-16T16:03:08Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbeditentity-create:2|en */ requirement to distribute license text, license term&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246964&amp;diff=248070</id>
		<title>Item:Q246964</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246964&amp;diff=248070"/>
		<updated>2026-01-16T16:02:33Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P19: Item:Q246952&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246964&amp;diff=248069</id>
		<title>Item:Q246964</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246964&amp;diff=248069"/>
		<updated>2026-01-16T16:02:25Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbeditentity-create:2|en */ requirement to distribute copyright notice, license term&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246963&amp;diff=248068</id>
		<title>Item:Q246963</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246963&amp;diff=248068"/>
		<updated>2026-01-16T15:59:47Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P19: Item:Q246954&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246963&amp;diff=248067</id>
		<title>Item:Q246963</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246963&amp;diff=248067"/>
		<updated>2026-01-16T15:59:32Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbeditentity-create:2|en */ permit another to re-use under the same terms, mode of re-use&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
	<entry>
		<id>https://librarybase.org/w/index.php?title=Item:Q246962&amp;diff=248066</id>
		<title>Item:Q246962</title>
		<link rel="alternate" type="text/html" href="https://librarybase.org/w/index.php?title=Item:Q246962&amp;diff=248066"/>
		<updated>2026-01-16T15:59:02Z</updated>

		<summary type="html">&lt;p&gt;Harej: /* wbsetclaim-create:2||1 */ Property:P19: Item:Q246954&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Harej</name></author>
	</entry>
</feed>