Project:Analytics/PubPeer

From Librarybase
< Project:Analytics
Revision as of 23:58, 7 January 2026 by Harej (talk | contribs) (Start)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

API

https://dashboards.pubpeer.com/docs/api#/operations/partner

Relevant parameters:

  • page: start with 1 then iterate based on whether there are more results
  • per_page: set at maximum value 300
  • sort:
  • published_at: concerns when the document was published; I only care about comments

Resources

  • Wikimedia Cloud Services
    • Toolforge: project "pubpeer"
    • Cloud VPS: project "wikicite", Trove DB instance

Process

  • Initial seed:
    • Build pageset
    • Start from 2006-01-01...2025-12-31
    • Iterate through as many pages as needed to get to the end
    • Build internal database:
      • id_pubpeer (key)
      • id_doi (update on conflict)
      • id_pubmed (update on conflict)
      • id_arxiv (update on conflict)
      • time_last_notified_wiki (null when created)
      • time_most_recent_comment (on conflict, update if submitted > stored)
  • Subsequent builds:
    • Get most recent time_most_recent_comment from database
    • Start from that date...present day
    • Iterate through as many result pages as needed (probably only one page)
    • Submit into database, which should transparently handle conflicts