Hypothesis caching the same post title for all annotations on a single site

The Hypothesis web-based annotation client. Contribute to hypothesis/client development by creating an account on GitHub.

Steps to reproduce

Annotate any page on https://boffosocko.com
View https://hypothes.is/search?q=url%3Aboffosocko.com%2F*

Expected behaviour

I would expect the titles of the various annotated posts displayed on H to be that of the <h1> tag on the annotated page or some other logical name based on a parsing algorithm.

Actual behaviour

The titles for almost all the annotations on my website (since 2016), regardless of the page they’re on, appear to be the incorrect title: “Top Ed-Tech Trends of 2015: Indie Ed-Tech”. The few exceptions seem to be the self-hosted .pdf files on my domain/storage that I’ve annotated.

Browser/system information

This appears to be browser and OS independent

Additional details

The only post on my website related to the title which appears seems to be https://boffosocko.com/2020/05/16/top-ed-tech-trends-of-2015-indie-ed-tech-audrey-watters/ which contains copies of the annotations I made on Audrey Waters’ page. See: http://hackeducation.com/2015/12/21/trends-indie#annotations:soppjJeoEeq9gccaKJdTtg

Some time around 2016 Audrey disabled annotations on her site (due to abuse, though it appears she’s since re-enabled them?). Is it possible that the H client has somehow cached the title of her post and is somehow mapping it as the title for all of the annotations made on my site? Having looked at the pages which have been annotated on my site, there’s nothing hiding in or related to the the meta data or rel=”canonical” links that would indicate that they should have the titles that H is finding for them.

Published by

Chris Aldrich

I'm a biomedical and electrical engineer with interests in information theory, complexity, evolution, genetics, signal processing, IndieWeb, theoretical mathematics, and big history. I'm also a talent manager-producer-publisher in the entertainment industry with expertise in representation, distribution, finance, production, content delivery, and new media. View all posts by Chris Aldrich

2 thoughts on “Hypothesis caching the same post title for all annotations on a single site”

katelyn says: @ github.com

February 22, 2021 at 10:55 am

Hi @chrisaldrich I believe this is a metadata issue rather than a bug, so I have transferred this issue to our support repo and asked our support team to take a look.

Reply
Chris Aldrich says: @ boffosocko.com

August 14, 2022 at 9:32 am

The ability to annotate archived material on the Internet Archive with Hypothes.is is definitely possible, and I do it from time to time. I’m not sure which browser or annotation tool (via, browser extensions, other) you’re using, but it’s possible that one or more combinations may have issues allowing you to do it or not. The standard browser extension on Chrome has worked well for me in the past.
Hypothes.is has methods for establishing document equivalency which archive.org apparently conforms. I did an academic experiment a few years back with an NYT article about books where you’ll see equivalent annotations on the original, the archived version, and a copy on my own site that has a rel="canonical" link back to the original as well:

https://www.nytimes.com/2017/01/16/books/obamas-secret-to-surviving-the-white-house-years-books.html
https://web.archive.org/web/20170119220705/https://www.nytimes.com/2017/01/16/books/obamas-secret-to-surviving-the-white-house-years-books.html
https://boffosocko.com/2017/01/19/obamas-secret-to-surviving-the-white-house-years-books-the-new-york-times/

I don’t recommend doing the rel-canonical trick on your own site frequently as I have noticed a bug, which I don’t think has been fixed.
The careful technologist with one tool or another, will see that I and a couple others have been occasionally delving into the archive and annotating Manfred Kuehn’s work. (I see at least one annotation from 2016, which was probably native on his original site before it was shut down in 2018.) I’ve found some great gems and leads into some historical work from his old site. In particular, he’s got some translations from German texts that I haven’t seen in other places.

Reply

Mentions

💬 Chris Aldrich