Hypothesis caching the same post title for all annotations on a single site

Filed an Issue hypothesis / client by hypothes.ishypothes.is (GitHub)
The Hypothesis web-based annotation client. Contribute to hypothesis/client development by creating an account on GitHub.

Steps to reproduce

  1. Annotate any page on https://boffosocko.com
  2.  View https://hypothes.is/search?q=url%3Aboffosocko.com%2F* 
  3.  

Expected behaviour

I would expect the titles of the various annotated posts displayed on H to be that of the <h1> tag on the annotated page or some other logical name based on a parsing algorithm.

Actual behaviour

The titles for almost all the annotations on my website (since 2016), regardless of the page they’re on, appear to be the incorrect title: “Top Ed-Tech Trends of 2015: Indie Ed-Tech”. The few exceptions seem to be the self-hosted .pdf files on my domain/storage that I’ve annotated.

Browser/system information

This appears to be browser and OS independent

Additional details

The only post on my website related to the title which appears seems to be https://boffosocko.com/2020/05/16/top-ed-tech-trends-of-2015-indie-ed-tech-audrey-watters/ which contains copies of the annotations I made on Audrey Waters’ page. See: http://hackeducation.com/2015/12/21/trends-indie#annotations:soppjJeoEeq9gccaKJdTtg

Some time around 2016 Audrey disabled annotations on her site (due to abuse, though it appears she’s since re-enabled them?). Is it possible that the H client has somehow cached the title of her post and is somehow mapping it as the title for all of the annotations made on my site? Having looked at the pages which have been annotated on my site, there’s nothing hiding in or related to the the meta data or rel=”canonical” links that would indicate that they should have the titles that H is finding for them.

Published by

Chris Aldrich

I'm a biomedical and electrical engineer with interests in information theory, complexity, evolution, genetics, signal processing, IndieWeb, theoretical mathematics, and big history. I'm also a talent manager-producer-publisher in the entertainment industry with expertise in representation, distribution, finance, production, content delivery, and new media.

2 thoughts on “Hypothesis caching the same post title for all annotations on a single site”

  1. The ability to annotate archived material on the Internet Archive with Hypothes.is is definitely possible, and I do it from time to time. I’m not sure which browser or annotation tool (via, browser extensions, other) you’re using, but it’s possible that one or more combinations may have issues allowing you to do it or not. The standard browser extension on Chrome has worked well for me in the past.
    Hypothes.is has methods for establishing document equivalency which archive.org apparently conforms. I did an academic experiment a few years back with an NYT article about books where you’ll see equivalent annotations on the original, the archived version, and a copy on my own site that has a rel="canonical" link back to the original as well: 

    https://www.nytimes.com/2017/01/16/books/obamas-secret-to-surviving-the-white-house-years-books.html
    https://web.archive.org/web/20170119220705/https://www.nytimes.com/2017/01/16/books/obamas-secret-to-surviving-the-white-house-years-books.html
    https://boffosocko.com/2017/01/19/obamas-secret-to-surviving-the-white-house-years-books-the-new-york-times/

    I don’t recommend doing the rel-canonical trick on your own site frequently as I have noticed a bug, which I don’t think has been fixed.
    The careful technologist with one tool or another, will see that I and a couple others have been occasionally delving into the archive and annotating Manfred Kuehn’s work. (I see at least one annotation from 2016, which was probably native on his original site before it was shut down in 2018.) I’ve found some great gems and leads into some historical work from his old site. In particular, he’s got some translations from German texts that I haven’t seen in other places.

Mentions

Leave a Reply

Your email address will not be published. Required fields are marked *

To respond to a post on this site using your own website, create your post making sure to include the (target) URL/permalink for my post in your response. Then enter the URL/permalink of your response in the (source) box and click the 'Ping me' button. Your response will appear (possibly after moderation) on my page. Want to update or remove your response? Update or delete your post and re-enter your post's URL again. (Learn More)