Steps to reproduce
- Annotate any page on https://boffosocko.com
- View https://hypothes.is/search?q=url%3Aboffosocko.com%2F*
Expected behaviour
I would expect the titles of the various annotated posts displayed on H to be that of the <h1> tag on the annotated page or some other logical name based on a parsing algorithm.
Actual behaviour
The titles for almost all the annotations on my website (since 2016), regardless of the page they’re on, appear to be the incorrect title: “Top Ed-Tech Trends of 2015: Indie Ed-Tech”. The few exceptions seem to be the self-hosted .pdf files on my domain/storage that I’ve annotated.
Browser/system information
This appears to be browser and OS independent
Additional details
The only post on my website related to the title which appears seems to be https://boffosocko.com/2020/05/16/top-ed-tech-trends-of-2015-indie-ed-tech-audrey-watters/ which contains copies of the annotations I made on Audrey Waters’ page. See: http://hackeducation.com/2015/12/21/trends-indie#annotations:soppjJeoEeq9gccaKJdTtg
Some time around 2016 Audrey disabled annotations on her site (due to abuse, though it appears she’s since re-enabled them?). Is it possible that the H client has somehow cached the title of her post and is somehow mapping it as the title for all of the annotations made on my site? Having looked at the pages which have been annotated on my site, there’s nothing hiding in or related to the the meta data or rel=”canonical” links that would indicate that they should have the titles that H is finding for them.
Hi @chrisaldrich I believe this is a metadata issue rather than a bug, so I have transferred this issue to our support repo and asked our support team to take a look.
The ability to annotate archived material on the Internet Archive with Hypothes.is is definitely possible, and I do it from time to time. I’m not sure which browser or annotation tool (via, browser extensions, other) you’re using, but it’s possible that one or more combinations may have issues allowing you to do it or not. The standard browser extension on Chrome has worked well for me in the past.
Hypothes.is has methods for establishing document equivalency which archive.org apparently conforms. I did an academic experiment a few years back with an NYT article about books where you’ll see equivalent annotations on the original, the archived version, and a copy on my own site that has a
rel="canonical"link back to the original as well:https://www.nytimes.com/2017/01/16/books/obamas-secret-to-surviving-the-white-house-years-books.html
https://web.archive.org/web/20170119220705/https://www.nytimes.com/2017/01/16/books/obamas-secret-to-surviving-the-white-house-years-books.html
https://boffosocko.com/2017/01/19/obamas-secret-to-surviving-the-white-house-years-books-the-new-york-times/
I don’t recommend doing the rel-canonical trick on your own site frequently as I have noticed a bug, which I don’t think has been fixed.
The careful technologist with one tool or another, will see that I and a couple others have been occasionally delving into the archive and annotating Manfred Kuehn’s work. (I see at least one annotation from 2016, which was probably native on his original site before it was shut down in 2018.) I’ve found some great gems and leads into some historical work from his old site. In particular, he’s got some translations from German texts that I haven’t seen in other places.