🔖 [1806.00871] A Framework for Aggregating Private and Public Web Archives | arXiv

Bookmarked [1806.00871] A Framework for Aggregating Private and Public Web Archives by Mat Kelly, Michael L. Nelson, Michele C. Weigle (arxiv.org)
Personal and private Web archives are proliferating due to the increase in the tools to create them and the realization that Internet Archive and other public Web archives are unable to capture personalized (e.g., Facebook) and private (e.g., banking) Web pages. We introduce a framework to mitigate issues of aggregation in private, personal, and public Web archives without compromising potential sensitive information contained in private captures. We amend Memento syntax and semantics to allow TimeMap enrichment to account for additional attributes to be expressed inclusive of the requirements for dereferencing private Web archive captures. We provide a method to involve the user further in the negotiation of archival captures in dimensions beyond time. We introduce a model for archival querying precedence and short-circuiting, as needed when aggregating private and personal Web archive captures with those from public Web archives through Memento. Negotiation of this sort is novel to Web archiving and allows for the more seamless aggregation of various types of Web archives to convey a more accurate picture of the past Web.
Syndicated copies to:
Replied to Chris Jones on Twitter (Twitter)
A1. Personally I have multiple Wordpress sites, and I'm working on a stand-alone domain for my higher-end, highest-value #content .. all are candidates to aggregate relevant insights, yes? Pros/Cons? #smchat #contentseries

@sourcePOV I like @jgmac1106‘s general idea, but having less overhead to manage and administer appealed to me a lot. Here’s some thoughts on what I ended up doing: http://boffosocko.com/2016/12/18/rss-feeds-a-follow-up-on-my-indieweb-commitment-2017/

tl;dr: Having fewer sites to deal with seemed like a stronger idea. I still wanted to feature the richer content over the smaller tidbits while also not overwhelming people who had subscribed in the past. I also took into account trying to make it relatively easy for people to subscribe to the particular data they want/need out of my website. My home page has a list of various post kinds available which may be useful to think about as well.

@mrkrndvs has played around in this area of aggregation as well with a few different websites and may have some insight too.

Whatever you decide, be sure to have some fun along the way.

Syndicated copies to:

🔖 PressForward in the classroom

Bookmarked PressForward in the classroom (pressforward.org)
Do you want to work with students to publish class assignments or research? Instructors use PressForward in the classroom to consolidate and review student assignments, help students learn to survey their fields, and create opportunities for collaboration, communication, and research. The Lewis & Clark College Environmental Studies Program produces Environment Across Boundaries, a student-led publication that cultivates interdisciplinary perspectives on environmental issues. Participation gives students an opportunity to engage with their discipline through experiential, project based learning. They develop skills both in their field and with a suite of digital tools.

An interesting use case for PressForward: creating a “planet” website to aggregate and/or showcase work of students in an entire classroom who are all posting content to their own separate web spaces.

Sketch idea: create a standalone WordPress site for a course, install the PressForward plugin, input the RSS feeds for students’ websites to aggregate all their work collectively into one space. Various ideas include:

  • Use the feed for students and teacher to keep up with the entire classroom.
  • Publish an OPML file for students to easily subscribe to all feeds in their feed reader of choice.
  • Optionally publish the highlights of the best work or even all of it.
  • Teachers could use the feed to check that students are posting/keeping up with assignments for grading purposes.
  • Use the read/unread functionality to “mark” pieces as graded/ungraded or seen/unseen.
  • Use the internal commenting system to keep private notes on student’s work.
  • Create output feeds for specific tags and/or categories
  • Works with any student sites that produce feeds, not just WordPress, so students have choices of different CMSes.
  • Use the nomination functionality to quickly aggregate and disseminate online sources for classroom assignments or readings.

I had contemplated planet like aggregation at the recent WPCampus online conference. It’s interesting to see that PressForward has considered it as a use case as well though I’d love to hear about or see examples of this in the wild.

How else could this rich, multi-functional Swiss Army knife-like plugin be used in education?

Cartoon diagram with a funnel collecting content from sites like YouTube, WordPress, Drupal, Blogger, RSS being aggregated into a single computer-based website.
Diagram of PressForward functionality as a content hub or “planet” courtesy of PressForward.org
Syndicated copies to:

Reply to Laying the Standards for a Blogging Renaissance by Aaron Davis

Replied to Laying the Standards for a Blogging Renaissance by Aaron Davis (Read Write Respond)
With the potential demise of social media, does this offer a possible rebirth of blogging communities and the standards they are built upon?

Aaron, some excellent thoughts and pointers.

A lot of your post also reminds me of Bryan Alexander’s relatively recent post I defy the world and to go back to RSS.

I completely get the concept of what you’re getting at with harkening back to the halcyon days of RSS. I certainly love, use, and rely on it heavily both for consumption as well as production. Of course there’s also still the competing standard of Atom still powering large parts of the web (including GNU Social networks like Mastodon). But almost no one looks back fondly on the feed format wars…

I think that while many are looking back on the “good old days” of the web, that we not forget the difficult and fraught history that has gotten us to where we are. We should learn from the mistakes made during the feed format wars and try to simplify things to not only move back, but to move forward at the same time.

Today, the easier pared-down standards that are better and simpler than either of these old and and difficult specs is simply adding Microformat classes to HTML (aka P.O.S.H) to create feeds. Unless one is relying on pre-existing infrastructure like WordPress, building and maintaining RSS feed infrastructure can be difficult at best, and updates almost never occur, particularly for specifications that support new social media related feeds including replies, likes, favorites, reposts, etc. The nice part is that if one knows how to write basic html, then one can create a simple feed by hand without having to learn the mark up or specifics of RSS. Most modern feed readers (except perhaps Feedly) support these new h-feeds as they’re known. Interestingly, some CMSes like WordPress support Microformats as part of their core functionality, though in WordPress’ case they only support a subsection of Microformats v1 instead of the more modern v2.

For those like you who are looking both backward and simultaneously forward there’s a nice chart of “Lost Infractructure” on the IndieWeb wiki which was created following a post by Anil Dash entitled The Lost Infrastructure of Social Media. Hopefully we can take back a lot of the ground the web has lost to social media and refashion it for a better and more flexible future. I’m not looking for just a “hipster-web”, but a new and demonstrably better web.

The Lost Infrastructure of the Web from the IndieWeb Wiki (CC0)

Some of the desire to go back to RSS is built into the problems we’re looking at with respect to algorithmic filtering of our streams (we’re looking at you Facebook.) While algorithms might help to filter out some of the cruft we’re not looking for, we’ve been ceding too much control to third parties like Facebook who have different motivations in presenting us material to read. I’d rather my feeds were closer to the model of fine dining rather than the junk food that the-McDonald’s-of-the-internet Facebook is providing. As I’m reading Cathy O’Neil’s book Weapons of Math Distraction, I’m also reminded that the black box that Facebook’s algorithm is is causing scale and visibility/transparency problems like the Russian ad buys which could have potentially heavily influenced the 2017 election in the United States. The fact that we can’t see or influence the algorithm is both painful and potentially destructive. If I could have access to tweaking a third-party transparent algorithm, I think it would provide me a lot more value.

As for OPML, it’s amazing what kind of power it has to help one find and subscribe to all sorts of content, particularly when it’s been hand curated and is continually self-dogfooded. One of my favorite tools are readers that allow one to subscribe to the OPML feeds of others, that way if a person adds new feeds to an interesting collection, the changes propagate to everyone following that feed. With this kind of simple technology those who are interested in curating things for particular topics (like the newsletter crowd) or even creating master feeds for class material in a planet-like fashion can easily do so. I can also see some worthwhile uses for this in journalism for newspapers and magazines. As an example, imagine if one could subscribe not only to 100 people writing about #edtech, but to only their bookmarked articles that have the tag edtech (thus filtering out their personal posts, or things not having to do with edtech). I don’t believe that Feedly supports subscribing to OPML (though it does support importing OPML files, which is subtly different), but other readers like Inoreader do.

I’m hoping to finish up some work on my own available OPML feeds to make subscribing to interesting curated content a bit easier within WordPress (over the built in, but now deprecated link manager functionality.) Since you mentioned it, I tried checking out the OPML file on your blog hoping for something interesting in the #edtech space. Alas… 😉 Perhaps something in the future?

Syndicated copies to: