👓 Why Not Blog? | Kathleen Fitzpatrick

Read Why Not Blog? by Kathleen Fitzpatrick (Kathleen Fitzpatrick)
My friend Alan Jacobs, a key inspiration in my return (such as it is, so far) to blogging and RSS and a generally pre-Twitter/Facebook outlook on the scholarly internet, is pondering the relationship between blogging and other forms of academic writing in thinking about his next project. Perhaps needless to say, this is something I’m considering as well, and I’m right there with him in most regards.

Highlights, Quotes, Annotations, & Marginalia

The blog was not just the venue in which I started putting together the ideas that became my second book, the one that made promotion and various subsequent jobs possible, but it was also the way that I was able to demonstrate that there might be a readership for that second book, without which it’s much less likely that a press would have been interested.  

This sounds like she’s used her blog as both a commonplace book and an author platform.

In fact blog posts are not the kind of thing one can detail on one’s annual review form, and even a blog in the aggregate doesn’t have a place in which it’s easy to be claimed as a site of ongoing scholarly productivity.  

Mine have gone more like (1) having some vague annoying idea with a small i; (b) writing multiple blog posts thinking about things related to that idea; (iii) giving a talk somewhere fulminating about some other thing entirely; (4) wondering if maybe there are connections among those things; (e) holy carp, if I lay the things I’ve been noodling about over the last year and a half out in this fashion, it could be argued that I am in the middle of writing a book!  

Here’s another person talking about blogs as “thought spaces” the same way that old school bloggers like Dave Winer and Om Malik amongst many others have in the past. While I’m thinking about it I believe that Colin Walker and Colin Devroe have used this sort of idea as well.

👓 Retroactive Webmentioning | Peter Rukavina

Read Retroactive Webmentioning by Peter Rukavina (ruk.ca)
By way of testing out my Webmention module for Drupal, I took the 256 posts I’ve written here this year, ferreted out all the external links, discovered their Webmention endpoints, and sent a Webmention. Those 256 posts contained 840 links in total; of those links, 149 were to a target that suppor...
There are some interesting/useful statistics here. There’s also an interesting kernel of an idea about how one links to one’s own website internally as well. I find this very intriguing with respect to owning a digital commonplace book. Perhaps there are some ways to modify IndieMap for extracting some useful metadata out of one’s own website?
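
As a rough illustration of the process Peter describes (collect a post's external links, then look for each target's Webmention endpoint), here is a minimal Python sketch. It is not his Drupal module. The function names and example domains are hypothetical, and it only inspects a page's HTML for rel="webmention"; the Webmention spec also says to check the HTTP Link header first, which this sketch skips.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect <a href> targets and the first rel="webmention" endpoint in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []
        self.endpoint = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        href = attrs.get("href")
        if href is None:
            return
        rels = (attrs.get("rel") or "").lower().split()
        if tag in ("link", "a") and "webmention" in rels and self.endpoint is None:
            self.endpoint = href
        elif tag == "a":
            self.hrefs.append(href)


def external_links(post_html, post_url):
    """Return absolute http(s) links in a post that point at other hosts."""
    collector = LinkCollector()
    collector.feed(post_html)
    own_host = urlparse(post_url).netloc
    absolute = [urljoin(post_url, href) for href in collector.hrefs]
    return [
        url for url in absolute
        if urlparse(url).scheme in ("http", "https")
        and urlparse(url).netloc != own_host
    ]


def discover_endpoint(target_html, target_url):
    """Return the target's advertised Webmention endpoint, or None if absent."""
    collector = LinkCollector()
    collector.feed(target_html)
    if collector.endpoint is None:
        return None
    # Endpoints may be relative, so resolve them against the target URL.
    return urljoin(target_url, collector.endpoint)
```

Counting how many of a site's outbound links return a non-None endpoint would give the kind of statistic quoted above.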

An Outline for Using Hypothesis for Owning your Annotations and Highlights

I was so taken with Ian O’Byrne’s righteous excitement in his video the other day over the realization that he could potentially own his online annotations using Hypothesis that I thought I’d take a moment to outline a few methods I’ve used.

There are certainly variations of ways for attempting to own one’s own annotations using Hypothesis and syndicating them to one’s website (via a PESOS workflow), but I thought I’d outline the quickest version I’m aware of that requires little to no programming or code, but also allows some relatively pretty results. While some of the portions below are WordPress specific, there’s certainly no reason they couldn’t be implemented for other systems.

Saving individual annotations one at a time

Here’s an easy method for taking each individual annotation you create on Hypothesis and quickly porting it to your site:

Create an IFTTT.com recipe to port your Hypothesis RSS feed into WordPress posts. Generally choose an “If RSS, then WordPress” setup and use the following data to build the recipe:

  • Input feed: https://hypothes.is/stream.atom?user=username (change username to your user name)
  • Optional title: 📑 {{EntryTitle}}
  • Body: {{EntryContent}} from {{EntryUrl}} <br />{{EntryPublished}}
  • Categories: Highlight (use whatever categories you prefer, but be aware they’ll apply to all your future posts from this feed)
  • Tags: hypothes.is
  • Post status (optional): I set mine to “Draft” so I have the option to keep it privately or to publish it publicly at a later date.

Modify any of the above fields as necessary for your needs. IFTTT.com usually polls your feed every 10-15 minutes. You can usually pretty quickly take this data and turn it into your post kind of preference–suggestions include read, bookmark, like, favorite, or even reply. Add additional categories, tags, or other metadata as necessary for easier searching at a later time.
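
For those who would rather script this than use IFTTT, the same mapping can be sketched in a few lines of Python. This is only an illustration under assumptions: the function names are hypothetical, and it assumes each Atom entry carries title, link, published, and content elements. It builds draft dictionaries and leaves the actual posting to WordPress to whatever method you prefer.

```python
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"  # Atom namespace prefix for ElementTree


def hypothesis_feed_url(username):
    """Build the per-user feed URL used as the recipe's input feed."""
    return f"https://hypothes.is/stream.atom?user={username}"


def drafts_from_atom(atom_xml, category="Highlight", tag="hypothes.is"):
    """Map each Atom entry onto the recipe's title, body, category and tag fields."""
    root = ET.fromstring(atom_xml)
    drafts = []
    for entry in root.iter(f"{ATOM}entry"):
        title = entry.findtext(f"{ATOM}title", default="")
        content = entry.findtext(f"{ATOM}content", default="")
        published = entry.findtext(f"{ATOM}published", default="")
        link = entry.find(f"{ATOM}link")
        url = link.get("href", "") if link is not None else ""
        drafts.append({
            "title": f"📑 {title}",
            "body": f"{content} from {url} <br />{published}",
            "categories": [category],
            "tags": [tag],
            "status": "draft",  # mirrors the optional "Draft" post status
        })
    return drafts
```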

Here’s an example of one on my website that uses this method. I’ve obviously created a custom highlight post kind of my own for more specific presentation as well as microformats markup.

A highlight from Hypothesis posted on my own website using some customized code to create a “Highlight post” using the Post Kinds Plugin.

Aggregating lots of annotations on a single page

If you do a lot of annotations on Hypothesis and prefer to create a bookmark or read post that aggregates all of your annotations on a given post, the quickest way I’ve seen on WordPress to export your data is to use the Hypothesis Aggregator plugin [GitHub].

  • Create a tag “key” for a particular article by creating an acronym from the article title followed by the date and then the author’s initials. This will allow you to quickly conglomerate all the annotations for a particular article or web page. As an example for this article I’d use: OUHOAH062218CA. In addition to any other necessary tags, I’ll tag each of my annotations on the particular article with this somewhat random, yet specific key for which there are unlikely to be any other similar tags in my account.
  • Create a bookmark, read, reply or other post kind to which you’ll attach your annotations. I often use a bookmarklet for speed here.
  • Use the Hypothesis Aggregator’s short code for your tag and username to pull your annotations for the particular tag. It will look like this:
    [hypothesis user = 'username' tags = 'tagname']

    If you’re clever, you could include this shortcode in the body of your IFTTT recipe (if you’re using drafts) and simply change the tag name to the appropriate one to save half a step or need to remember the shortcode format each time.
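
The tag key and shortcode from the steps above can also be generated rather than typed by hand. The following Python sketch is one possible reading of the scheme: the helper names are hypothetical, and the list of skipped small words is an assumption inferred from the example key OUHOAH062218CA, which leaves out words like “an”, “for”, “your”, and “and”.

```python
from datetime import date

# Assumption: short connecting words are skipped when building the acronym.
SMALL_WORDS = {"a", "an", "the", "and", "or", "of", "for", "to", "in", "on", "with", "your", "my"}


def make_tag_key(title, published, author):
    """Acronym of the title + MMDDYY date + author initials."""
    title_part = "".join(
        word[0].upper() for word in title.split() if word.lower() not in SMALL_WORDS
    )
    author_part = "".join(word[0].upper() for word in author.split())
    return f"{title_part}{published.strftime('%m%d%y')}{author_part}"


def aggregator_shortcode(username, tag):
    """Format the Hypothesis Aggregator shortcode for one user and one tag."""
    return f"[hypothesis user = '{username}' tags = '{tag}']"


key = make_tag_key(
    "An Outline for Using Hypothesis for Owning your Annotations and Highlights",
    date(2018, 6, 22),
    "Chris Aldrich",
)
print(key)
print(aggregator_shortcode("username", key))
```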

If you’re worried that Hypothes.is may eventually shut down, that the plugin may quit working (leaving you with ugly short codes in your post), or both, you can add the following steps as a quick work-around.

  • Input the shortcode as above, click on the “Preview” button in WordPress’s Publish meta box which will open a new window and let you view your post.
  • Copy the preview of the annotations you’d like to keep in your post and paste them over your shortcode in the Visual editor tab on your draft post. (This will maintain the simple HTML formatting tags, which you can also edit or supplement if you like.)
  • I also strip out the additional unnecessary data from Hypothesis Aggregator about the article it’s from as well as the line about who created the annotation which isn’t necessary as my post will implicitly have that data. Depending on how you make your post (i.e. not using the Post Kinds Plugin), you may want to keep it.

As Greg McVerry kindly points out, Jon Udell has created a simple web-tool for inputting a few bits of data about a set of annotations to export them variously in HTML, CSV, or JSON format. If you’re not a developer and don’t want to fuss with Hypothesis’ API, this is also a reasonably solid method of quickly exporting subsections of your annotations and cutting and pasting them onto your website. It does export a lot more data than one might want for their site and could require some additional clean up, particularly in HTML format.
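
For those who do want to fuss with the API, a trimmed-down export might look something like the Python sketch below. It is not Jon Udell’s tool. The helper names are hypothetical, and the search endpoint, its user/tag/limit parameters, and the row fields used here (created, uri, text, tags, and a TextQuoteSelector holding the highlighted passage) reflect my understanding of the Hypothesis API and should be checked against its documentation. The sketch only builds the query URL and converts already-fetched rows to CSV, keeping just the columns one is likely to want.

```python
import csv
import io
from urllib.parse import urlencode


def search_url(username, tag, limit=200):
    """Build a search query for one user's annotations carrying one tag."""
    query = urlencode({"user": f"acct:{username}@hypothes.is", "tag": tag, "limit": limit})
    return f"https://api.hypothes.is/api/search?{query}"


def quoted_text(row):
    """Pull the highlighted passage out of an annotation row, if present."""
    for target in row.get("target", []):
        for selector in target.get("selector", []):
            if selector.get("type") == "TextQuoteSelector":
                return selector.get("exact", "")
    return ""


def rows_to_csv(rows):
    """Keep only date, page, highlight, note and tags for each annotation."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["created", "uri", "quote", "note", "tags"])
    for row in rows:
        writer.writerow([
            row.get("created", ""),
            row.get("uri", ""),
            quoted_text(row),
            row.get("text", ""),
            " ".join(row.get("tags", [])),
        ])
    return out.getvalue()
```

The rows would come from the "rows" list of the JSON response to the search URL.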

Perhaps with some elbow grease and coding skill, sometime in the future, we’ll have a simple way to implement a POSSE workflow that will allow you to post your annotations to your own website and syndicate them to services like Hypothesis. In the meantime, hopefully this will help close a little of the data gap for those using their websites as their commonplace books or digital notebooks.

📺 Open science: Michael Nielsen at TEDxWaterloo | YouTube

Watched Open science: Michael Nielsen at TEDxWaterloo by Michael Nielsen from YouTube

Michael Nielsen is one of the pioneers of quantum computation. Together with Ike Chuang of MIT, he wrote the standard text in the field, a text which is now one of the twenty most highly cited physics books of all time. He is the author of more than fifty scientific papers, including invited contributions to Nature and Scientific American. His research contributions include involvement in one of the first quantum teleportation experiments, named as one of Science Magazine's Top Ten Breakthroughs of the Year for 1998. Michael was a Fulbright Scholar at the University of New Mexico, and has worked at Los Alamos National Laboratory, as the Richard Chace Tolman Prize Fellow at Caltech, as Foundation Professor of Quantum Information Science at the University of Queensland, and as a Senior Faculty Member at the Perimeter Institute for Theoretical Physics. Michael left academia to write a book about open science, and the radical change that online tools are causing in the way scientific discoveries are made.

Sadly this area of science hasn’t opened up as much as it likely should have in the intervening years. More scientists need to be a growing part of the IndieWeb movement and owning their own data, their content, and, yes, even their own publishing platforms. With even simple content management systems like WordPress researchers can actively practice academic samizdat to a much greater extent and take a lot of the centralized power away from the major journal and textbook publishing enterprises.

I can easily see open web technology like the Webmention spec opening up online scientific communication and citations drastically even to the point of quickly replacing tools like Altmetric. If major publishing wants something to do perhaps they could work on the archiving and aggregation portions?

What if one could publish a research paper or journal article on one’s own (or one’s lab’s) website? It could receive data via webmention about others who are bookmarking it, reading it, highlighting and annotating it. It could also accept webmention replies as part of a greater peer-review process–the equivalent of the researcher hosting their own pre-print server as well as their own personal journal and open lab notebook.
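
To make the mechanics a little more concrete: a Webmention is just a form-encoded POST carrying a source URL (the page doing the mentioning) and a target URL (the page being mentioned). Here is a minimal Python sketch of what a reader’s site might send to a paper’s endpoint. The function name and all of the URLs are hypothetical placeholders, and endpoint discovery is assumed to have happened already.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_webmention(endpoint, source, target):
    """Build (but do not send) the POST request that notifies the target's site."""
    body = urlencode({"source": source, "target": target}).encode("utf-8")
    return Request(
        endpoint,
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


# Hypothetical example: an annotation on a reader's site mentions a lab's paper.
request = build_webmention(
    "https://lab.example/webmention",
    "https://reader.example/notes/1",
    "https://lab.example/papers/1",
)
```

Passing the request to urllib.request.urlopen would actually send it; the receiving site then verifies that the source really links to the target before displaying it.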

We need to help empower scientists to be the center of their own writing and publishing. For those interested, this might be a useful starting point: https://indieweb.org/Indieweb_for_Education

Reply to Open Science notebooks | Ryan Barrett

Replied to a post by Ryan Barrett (snarfed.org)
Notebooks like Jupyter and Observable are great for research, data science, and really any interactive computing or documentation. I want to start using them for ops/SRE projects too. Thomas Kluyver‘s bash_kernel works, but has lots of rough edges. Anyone have any other ideas?
I’ve been watching that space for a few years. Apparently you saw the same article push them into the broader mainstream consciousness. I would mention Mathematica, but you’re certainly aware of it. There are a few other math-related platforms I’ve used, but I suspect they’re not within the realm you’re looking for.

I’ve seen one or two much smaller projects along the lines of bash_kernel, but they’re either in incredibly rough shape or have very limited scopes or very niche uses. There’s a reasonably interesting list of open science related resources on GitHub, but it’s a tad old and some of the projects on it have merged or changed drastically since it was started. Foster has some interesting material and resources on open science if you care to dig through it. One day I’ll delve into the Open Science Framework to see if they’ve got anything I haven’t seen before too.

I keep meaning to document people who are using their own websites for pieces of this type of thing, but most are doing it in a hybrid fashion. Carl Boettiger is certainly a good example[1][2] and may be aware of some additional resources including one he helps manage.

Reply to a reply to Dan Cohen tweet

Replied to Reply to Dan Cohen tweet by Chris Aldrich (BoffoSocko)
Dan, There are a lot of moving pieces in your question and a variety of ways to implement them depending on your needs and particular website set up. Fortunately there are lots of educators playing around in these spaces already who are experimenting with various means and methods as well as some of their short and long term implications.
@jbj Given the number of people I’ve seen experimenting over the past months, I’d be happy to put together a series of short pieces for @ProfHacker covering the areas of overlap between research, academic publishing, samizdat, commonplace books, etc. Essentially tighter versions of some of https://boffosocko.com/research/indieweb/ but specifically targeting the education space using WordPress, Known, and Grav. Let me know if you’d accept submissions for the community.

Reply to tkasasagi tweet

Replied to a tweet by tkasasagi (Twitter)
I hope you do blog about it, I’m sure many would find it useful. I’ve been using my own website as a commonplace book for a while now, not only for blogging as you’ve considered, but also to bookmark interesting things, to highlight and make notes of what I read, and generally use it as my online notebook/research/study space. I do post some personal tidbits, but a large amount of what I post (both research and personal) is actually private and only viewable by me. Perhaps worth considering as you continue your studies which others have interest in as well?

👓 Self-platforming, DoOO, and academic workflows | Tim Clarke

Read Self-platforming, DoOO, and academic workflows by Tim Clarke (simulacrumbly.com)
I see self-platforming as an expression of my own digital citizenship, and I also see it as my deliberate answer to the call for digital sanctuary.  The frequency and extent to which educators urge students onto extractive applications is of great concern.  Self-platforming offers opportunities to benefit from the collaborative, hyper-textual, asynchronous, and distributed qualities of the web, while diminishing the costs — often hidden to us — of working on proprietary and extractive platforms.
I love that Tim is looking closely at how the choices of tools he’s using can potentially impact his students/readers. I’ve also been in the boat he’s in–trying to wrangle some simple data in a way that makes it easy to collect, read, and disseminate content for myself, students, and other audiences.

Needing to rely on five or more outside services (Twitter, Instapaper, Pinboard, bit.ly, and finally even Canvas, where some of them are paid services) seems just painful and excessive. He mentions the amount and level of detail he’s potentially giving away to just bit.ly, but each of these services is taking a bite out of the process. Of course this doesn’t take into consideration the fact that Instapaper is actually a subsidiary of Betaworks, the company that owns and controls bit.ly, so there’s even more personal detail being consumed and aggregated there than he may be aware of. All this is compounded by the fact that Instapaper is currently completely blocking its users within the EU because it hasn’t been able to comply with the privacy and personal data details/restrictions of the GDPR. Naturally, there are currently no restrictions on it in the U.S. or other parts of the world.

I (and many others) have been hacking away for the past several years in trying to tame much of our personal data in a better way to own it and control it for ourselves. And isn’t this part of the point of having a domain of one’s own? Even his solution of using Shaarli to self-host his own bookmarks, while interesting, seems painful to me in some aspects. Though he owns and controls the data, because it sits on a separate domain it’s not as tightly integrated into his primary site or as easily searched. To be even more useful, it needs additional coding and integration into his primary site which appears to run on WordPress. With the givens, it looks more like he’s spending some additional time running his own separate free-standing social media silo just for bookmarks. Why not have it as part of his primary personal hub online?

I’ve been watching a growing trend of folks both within the IndieWeb/DoOO and edtech spaces begin using their websites like a commonplace book to host a growing majority of their own online and social related data. This makes it all easier to find, reference, consume, and even create new content in the future. On their own sites, they’re conglomerating all their data about what they’re reading, highlighting, annotating, bookmarking, liking, favoriting, and watching in addition to their notes and thoughts. When appropriate, they’re sharing that content publicly (more than half my website is hidden privately on my back end, but still searchable and useful only to me) or even syndicating it out to social sites like Twitter, Facebook, Flickr, Instapaper, et al. to share it within other networks.

Some other examples of educators and researchers doing this other than myself include Aaron Davis, Greg McVerry, John Johnson, and more recently W. Ian O’Byrne and Cathie LeBlanc among many others. Some have chosen to do it on their primary site while others are experimenting using two or even more. I would hope that as Tim explores, he continues to document his process as well as the pros and cons of what he does and the resultant effects. But I also hope he discovers this growing community of scholars, teachers, programmers and experimenters who have been playing in the same space so that he knows he’s not alone and perhaps to prevent himself from going down some rabbit holes some of us have explored all too well. Or to use what may be a familiar bit of lingo to him, I hope he joins our impromptu, but growing personal learning network (PLN).

👓 Why I Love Link Blogging | BirchTree

Read Why I Love Link Blogging (BirchTree)
More often than not, I write articles for this site after reading something someone else wrote. I browse the web for articles and tweets that I find interesting, and the ones that make me think are very often the ones that inspire me to write something myself. This leads to a funny situation as a w...
How many levels deep could the link blogging on these posts go? Is it linkblogging all the way down?

For me, I’ll add it specifically to my linkblog of things I’ve read which is a subsection of my collected linkblog which also collects favorites, likes, bookmarks, and sites I’m following.

Incidentally, this seems to be another post about people who use their websites for thinking and writing, which I seem to be coming across many of lately. I ought to collect them all into a group and write a piece about them and the general phenomenon.

IndieWeb Journalism in the Wild

I noticed a few days ago that professor and writer John Naughton not only has his own website but is posting to it both his own content and (excerpted) content he’s writing for other journalistic outlets, lately in his case for The Guardian. This is awesome for so many reasons. The primary reason is that I can follow him via his own site and get his personally posted content, which informs his longer pieces, without needing to follow him in multiple locations to get the “firehose” of everything he’s writing and thinking about. While The Guardian and The Observer are great, perhaps I don’t want to filter through multiple hundreds of articles to find his particular content or potentially risk missing it? What if he were writing for 5 or more other outlets? Then I’d need to delve in deeper still and carry a multitude of subscriptions and their attendant notifications to get something that should rightly emanate from one location–him! While he may not be posting his status updates or Tweets to his own website first–as I do–I’m at least able to get the best and richest of his content in one place. Additionally, the way he’s got things set up, The Guardian and others are still getting the clicks (for advertising’s sake) while I still get the simple notifications I’d like to have so I’m not missing what he writes.

His site certainly provides an interesting example of either POSSE or PESOS in the wild, particularly from an IndieWeb for Journalism or even an IndieWeb for Education perspective. I suspect his article posts occur on the particular outlet first and he’s excerpting them with a link to that “original”. (Example: A post on his site with a link to a copy on The Guardian.) I’m not sure whether he’s (ideally) physically archiving the full post there on his site (and hiding it privately as both a personal and professional portfolio of sorts) or if they’re all there on the respective pages, but just hidden behind the “read more” button he’s providing. I will note that his WordPress install is giving a rel=”canonical” link to itself rather than the version at The Guardian, which also has a rel=”canonical” link on it. I’m curious to take a look at how Google indexes and ranks the two pages as a result.
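
Checking which URL a page declares as canonical is easy to script. The Python sketch below is a minimal illustration with hypothetical function names: given a page’s HTML and its URL, it reports the declared canonical URL and whether the page points at itself, which is the situation described above for both the excerpt and the original.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class CanonicalFinder(HTMLParser):
    """Remember the href of the first <link rel="canonical"> in a page."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        rels = (attrs.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rels and self.canonical is None:
            self.canonical = attrs.get("href")


def canonical_url(page_html, page_url):
    """Return the absolute canonical URL a page declares, or None."""
    finder = CanonicalFinder()
    finder.feed(page_html)
    if finder.canonical is None:
        return None
    return urljoin(page_url, finder.canonical)


def is_self_canonical(page_html, page_url):
    """True when the page names itself, not another copy, as canonical."""
    canonical = canonical_url(page_html, page_url)
    return canonical is not None and canonical.rstrip("/") == page_url.rstrip("/")
```

A syndicated excerpt that deferred to the original would return False here, with canonical_url giving the outlet’s address instead.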

In any case, this is a generally brilliant set up for any researcher, professor, journalist, or other stripe of writer for providing online content, particularly when they may be writing for a multitude of outlets.

I’ll also note that I appreciate the ways in which it seems he’s using his website almost as a commonplace book. This provides further depth into his ideas and thoughts to see what sources are informing and underlying his other writing.

Alas, if only the rest of the world used the web this way…

Replied to a tweet by Matt Reed (Twitter)
Wish Twitter would distinguish between "favorite" and "save for later." People could infer some pretty misleading things...
Intent on Twitter is often so muddled, this is the last thing some might worry about. (Yet it’s still a tremendous tool.) Pocket has browser extensions, and I know the one for Chrome has a setting one can toggle to make an icon appear on Twitter that allows bookmarking things to read later directly within your Pocket account, which is generally a reasonable experience.

Pocket’s browser extension can add a much better “save to read for later” button to one’s Twitter feed.

I think the much stronger and better solution for one’s personal commonplace book is to simply add these intents to one’s own website and either favorite, bookmark, mark as read, repost, reply to, annotate, highlight, or just about “anything else” them there and syndicate the appropriate response to Twitter separately. (Examples: bookmarks and reads.) This makes it much more difficult to muddle the intent. It’ll also give you a much more highly searchable set of data that you can own on your own website.

Why wait around for Twitter or another social service to build the tools you want/need when it’s relatively easy to cobble them together for yourself on a variety of open source platforms? While you’re at it, remove some of the other limitations like 280 characters as well…

Replied to A post on Mastodon by wiobyrne (Scholar Social)
@mrkrndvs I've really been interested in the collection/curation I see from you & @chrisaldrich - I'm in the process of figuring out how to build up a WordPress site to serve as the "commonplace book" on the WordPress site, keep it simple, and have it pump into my weekly newsletter. Any links/guidance/plugins on how to make this happen is definitely appreciated. I want to keep this as simple/lightweight as possible. :)
@wiobyrne@scholar.social @mrkrndvs@mastodon.cloud

I’ve written in a little bit of depth before about Organizing my research related reading.

My favorite piece of the puzzle is using the Post Kinds Plugin for WordPress which has some useful browser bookmarklets for quickly saving what I read, bookmark, and interact with online. Having the ability to add categories, tags, and other behind-the-scenes metadata also helps a lot.

You can then turn around and add all this to a newsletter at your leisure.

Happy to share more, but this should be a good start.

Thoughts on linkblogs, bookmarks, reads, likes, favorites, follows, and related links

Within the social media space there’s a huge number of services that provide a variety of what I would call bookmark-type functionality of one sort or another. They go under a variety of monikers including bookmarks, likes, favorites, stars, reads, follows, claps, and surely many quirky others. Each platform has created its own semantics which don’t always overlap with the others.

Because I’m attempting to own all of my own data, I’ve roughly mapped many of these intents into my own website. But because I have the ultimate control over them, I get to form my own personal definitions. I also have a lot more control over them in addition to adding other metadata to each for better after-the-fact search and use within my personal online commonplace book. As such, I thought it might be useful to lay out some definitions (both for myself and others) for how I view these on my website.

At the basest level, I look at most of these interactions simply as URL permalinks to interesting content and their aggregation as a “linkblog”, or a feed of interesting links I’ve come across. The specific names given to them imply a level of specificity about what I think exactly makes them interesting.

In addition to a bookmark specific feed, which by itself could be considered a “traditional” linkblog, my site also has separate aggregated feeds for things I’ve liked, read, followed, and favorited. It’s the semantic reasons for saving or featuring these pieces of content which ultimately determine which names they have. (For those interested in subscribing to one or more, or all of these, one can add /feed/ to the ends of the specific types’ URLs, which I’ve linked, for an RSS feed. Thus, for example, http://boffosocko.com/type/link/feed/ will give you the RSS feed for the “Master” linkblog that includes all the bookmarks, likes, reads, follows, and favorites.)
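
The feed rule is mechanical enough to express in a line of code. In this small Python sketch the function name is hypothetical; it simply appends /feed/ to a type archive URL whether or not the URL already ends in a slash.

```python
def feed_url(type_url):
    """Append /feed/ to a post-type archive URL to get its RSS feed."""
    return type_url.rstrip("/") + "/feed/"


print(feed_url("http://boffosocko.com/type/link/"))
```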

On my site, I try to provide a title for the content and some type of synopsis of what the content is about. These help to provide some context to others seeing them as well as a small reminder to me of what they were about. When appropriate/feasible, I’ll try to include an image for similar reasons. I’ll also often add a line of text or two as a commentary or supplement to my thoughts on the piece. Finally, I add an icon to help to quickly visually indicate which of the types of posts each is, so they can be more readily distinguished when seen in aggregate.

In roughly decreasing order of importance or value to me, I would rank them as follows (with their attached meanings as I view them on my site):

  1. Favorite – This is often something which might easily have had designations of bookmark, like, and/or read, or even multiple of them at the same time. In any case they’re often things which I personally find important or valuable in the long term. There are far fewer of these than any of the other types of linkblog-like posts.
  2. Follow – Indicating that I’m now following a person, organization, or source of future content which I deem to have enough regular constant value to my life that I want to be able to see what that source is putting out on a regular basis. Most often these sources have RSS feeds which I consume in a feed reader, but frequently they’ll appear on other social silos which I will have ported into a feed reader as well. Of late I try to be much more selective in what I’m following and why. I also categorize sources based on topics of value to me. Follows often include sources which I have either previously often liked or bookmarked or suspect I would like or bookmark frequently in the future. For more details see: A Following Page (aka some significant updates to my Blogroll) and the actual Following page.
  3. Read – These are linkblog-like posts which I found interesting enough for one reason or another to have actually spent the time to read in their entirety. For things I wish to highlight or found most interesting, I’ll often add additional thought or commentary in conjunction with the post.
  4. Like – Depending on the content, these posts may not always have been read in their entirety, but I found them more interesting than the majority of content which I’ve come across. Most often these posts serve to show my appreciation for the original source of the related post as a means of saying “congratulations”, “kudos”, “good job”, or in cases of more personal level content “I appreciate this”, “you’re awesome”, or simply as the tag says “I liked this.”
  5. Bookmark – Content which I find interesting, but might not necessarily have the time to deal with at present. Often I’ll wish to circle back to the content at some future point and engage with it at a deeper level. Bookmarking it prevents me from losing track of it altogether. I may optionally add a note about how the content came to my attention to be able to better remember it at a future time. While there are often things here which others might have “liked” or “favorited” on other social silos, on my site these things have been found interesting enough to have been bookmarked, but I haven’t personally read into them enough yet to form any specific opinion about them beyond their general interest to me or potentially followers interested in various category tags I use. I feel like this is the lowest level of interaction, and one in which I see others often like, favorite, or even repost on other social networks without having actually read anything other than the headline, if they’ve even bothered to do that. In my case, however, I more often than not actually come back to the content while others on social media rarely, if ever, do.

While occasionally some individual specimens of each might “outrank” others in the category above this is roughly the order of how I perceive them. Within this hierarchy, I do have some reservations about including the “follow” category, which in some sense I feel stands apart from the continuum represented by the others. Still it fits into the broader category of a thing with a URL, title, and high interest to me. Perhaps the difference is that it represents a store of future potentially useful information that hasn’t been created or consumed yet? An unseen anti-library of people instead of books in some sense of the word.
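
For anyone wanting to apply the same ordering to their own data, the hierarchy above could be modeled as a simple ranked enumeration. This Python sketch is purely illustrative: the names and the shape of the post dictionaries are hypothetical and do not reflect how the Post Kinds Plugin stores anything.

```python
from enum import IntEnum


class Kind(IntEnum):
    """Linkblog post kinds; a lower number means higher importance."""
    FAVORITE = 1
    FOLLOW = 2
    READ = 3
    LIKE = 4
    BOOKMARK = 5


def by_importance(posts):
    """Sort posts (dicts with a "kind" key) from most to least important."""
    return sorted(posts, key=lambda post: Kind[post["kind"].upper()])


posts = [
    {"kind": "bookmark", "title": "Something to come back to"},
    {"kind": "favorite", "title": "Something of long-term value"},
    {"kind": "read", "title": "Something read in full"},
]
ordered_titles = [post["title"] for post in by_importance(posts)]
```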

I might also include the Reply post type toward the top of the list, but for some time I’ve been categorizing these as “statuses” or “note-like” content rather than as “links”. These obviously have a high priority if lumped in as I’ve not only read and appreciated the underlying content, but I’ve spent the time and thought to provide a reasoned reply, particularly in cases where the reply has taken some time to compose. I suppose I might more likely include these as linkblog content if I didn’t prefer readers to value them more highly than if they showed up in those feeds. In some sense, I value the replies closer on par to my longer articles for the value of not only my response, but for that of the original posts themselves.

In general, if I take the time to add additional commentary, notes, highlights, or other marginalia, then the content obviously resonated with me much more than those which stand as simple links with titles and descriptions.

Perhaps in the near future, I’ll write about how I view these types on individual social media platforms. Often I don’t post likes/favorites from social platforms to my site as they have less meaning to me directly and likely even less meaning to my audiences here. I suppose I could aggregate them here on my site privately, but I have many of the same questions and issues that Peter Molnar brings up in his article Content, Bloat, privacy, archives.

I’m curious to hear how others apply meaning to their linkblog type content especially since there’s such a broad range of meaning from so many social sites. Is there a better way to do it all? Is it subtly different on sites which don’t consider themselves (or act as) commonplace books?

Organizing my research related reading

There’s so much great material out there to read and not nearly enough time. The question becomes: “How to best organize it all, so you can read even more?”

I just came across a tweet from Michael Nielsen about the topic, which is far deeper than even a few tweets could do justice to, so I thought I’d sketch out a few basic ideas about how I’ve been approaching it over the last decade or so. Ideally I’d like to circle back around to this and better document more of the individual aspects or maybe even make a short video, but for now this will hopefully suffice to add to the conversation Michael has started.

Keep in mind that this is an evolving system which I still haven’t completely perfected (and may never), but to a great extent it works relatively well and I still easily have the ability to modify and improve it.

Overall Structure

The first piece of the overarching puzzle is to have a general structure for finding, collecting, triaging, and then processing all of the data. I’ve essentially built a simple funnel system for collecting all the basic data in the quickest manner possible. With the basics down, I can later skim through various portions to pick out the things I think are the most valuable and move them along to the next step. Ultimately I end up reading the best pieces on which I make copious notes and highlights. I’m still slowly trying to perfect the system for best keeping all this additional data as well.

Since I’ve seen so many apps and websites come and go over the years and lost lots of data to them, I far prefer to use my own personal website for doing a lot of the basic collection, particularly for online material. Toward this end, I use a variety of web services, RSS feeds, and bookmarklets to quickly accumulate the important pieces into my personal website which I use like a modern day commonplace book.

Collecting

In general, I’ve been using the Inoreader feed reader to track a large variety of RSS feeds from various clearinghouse sources (including things like ProQuest custom searches) down to individual researchers’ blogs as a means of quickly pulling in large amounts of research material. It’s one of the more flexible readers out there with a huge number of useful features including the ability to subscribe to OPML files, which many readers don’t support.

As a simple example, arXiv.org has an RSS feed for the topic of “information theory” at http://arxiv.org/rss/math.IT which I subscribe to. I can quickly browse through the feed and, based on titles and/or abstracts, “star” the items I find most interesting within the reader. I have a custom recipe set up for the IFTTT.com service that pulls in all these starred articles and creates new posts for them on my WordPress blog. To these posts I can add a variety of metadata including top level categories and lower level tags in addition to any other metadata I’m interested in.
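The browse-and-star step is something I do by hand in the reader, but as a rough illustration of the funnel idea, here is a minimal sketch of a keyword pre-filter over a feed. The sample feed, the `example.org` links, and the function names are all hypothetical, and a real feed may use a different layout than the plain RSS 2.0 assumed here.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in plain RSS 2.0; a real feed may be laid out differently.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Sample research feed</title>
    <item>
      <title>Entropy bounds for channel codes</title>
      <link>http://example.org/abs/0001</link>
      <description>We study entropy and capacity.</description>
    </item>
    <item>
      <title>A note on graph colorings</title>
      <link>http://example.org/abs/0002</link>
      <description>Unrelated combinatorics.</description>
    </item>
  </channel>
</rss>"""


def parse_items(feed_xml):
    """Return one dict (title, link, description) per <item> in the feed."""
    root = ET.fromstring(feed_xml)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
            "description": (item.findtext("description") or "").strip(),
        })
    return items


def shortlist(items, keywords):
    """Keep items whose title or description mentions any keyword."""
    wanted = [k.lower() for k in keywords]
    kept = []
    for entry in items:
        text = (entry["title"] + " " + entry["description"]).lower()
        if any(k in text for k in wanted):
            kept.append(entry)
    return kept


starred = shortlist(parse_items(SAMPLE_FEED), ["entropy", "capacity"])
for entry in starred:
    print(entry["title"], "->", entry["link"])
```

A filter like this only narrows the pile; deciding what is actually worth reading still happens later in the triage step.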

I have similar incoming funnel entry points via many other web services. On platforms like Twitter, I have workflows that use services like IFTTT.com or Zapier to push URLs easily to my website. I can quickly “like” a tweet and a background process will suck that tweet and any URLs within it into my system for future processing. This type of workflow extends to a variety of sites where I might consume potential material I want to read and process. (Think academic social services like Mendeley, Academia.edu, Diigo, or even less academic ones like Twitter, LinkedIn, etc.) Many of these services have storage ability and simple browser bookmarklets that allow me to add material to them. So with a quick click, it’s saved to the service and then automatically ported into my website almost without friction.

My WordPress-based site uses the Post Kinds Plugin which takes incoming website URLs and does a very solid job of parsing those pages to extract much of the primary metadata I’d like to have without requiring a lot of work. For well structured web pages, it’ll pull in the page title, authors, date published, date updated, synopsis of the page, categories and tags, and other bits of data automatically. All these fields are also editable and searchable. Further, the plugin allows me to configure simple browser bookmarklets so that with a simple click on a web page, I can pull its URL and associated metadata into my website almost instantaneously. I can then add a note or two about what made me interested in the piece and save it for later.
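To give a feel for what this kind of page parsing involves, here is a minimal sketch using only Python’s standard library. It is not how the Post Kinds Plugin itself works (that is a WordPress plugin written in PHP); the sample page, the set of meta tags collected, and the class name are assumptions chosen for illustration.

```python
from html.parser import HTMLParser


class MetadataParser(HTMLParser):
    """Collect the <title> text and a few selected <meta> tags from a page."""

    # Hypothetical selection of meta keys worth keeping.
    WANTED = {"author", "description", "article:published_time"}

    def __init__(self):
        super().__init__()
        self.metadata = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            # Some pages use name="...", others property="..." (Open Graph style).
            key = attrs.get("name") or attrs.get("property")
            if key in self.WANTED and attrs.get("content"):
                self.metadata[key] = attrs["content"]

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.metadata["title"] = self.metadata.get("title", "") + data


# Hypothetical page with made-up values.
SAMPLE_PAGE = """<html><head>
<title>An Example Article</title>
<meta name="author" content="A. Author">
<meta name="description" content="A short synopsis of the page.">
<meta property="article:published_time" content="2018-01-01">
</head><body><p>Post body.</p></body></html>"""

parser = MetadataParser()
parser.feed(SAMPLE_PAGE)
metadata = {key: value.strip() for key, value in parser.metadata.items()}
print(metadata)
```

Pages that lack these tags simply yield fewer fields, which is why keeping every field editable afterward matters.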

Note here that I’m usually more interested in saving material for later as quickly as I possibly can. In this part of the process, I’m rarely interested in reading anything immediately. I’m most interested in finding it, collecting it for later, and moving on to the next thing. This is also highly useful for things I find during a busy day that I can’t make time for at the moment.

As an example, here’s a book I’ve bookmarked to read simply by clicking “like” on a tweet I came across late last year. You’ll notice at the bottom of the post, I’ve optionally syndicated copies of the post to other platforms to “spread the wealth” as it were. Perhaps others following me via other means may see it and find it useful as well?

Triaging

At regular intervals during the week I’ll sit down for an hour or two to triage all the papers and material I’ve been sucking into my website. This typically involves reading through lots of abstracts in a bit more detail to better figure out what I want to read now and what I’d like to read at a later date. I can delete out the irrelevant material if I choose, or I can add follow up dates to custom fields for later reminders.
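The follow-up dates can be thought of as a simple filter over the saved items. Here is a minimal sketch, assuming each saved item carries an optional `follow_up` date; the field name and the sample items are hypothetical stand-ins for a custom field on a post.

```python
from datetime import date

# Hypothetical saved items; "follow_up" plays the role of a custom field.
saved = [
    {"title": "Paper A", "follow_up": date(2018, 3, 1)},
    {"title": "Paper B", "follow_up": None},
    {"title": "Paper C", "follow_up": date(2018, 1, 15)},
]


def due_for_review(items, today):
    """Return items whose follow-up date has arrived, oldest first."""
    due = [
        item for item in items
        if item["follow_up"] is not None and item["follow_up"] <= today
    ]
    return sorted(due, key=lambda item: item["follow_up"])


due = due_for_review(saved, today=date(2018, 2, 1))
for item in due:
    print(item["follow_up"].isoformat(), item["title"])
```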

Slowly but surely I’m funneling down a tremendous amount of potential material into a smaller, more manageable amount that I’m truly interested in reading on a more in-depth basis.

Document storage

Calibre with GoodReads sync

Even for things I’ve winnowed down, there is still a relatively large amount of material, much of it I’ll want to save and personally archive. For a lot of this function I rely on the free multi-platform desktop application Calibre. It’s essentially an iTunes-like interface, but it’s built specifically for e-books and other documents.

Within it I maintain a small handful of libraries: one for personal e-books, one for research related textbooks/e-books, and another for journal articles. It has a very solid interface and is extremely flexible in terms of configuration and customization. You can create a large number of custom libraries and create your own searchable and sortable fields with a huge variety of metadata. It often does a reasonable job of importing e-books, .pdf files, and other digital media and parsing out their metadata, which saves one from needing to do some of that work manually. With some well maintained metadata, one can very quickly search and sort a huge number of documents as well as quickly prioritize them for action. Additionally, the system does a pretty solid job of converting files from one format to another, so that things like converting an .epub file into a .mobi format for Kindle are automatic.

Calibre stores the physical documents either in local computer storage, or even better, in the cloud using any of a variety of services including Dropbox, OneDrive, etc. so that one can keep one’s documents in the cloud and view them from a variety of locations (home, work, travel, tablet, etc.)

I’ve been a very heavy user of GoodReads.com for years to bookmark and organize my physical and e-book library and anti-libraries. Calibre has an exceptional plugin for GoodReads that syncs data across the two. This plugin (and a few others) is exceptionally good at pulling in missing metadata, which minimizes the amount that must be entered by hand, a task that can be tedious.

Within Calibre I can manage my physical books, e-books, journal articles, and a huge variety of other document related forms and formats. I can also use it to further triage the things I intend to read and order them to the nth degree. My current Calibre libraries have over 10,000 documents in them including over 2,500 textbooks as well as records of most of my 1,000+ physical books. Calibre can also be used to add records for documents that one would ultimately like to acquire, but doesn’t currently have access to.

BibTeX and reference management

In addition to everything else, Calibre also has some well customized pieces that let all its metadata double as a reference management system. It’ll allow one to export data in a variety of formats for document publishing and reference management, including BibTeX amongst many others.
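For readers who haven’t seen BibTeX, here is a minimal sketch of turning one metadata record into an entry. This is not Calibre’s exporter; the record layout, the citation-key scheme, and the function name are assumptions for illustration only.

```python
def to_bibtex(record):
    """Format one metadata record (a dict) as a BibTeX @article entry."""
    # Hypothetical key scheme: first author's surname plus the year.
    surname = record["authors"][0].split()[-1].lower()
    key = f"{surname}{record['year']}"
    fields = {
        "author": " and ".join(record["authors"]),
        "title": record["title"],
        "journal": record["journal"],
        "year": str(record["year"]),
    }
    lines = [f"@article{{{key},"]
    lines += [f"  {name} = {{{value}}}," for name, value in fields.items()]
    lines.append("}")
    return "\n".join(lines)


record = {
    "authors": ["Claude E. Shannon"],
    "title": "A Mathematical Theory of Communication",
    "journal": "Bell System Technical Journal",
    "year": 1948,
}
entry = to_bibtex(record)
print(entry)
```

The sketch skips details a real exporter has to handle, such as escaping special characters and choosing entry types other than `@article`.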

Reading, Annotations, Highlights

Once I’ve winnowed down the material I’m interested in it’s time to start actually reading. I’ll often use Calibre to directly send my documents to my Kindle or other e-reading device, but one can also read them on one’s desktop with a variety of readers, or even from within Calibre itself. With a click or two, I can automatically email documents to my Kindle and Calibre will also auto-format them appropriately before doing so.

Typically I’ll send them to my Kindle which allows me a variety of easy methods for adding highlights and marginalia. Sometimes I’ll read .pdf files via desktop and use Adobe to add highlights and marginalia as well. When I’m done with a .pdf file, I’ll just resave it (with all the additions) back into my Calibre library.

Exporting highlights/marginalia to my website

For Kindle related documents, once I’m finished, I’ll use direct text file export or tools like clippings.io to export my highlights and marginalia for a particular text into simple HTML and import it into my website system along with all my other data. I’ve briefly written about some of this before, though I ought to better document it. All of this then becomes very easily searchable and sort-able for future potential use as well.
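As a rough picture of the text-to-HTML step, here is a minimal sketch that parses a clippings-style text file and emits simple HTML. The file layout shown (a title line, a metadata line, a blank line, the text, then a line of equals signs) is an assumption about the export format, and the sample content and function names are hypothetical; this is not how clippings.io works internally.

```python
import html

# Hypothetical sample; the real export layout may differ.
SAMPLE_CLIPPINGS = """An Example Book (A. Author)
- Your Highlight on page 12 | Added on Monday, January 1, 2018

A first highlighted passage.
==========
An Example Book (A. Author)
- Your Note on page 13 | Added on Monday, January 1, 2018

A marginal note about <this> passage.
==========
"""


def parse_clippings(raw):
    """Split the raw text into entries with a source, a kind, and the text."""
    entries = []
    for chunk in raw.split("=========="):
        lines = [line.strip() for line in chunk.strip().splitlines() if line.strip()]
        if len(lines) < 3:
            continue  # skip empty or incomplete chunks
        kind = "note" if "Your Note" in lines[1] else "highlight"
        entries.append({
            "source": lines[0],
            "kind": kind,
            "text": " ".join(lines[2:]),
        })
    return entries


def to_html(entries):
    """Render highlights as blockquotes and notes as paragraphs."""
    parts = []
    for entry in entries:
        text = html.escape(entry["text"])
        if entry["kind"] == "highlight":
            parts.append(f"<blockquote>{text}</blockquote>")
        else:
            parts.append(f"<p><em>Note:</em> {text}</p>")
    return "\n".join(parts)


entries = parse_clippings(SAMPLE_CLIPPINGS)
markup = to_html(entries)
print(markup)
```

Escaping the text before wrapping it in tags keeps stray angle brackets in a note from breaking the page it is pasted into.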

Here’s an example of some public notes, highlights, and other marginalia I’ve posted in the past.

Synthesis

Over time, I’ve built up a huge amount of research related data in my personal online commonplace book that is highly searchable and sortable! I also have the option to make these posts and pages public, private, or even password protected. I can create accounts on my site for collaborators to use and view private material that isn’t publicly available. I can also share posts via social media and use standards like webmention and tools like brid.gy so that comments and interactions with these pieces on platforms like Facebook, Twitter, Google+, and others are imported back to the relevant portions of my site as comments. (I’m doing it with this post, so feel free to try it out yourself by commenting on one of the syndicated copies.)
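For the curious, a webmention notification itself is small: a form-encoded POST carrying a `source` URL (the page that mentions something) and a `target` URL (the page being mentioned). Here is a minimal sketch that builds such a request without sending it. The endpoint and both URLs are hypothetical, and the step of discovering a site’s actual endpoint is left out.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_webmention(endpoint, source, target):
    """Build (but do not send) a webmention POST request."""
    body = urlencode({"source": source, "target": target}).encode("ascii")
    return Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


# Hypothetical endpoint and URLs for illustration.
request = build_webmention(
    "https://example.com/webmention",
    "https://example.org/my-reply",
    "https://example.com/original-post",
)
print(request.get_method(), request.full_url)
print(request.data.decode("ascii"))
```

Passing the request to `urllib.request.urlopen` would actually send it; in practice plugins and services handle this so one rarely does it by hand.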

Now when I’m ready to begin writing something about what I’ve read, I’ve got all the relevant pieces, notes, and metadata in one centralized location on my website. Synthesis becomes much easier. I can even have open drafts of things as I’m reading and begin laying things out there directly if I choose. Because it’s all stored online, it’s immediately available from almost anywhere I can connect to the web. As an example, I used a few portions of this workflow to actually write this post.

Continued work

Naturally, not all of this is static and it continues to improve and evolve over time. In particular, I’m doing continued work on my personal website so that I’m able to own as much of the workflow and data there as possible. Ideally I’d love to have all of the Calibre related pieces on my website as well.

Earlier this week I even had conversations about creating new post types on my website related to things that I want to read to potentially better display and document them explicitly. When I can I try to document some of these pieces either here on my own website or on various places on the IndieWeb wiki. In fact, the IndieWeb for Education page might be a good place to start browsing for those interested.

One of the added benefits of having a lot of this data on my own website is that it not only serves as my research/data platform, but it also has the traditional ability to serve as a publishing and distribution platform!

Currently, I’m doing most of my research related work in private or draft form on the back end of my website, so it’s not always publicly available, though I often think I should make more of it public, both for its value as an aggregation and for the benefit it might provide in improving scientific communication. Just think, if you were interested in some of the obscure topics I am and you could have a pre-curated RSS feed of all the things I’ve filtered through piped into your own system… now multiply this across hundreds of thousands of other scientists. Michael Nielsen posts some useful things to his Twitter feed and his website, but what I wouldn’t give to see far more of who and what he’s following, bookmarking, and actually reading! While many might find these minutiae tedious, I guarantee that people in his associated fields would find some serious value in it.

I’ve tried hundreds of other apps and tools over the years, but more often than not, they only cover a small fraction of the necessary moving pieces within the much larger apparatus that a working researcher and writer requires. This means that one often ends up using dozens of specialized tools across which there’s a huge duplication of data and effort. It also presumes these tools will be around for more than a few years and allow easy import/export of one’s hard fought for data and the time invested in using them.

If you’re aware of something interesting in this space that might be useful, I’m happy to take a look at it. Even if I might not use the service itself, perhaps it’s got a piece of functionality that I can recreate into my own site and workflow somehow?

If you’d like help in building and fleshing out a system similar to the one I’ve outlined above, I’m happy to help do that too.

Related posts

🔖 The Story of Your Life: Using WordPress as Your Memory Warehouse

Bookmarked The Story of Your Life: Using WordPress as Your Memory Warehouse by Brianna Privett (WordCamp US 2017)
The Personal Web of the 1990s/early 2000s was the first wave of online diarists and bloggers who use the web as a platform to chronicle and share their daily lives. WordPress came out of this movement, and is now in its second decade. 2017 marks 20 years that I’ve been using the web to create and archive memories, and 12 years that I’ve been doing it with WordPress. I’ve learned a few things about creating a real and permanent record of a lifetime on the ephemeral digital landscape, and together we’ll discuss how to use WordPress to create your own home on the web. We’ll cover topics such as how to maintain your (and your family’s) privacy, using WordPress to build a keepsake repository your friends and family can contribute to, and how to ensure that these digital spaces are available as a legacy for lifetimes to come.
I can’t wait until WordPress.TV (presumably) posts this up in a few weeks. This sounds a lot like Brianna’s talking about a web-enabled commonplace book, a topic which intrigues me greatly and the purpose for which I’m most often using my own site.

In looking briefly at her personal site, I don’t see lots of evidence of her use of the idea, so I’m guessing that she’s either keeping it privately on her back end, password protected, or on another site altogether like I do for some of my content. Her talk mentions this, so I’m excited to see how she executes on it.

I’m also curious, after having recently remotely attended the Dodging the Memory Hole 2017 conference, how she’s archiving and backing it up for future generations, particularly if she’s keeping large chunks privately.

I’m keeping my eyes open to see if she posts slides from her presentation.

Update December 10, 2017:

Here are links to the slides (Google Docs version).

The video has also been posted today on WordPress.tv:
Brianna Privett: The Story of Your Life: Using WordPress as Your Memory Warehouse