Skip to content

METS schema poster more like wallpaper

I’m starting in on a snazzy workbench project which will encode a bunch of it’s working data in the METS schema. In the process I’ll probably need to understand and use the standard in ways that have been only theory to me. Premis in METS, also exciting for me.

In any case, I found myself constantly flipping back to the Oxygen XSD viewer in Eclipse to consult the standard.  That’s very cumbersome to do all day long. This one time I tried opening an additional workbench window within the same Eclipse instance, using Eclipse on two displays. Nice in theory, but I do not recommend doing this if you want to retain a stable Eclipse IDE.

That’s when I noticed the plotter sitting in the GIS area here. It can print 3 feet by whatever length! The next thing I discovered is that you can save the current Oxygen XSD schema view to an image, capturing whatever parts of the schema are expanded. (Thank you Oxygen.)

The first draft I tried was an image with the basic parts of the entire schema expanded underneath the root element. Even after collapsing all repeated element types, this was a ten foot long poster. While fun for demo purposes, this would be impractical without inventing an analog scrolling device. I was able to shorten the poster to a more reasonable length by breaking the schema up into element types and rearranging to use the 3 foot width. Now the poster is 3 feet wide by 6 feet tall, small enough to pin to the back of a tall bookcase or cube divider.

Show them how serious you are about metadata transmission!

If you use METS, I highly recommend you look around your institution for a plotter and go to town. The final image and Gimp file are linked.

Only entropy comes easy.

I was looking for some pithy quotes that neatly sum up the digital preservation game and ended up searching quote sites for terms like “decay” and “entropy”.

The art of progress is to preserve order amid change.  -Alfred North Whitehead

For most men, life is a search for the proper manila envelope in which to get themselves filed  -Clifton Fadiman

Decay is inherent in all compounded things. Strive on with diligence.  –Buddha

Language is only the instrument of science, and words are but the signs of ideas: I wish, however, that the instrument might be less apt to decay, and that signs might be permanent, like the things they denote  -Samuel Johnson

Only entropy comes easy.  -Anton Chekhov

Desolation is a file, and the endurance of darkness is preparation for great light.  -St. John of the Cross

Still paying attention?  That last one was more for humor.  The Chekhov is my favorite because it is so succinct and wry, but still says a lot.  Whitehead boils the concepts down to a pure unassailable essence, while the Buddha inspires radical diligence.  Got any more?  What’s the oldest quote you can find that is still relevant and what does that say?

Dia Shapes for Fedora Commons Content Models

I’ve been designing Fedora content models for two years now.  The whole time I’ve struggled to communicate the mix-in approach to content models.  This is a variant on the “decorator” design pattern that’s familiar to object-oriented programmers, but pretty unfamiliar to most others.  The models I used for the repository at UNC do not have inheritance, just a set of models that might or might not apply to a given object.  So at least the diagram doesn’t need to express relationships between models.

In any case, the other day I saw that folks at Wisconsin had an elegant solution, color.  Follow the link and look at the PDF diagram.  It seems like a small thing, but for expressing content models it is a real breakthrough.  You can lay out your example objects in a normal-looking composition diagram, then color code the content models to match the traits that they imply.  I don’t know how decorators are modeled in UML and thankfully now I don’t need to…

Now the Wisconsin model is beautiful, with drop-shadows and curvy shapes.  I want that too, but I prefer open source tools.  So that limits me to only a few diagramming apps, namely Dia, OpenOffice Draw, Inkscape.  Okay not so few.  My preference is Dia, of all my options it is closest to Visio and I’ve done a lot of work with it in the past.  However, there weren’t any shapes quite suitable to the task at hand in Dia.  (There probably aren’t in Visio either..)  So now I had to roll my own Fedora modeling shape.  Like most such things, it really doesn’t take very long once you find the right documentation:

I found the best way to create a shape was to make a template from some similar shape that ships with Dia.  Find your Dia preferences folder.  On Linux it’s in a .dia folder under your home directory.  The .dia folder (or equivalent) contains a shapes folder and a sheets folder.  So I created a new subfolder in each of these locations called “Fedora”.  Then I copied a sheet file and a shape file into these locations from similar locations in the main Dia install.  On Ubuntu linux Dia installs it’s default shapes and sheets in /usr/share/dia.

I created the outline of my new shapes in Dia and then exported them as SVG.  This SVG went into the shape file.  Then I customized it based on the second link above.  I ended up with two basic fedora shapes, one for objects and one for datastreams.  They are a work in progress, but allowed me to create the diagram below.

click for full size

click for full size

There are some lingering todos, mostly making the arrows able to link to the center instead of an edge.  That probably involves replacing separate SVG lines with a closed path description that can be filled.  I also want to find a way to depict model-tied services, something like UML class operations I suppose.  Or, maybe someone else will pick up where I left off.

Here’s the shape/sheet bundle:   fedorashapes

don’t forget to tip your journalist

This starts with The Daily Show interview of Walter Isaacson the other day, which led me to his recent cover story, “How to Save Your Newspaper“.  The article is a revealing look at how newspaper revenues have changed in the shift to online content.  Basically they can’t make any money because people don’t like to pay for online content.

I agree that this is the central problem.  I also agree that the web provides a unique solution.

One of history’s ironies is that hypertext — an embedded Web link that refers you to another page or site — had been invented by Ted Nelson in the early 1960s with the goal of enabling micropayments for content.

“Micropayments, that will never work..”, I can hear you already.  It’s true for several surmountable reasons.

One is that I want privacy.  I don’t want to hand out my identity every time I read the occasional insightful article on an otherwise extremist blog.  This call will not be recorded, you get my money in a plain brown wrapper.

Second is that I am the judge.  I’ll pay you after I read the damn thing.  You aren’t selling me another big headline with no journalism to back it up.  You might even let me choose how much I want to pay.  It’s a tip, a massively scalable tip jar.  Write something good and it will fill up.  Let your aggregation sites chase the ad revenue, but never let them get their hands in your tip jar.

So okay, you are this award-winning journalist and we must be tripping over ourselves to pay for the privilege of seeing each sentence..  Wait, what was your name again?  Time to let go of all that or watch the hungry bloggers eat your lunch and then drink your milkshake.  I am the judge.  :)

Third reason, micropayments are too troublesome or hard or time consuming.  Not true.  PayPal.  Even better solutions suggest themselves from the quote above.  For one thing, a tip jar is easier than PayPal because in a tip scenario the seller doesn’t have to verify payment.  So all you need is a service that records the urls of any articles you tip.  Then they can distribute all your tips through an account like PayPal.  It could be just another verb under the article, “Share on Facebook – Tip with Tipster – Share on Del.icio.us”.  (/me writes a very very simple business plan.)

Anything I sit down and read for 10 or 20 minutes is worth at least a buck, so I’d probably cough up a miserly 25 cents and/or blog about your article or share your article with my hyper-efficient, but perfectly commonplace, social network.

Sir Allen Ran Away

Maybe I’m being a nasty post-colonial oppressor and shoving aside Caribbean sovereigns, but I have a problem with calling this fella a knight.  So this Texan is knighted in Antigua/Barbuda, so what?  He basically owns the place.

HE IS THE LARGEST EMPLOYER IN ANTIGUA AFTER GOVERNMENT

If found guilty of these fraud allegations, then they should dub him privateer and send him forth to scuttle rival ponzi schemes in Grand Cayman.  Ah yes, echos of Python wit on corporate raiders.  Let’s just call him “Great Pirate Allen” and then the ladies can coo with delight every time they say it.

Big Blue Joins OpenOffice

IBM has joined the OpenOffice community and intends to contribute accessibility and other enhancements. This may be predictable to those following IBM’s commitment to the open source movement, but to governments and large organizations this support will make adopting the open document standard a more viable option.
M$ has been trying to squeeze more time out of its office suite through tightly integrated collaboration tools, but now IBM can effectively counter with a Lotus suite based around the ODF standard.

Anyone who has unzipped an ODF spreadsheet, run an XSL transform and then packed it back up knows the power of open formats. Makes it so easy to do exactly what you want with your data. I can’t wait for the day that I can send an ODF without hearing, “Sorry, I can only open Word format.”

44 Blackboard Patents Under Review

If you work in higher education, Blackboard is the 800 pound gorilla. Now open source advocates are challenging 44 of their patents. In an effort to gather evidence of previous work they created a fascinating Wikipedia article on the subject. See History of Virtual Learning Environments. I think Douglas Engelbart is one of the more forward thinking people in this history. Invented hypertext, the mouse and early user interfaces. There is a video I’ve seen of him running a hypertext demo back in the late sixties. If you watch it, wait for the title cards to pass and you will be rewarded.
Anyway, I found the Blackboard news refreshing.

Public Calendars and Classification: First Problem

Some folks already know that I’ve been researching institutional calendar software throughout teh intarnets. I’ve found a few decent tools, probably four candidates with one strong open source tool and one strong vendor solution. I know a lot more about the open source tool now, since all their product information is in the open. The vendor still hasn’t called me, despite my email. (While their site lists all the important features we need, there are no screenshots or demos to play with and a general lack of detail.)
Meanwhile I’ve got the open source Bedework up and running with a “quickstart” release, piece of cake. I had my doubts about it at first, the end-users docs are pretty thin and don’t explain the basic organization of the calendar suite very well. I had my doubts that it could fit our needs, despite it being designed from the ground up for educational institutions. Then I found the developer wiki. Surprise, the wiki covers the basics so much better than the PDFs written for administrators.
So, coming around to the point, I found this nugget of truthiness, which gets at what I see as a large information sharing problem here at UNC. The high-level objective for the calendar is to create more multidisciplinary participation in events, see the Chancellor’s report on intellectual climate. In order to get there you have to classify events on the public calendar by topic, not department. We just tried to address similar issues on the ITS site, focusing on recognizable topics instead of organizational divisions in our service categories. I think the tension between organizational and topical categories can be found in many campus information systems. The problem this creates is especially obvious when you consider new arrivals to campus or prospective students/faculty. They don’t know whether to look at the cashiers office, the registrar or admissions for relevant information.

Document Imagineering

This post from The Content Wrangler looks at the future role of documents in the workplace and who will be called in to analyze and improve the situation. It reviews a book that might be a good add for our KM wishlist.

Theme with an annoying layout issue

I really like the hemingway theme that I’ve put on my work blog. It’s very clean and professional looking, but has one quirk that I don’t have the CSS chops to fix. At least I tried for a bit today and gave up. I’ll probably have to dig in to figure it out and that a project for another day, but let me know if you have suggestions.
The problem is the search div covers up the link in the blog title, making it hard to return to the front page from my story pages. I’m guessing a z-index would fix it, or maybe getting the positioning right somehow w/o going 100% of the “live-search” div.