<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-7419001987588063203</id><updated>2012-02-02T17:52:09.400-08:00</updated><category term='linux'/><category term='images'/><category term='TEI'/><category term='perseus'/><category term='tools'/><category term='diogenes'/><category term='google maps'/><category term='yam'/><category term='lexicon'/><category term='XML'/><category term='lycian'/><category term='xul'/><category term='google books'/><category term='machine actionable'/><category term='chrome'/><category term='living dinosaur'/><category term='apa'/><category term='grumpy old man'/><category term='archimedes'/><category term='Greek'/><category term='dnid'/><category term='Epidoc'/><category term='alpheios'/><category term='dml'/><category term='homer multitext'/><category term='LSJ'/><category term='CTS'/><category term='Morpheus'/><category term='transcoding transformer'/><category term='citation'/><category term='diagrams'/><category term='tlg'/><category term='open access'/><category term='corpus linguistics'/><category term='digital humanities'/><category term='writing'/><category term='sceptical'/><title type='text'>Vitruvian design for scholarship in the humanities</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>32</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-7847799462064604260</id><published>2012-01-31T06:48:00.001-08:00</published><updated>2012-01-31T06:55:26.451-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='yam'/><category scheme='http://www.blogger.com/atom/ns#' term='writing'/><title type='text'>A checklist for writers</title><content type='html'>&lt;div&gt;&lt;p&gt;Technology both shapes and reflects our values. What do we value in scholarly writing, and how well do our technological choices match those values?&lt;/p&gt;&lt;p&gt;I look for software that supports four necessary or possible qualities of good scholarly writing:&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;ol&gt;&lt;li&gt;expository writing should be &lt;strong&gt;explicit and unambiguous&lt;/strong&gt;&lt;/li&gt;&lt;li&gt;the writing process is &lt;strong&gt;iterative&lt;/strong&gt;: good writing only comes from rewriting&lt;/li&gt;&lt;li&gt;academic writing in the natural sciences is often &lt;strong&gt;collaborative&lt;/strong&gt;; this is becoming less rare in the humanities (although not necessarily in the &lt;a href="http://vitruviandesign.blogspot.com/2012/01/humanities-that-must-not-be-named.html"&gt;cargo-cult humanities&lt;/a&gt; )&lt;/li&gt;&lt;li&gt;born-digital writing should be &lt;strong&gt;reusable&lt;/strong&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/div&gt;&lt;div&gt;&lt;p&gt;In a digital enviroment, to write explicitly and unambiguously means more than choosing our words well: it also means expressing the structure and contents of our writing explicitly and unambiguously. Our writing should embody the fundamental principle of separating concerns in our digital work: our first goal is to express our ideas clearly, not to exercise our typesetting skills, so we need a format that that can explicitly and unambiguously express structure. We might choose an XML-based semantic markup system, or some semantically classed “markdown” system such as markdown or textile. What we should not choose is a “word processor.” Even if you can approximate a semantic structure using a carefully chosen set of “styles” (a tell-tale term!), you will be planting your semantic hints in a thick forest of code focused on the particulars of displaying your text visually. Note that it’s perfectly possible to express this irrelevant information using XML formats like OpenDocument. Our question is not “is this an XML format?” but “does this format express the semantics of my document?”&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;p&gt;In considering how to support the remaining items in our list, we should look for examples beyond the humanities, since expository prose is not the only form of writing that shares these qualities. In particular, each is characteristic of good composition in computer programming, and computer programmers routinely use software that directly takes account of each of these qualities.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;p&gt;Programmers use &lt;em&gt;version control systems&lt;/em&gt; to work with the entire history of a document to update, restore or compare versions. Version control systems also simplify collaboration, and allow mulitiple contributors to work simultaneously on a document. Changes can be silently integrated and shared; if two authors simultaneously make conflicting changes, version control systems can recognize that, and offer authors options to reconcile conflicts manually. There are many good, freely available version control systems. One reason that humanists are less familiar with them than they should be is that version control systems work best with textual data: the binary formats that word processors produce are a major obstacle to integrating our writing in version control, but once we have adopted a text-based semantic format, that obstacle vanishes, and we have a writer’s desktop that lets us write iteratively and collaboratively.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;p&gt;Programmers also provide a model for reusing our writing. Units of code are often packaged in libraries that other programs use. Programmers working on large projects manage the potentially complex interrelations and dependencies of of different libraries and programs using &lt;em&gt;build systems&lt;/em&gt;. We are not yet accustomed to thinking about automating the reuse of our writing, but there is no technical obstacle to doing so. We could use build systems to assemble chapters into a book, incorporate common navigational headers into all the pages on a web site, or automatically update an index if one section of a text changes, to name just a few obvious examples.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;p&gt;So our checklist of required tools for writers includes:&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;an editor that works comfortably with semantically structured text&lt;/li&gt;&lt;li&gt;a version control system&lt;/li&gt;&lt;li&gt;a build system&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;I plan to add a series of posts with the tag &lt;a href="http://vitruviandesign.blogspot.com/search/label/writing"&gt;writing&lt;/a&gt; to look at how we can work with tools like these to write more effectively in a digital setting. Meanwhile, take the checklist to your college or university IT department, and ask what specific software they support for semantic editors, version control systems and build systems. I would love to learn of an academic institution that is not just pressing commercial word processing software on its students and faculty, but I don’t know of one.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-7847799462064604260?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/7847799462064604260/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=7847799462064604260' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/7847799462064604260'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/7847799462064604260'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2012/01/checklist-for-writers.html' title='A checklist for writers'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-5801178478239313721</id><published>2012-01-30T05:06:00.000-08:00</published><updated>2012-01-30T05:08:10.462-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='yam'/><title type='text'>The recursive arithmetic of tenure</title><content type='html'>&lt;div&gt;&lt;p&gt;The long career path from college student to a tenured academic job is designed to be conservative. A student in the humanities who discovers a passion for an academic subject in his or her first year of college can expect that four years of college will be followed by, say, six years of graduate school that not only provide training in a discipline, but initiate the student in its culture. The (increasingly rare) PhD who then immediately walks into a tenure-track job typically faces seven years of scrutiny before a tenure decision. Newly tenured professors have proven that their work meets the professional standards of their colleagues — seventeen years after entering college.&lt;/p&gt;&lt;p&gt;Like many professors, I hope that a college education is a formative experience in the lives of my students. Imagine that the newly tenured professor was inspired, seventeen years ago, by an exciting teacher and scholar. That person of course would have climbed the rungs of the same professional ladder, so the &lt;strong&gt;youngest&lt;/strong&gt; tenured professor who could have inspired today’s &lt;strong&gt;youngest&lt;/strong&gt; tenured professor might in turn have first been inspired as a new college student … 34 years ago.&lt;/p&gt;&lt;p&gt;In 1977, the late Steve Jobs was just starting a company he had formed the previous year to sell the computers he and Steve Wozniak were building in his father’s garage.&lt;/p&gt;&lt;p&gt;We’re trying to cross an ocean by standing at the shore and waiting for continental drift to carry us to the other side.&lt;/p&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-5801178478239313721?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/5801178478239313721/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=5801178478239313721' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/5801178478239313721'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/5801178478239313721'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2012/01/recursive-arithmetic-of-tenure.html' title='The recursive arithmetic of tenure'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-1029004360996553026</id><published>2012-01-30T03:50:00.000-08:00</published><updated>2012-01-30T13:03:01.540-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='yam'/><category scheme='http://www.blogger.com/atom/ns#' term='digital humanities'/><title type='text'>The humanities-that-must-not-be-named</title><content type='html'>&lt;p&gt;I’m not thrilled with the term “digital humanities.” When people refer to the “humanities,” I think I know what they mean: those disciplines that are concerned with human activity and everything it produces, and take as their task both to preserve and transmit that culture on the one hand, and to understand and interpret it on the other. But what is the sense of qualifying that noun with the adjective “digital”?&lt;/p&gt;&lt;p&gt;In the twenty-first century, the phrase can’t really stand in opposition to an implied “analog humanities”: no such thing exists. (When was the last time anyone submitted a hand-written or manually typed manuscript to be edited with grease pencil before being manually typeset with hot lead?) “Digital humanities” refers instead to scholarship in the humanities that consciously takes account of the fact that we all work digitally now.&lt;br /&gt;&lt;/p&gt;&lt;p&gt;What troubles me is that our use of the marked term “digital humanities” implies that the unmarked term, “humanities,” is being used to refer to scholarship that does &lt;em&gt;not&lt;/em&gt; reflect on the media we all work in (a usage that is sadly accurate in the academy today). I am particularly disturbed because I would like to imagine that an education in the humanities encourages the kind of critical self-awareness that would enable us to think more meaningfully about our relation to the environment we live and work in, including our technological environment and the ways it is interwoven with our institutions and values. &lt;/p&gt;&lt;p&gt;By using “digital humanities,” we’re allowing the term “humanities” to stand for an uncritical scholarly practice that is at odds with the goals of a humanistic education.&lt;/p&gt;&lt;p&gt;&lt;img src="http://blog.bhargreaves.com/wp-content/uploads/2010/04/cargo-cult.jpg" alt="cargo cult plane" style="float:right;" /&gt;I can understand why there is not a spontaneous groundswell of support in academic departments around the world for a term meaning “work that unthinkingly perpetuates obsolete forms of scholarly practice,” or “scholarship that is oblivious to the media we use today,” but rather than accept without reservation the marginalizing label “digital humanities,” I’ll offer my own suggestion. We could extend Richard Feynman’s &lt;a href="http://en.wikipedia.org/wiki/Cargo_cult#Metaphorical_uses_of_the_term"&gt;“cargo-cult science”&lt;/a&gt; to “cargo-cult scholarship” more generally, and refer to the “cargo-cult humanities.”&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-1029004360996553026?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/1029004360996553026/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=1029004360996553026' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1029004360996553026'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1029004360996553026'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2012/01/humanities-that-must-not-be-named.html' title='The humanities-that-must-not-be-named'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-960846759310611929</id><published>2012-01-29T12:52:00.000-08:00</published><updated>2012-01-29T14:44:44.657-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='yam'/><category scheme='http://www.blogger.com/atom/ns#' term='sceptical'/><title type='text'>“Digital natives”</title><content type='html'>&lt;p&gt;I recently attended a workshop at my home institution where I heard teachers confidently assert that today’s students are so adept at technological tasks that we can rely on them to help their older teachers develop important technological skills.&lt;/p&gt;&lt;p&gt;Really?&lt;/p&gt;&lt;p&gt;For more than 15 years, I’ve introduced Classics students at Holy Cross to XML markup. To build on any prior experience they might have, I routinely begin by asking who has ever peeked behind a web page to view its HTML source. Fifteen years ago, I would usually find anywhere from a quarter to a half of the students would say yes. Today, if I ask a group of 20–25 students, I will get one or two “yes” answers.&lt;/p&gt;&lt;p&gt;I do not know if my students were telling me the truth fifteen years ago (or today), but that doesn’t much matter for my present point. Fifteen years ago, far more students either had seen HTML or felt some kind of pressure to pretend that they had.&lt;/p&gt;&lt;p&gt;What does it mean? I suspect that the “digital natives” I teach have indeed grown up so familiar with information technology that they are more oblivious to it than their elders. I worry that they are also incurious, or at least need to learn to be curious about it.&lt;/p&gt;&lt;p&gt;My personal experience makes up only a limited sample, of students in Classics at a small liberal-arts college, but the trend among those students is very clear. Unless someone can show me better evidence, I’ll remain very sceptical about &lt;i&gt;a priori&lt;/i&gt; assertions concerning the skills that “digital natives” will confer on their teachers.&lt;/p&gt;&lt;h2 id="note-new-tags"&gt;Note: new tags&lt;/h2&gt;&lt;p&gt;I’m using a couple of new tags on this post:  “&lt;a href="http://vitruviandesign.blogspot.com/search/label/sceptical"&gt;sceptical&lt;/a&gt;” (for obvious reasons), and “&lt;a href="http://vitruviandesign.blogspot.com/search/label/yam"&gt;yam&lt;/a&gt;” [yet another meeting] to help me find posts responding to ideas I’ve gathered from yammering at meetings. I hope to post soon on a couple of additional “sceptical” topics, and several  “yam” topics (since January is a big month for meetings in the academic world).&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-960846759310611929?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/960846759310611929/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=960846759310611929' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/960846759310611929'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/960846759310611929'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2012/01/digital-natives.html' title='“Digital natives”'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-3468104657447351998</id><published>2011-10-20T09:01:00.001-07:00</published><updated>2011-10-20T09:28:30.295-07:00</updated><title type='text'>Tending our garden</title><content type='html'>To an astonishing extent, Google really has changed the way the world's information is organized.&lt;br /&gt;&lt;br /&gt;A pair of trivial examples:&lt;br /&gt;&lt;br /&gt;[1] Liddell-Scott-Jones is the best lexicon of ancient Greek in any language, but I don't always find the search interface to the digital version from the Perseus project to be the easiest navigation system.  I recently discovered a simple way to look up an entry in Perseus:  type the lemma of the Greek word in Google.  Duh.  &lt;br /&gt;&lt;br /&gt;The point is that Perseus does not — and should not — have to worry about how people search for articles in LSJ as long as the Perseus edition is cleanly organized by article with a recognizably tagged lemma.  Since I'm frequently browsing the web from OS X, typing UTF-8 Greek is trivial, thanks to &lt;a href="http://www.sourcecod.com/sophokeys/"&gt;SophoKeys&lt;/a&gt;.  Perseus does not — and should not — have to worry about keyboard input systems.  The digital LSJ was available from Perseus long before Google or SophoKeys came along, but it's more readily usable and that much more valuable to me now because the Perseus project &lt;em&gt;did &lt;/em&gt; take responsibility for its area of domain-specific knowledge, and  properly organized the contents of the lexicon by article.&lt;br /&gt;&lt;br /&gt;[2] &lt;code&gt;urn:cts:greekLit:tlg0012.tlg001&lt;/code&gt; is a URN referring to the &lt;em&gt;Iliad&lt;/em&gt;.   URNs are not direct addresses like URLs:  they are technologically independent (although they follow a machine-parseable syntax).  Enter that URN in Google, and you'll probably get some amusingly off-target advertisements for cremation urns, but I scrolled through &lt;a href="http://www.google.com/search?q=urn%3Acts%3AgreekLit%3Atlg0012.tlg001"&gt;several pages of Google's hit list&lt;/a&gt; without encountering a single irrelevant match.&lt;br /&gt;&lt;br /&gt;One implication of these examples is not trivial.  If we can publish meaningful units of content in association with canonical identifiers like URNs, we can count on the ever-increasing momentum of the internet to create new ways of discovering and working with that content.   Like Pangloss, we need to tend our own garden;   Google or others &lt;a href="http://abu.cnam.fr/cgi-bin/donner_html?candide3"&gt;will find it rapidly&lt;/a&gt; (whether or not we think that makes for the best of all possible worlds).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-3468104657447351998?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/3468104657447351998/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=3468104657447351998' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3468104657447351998'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3468104657447351998'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2011/10/tending-our-garden.html' title='Tending our garden'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-737402527714506392</id><published>2011-10-06T07:17:00.001-07:00</published><updated>2011-10-06T07:22:06.781-07:00</updated><title type='text'>Steve Jobs on building things</title><content type='html'>Amid today's many recollections of Steve Jobs citing his quotable aphorisms, two particularly caught my attention:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;People don't know what they want until you build it.&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;and related to that theme:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;The builders are the real thinkers.&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;Jobs was immediately referring to engineers and consumers, but the statements are as true in scholarship as they are in any other endeavor.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-737402527714506392?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/737402527714506392/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=737402527714506392' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/737402527714506392'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/737402527714506392'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2011/10/steve-jobs-on-building-things.html' title='Steve Jobs on building things'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-7212021660723705135</id><published>2011-09-28T07:43:00.000-07:00</published><updated>2011-09-28T07:54:31.084-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open access'/><category scheme='http://www.blogger.com/atom/ns#' term='apa'/><title type='text'>Access to scholarly work</title><content type='html'>Members of Princeton University's faculty have unanimously voted for a policy guaranteeing open access to their scholarship (blogged with quotation of the key passage &lt;a href="https://freedom-to-tinker.com/blog/appel/open-access-scholarly-publications-princeton"&gt;here&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;Contrast with that the capital campaign of the American Philological Association (a professional organization purporting to represent the discipline of Classics).  The APA's "Campaign for Classics" plans to offer access to digital resources, but in many cases that access will be restricted to APA members.&lt;br /&gt;&lt;br /&gt;If the contrast is not pointed enough, think of it this way:  as of September, 2011, Princeton faculty members risk violating their university's policy if they contribute scholarly work to the APA project.&lt;br /&gt;&lt;br /&gt;If you believe that consistent principles should guide our behavior, then Princeton faculty members who are dues-paying members of the APA face a real ethical dilemma:  how can they support the work of an organization that directly conflicts with the policies unanimously adopted by their university?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-7212021660723705135?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/7212021660723705135/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=7212021660723705135' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/7212021660723705135'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/7212021660723705135'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2011/09/access-to-scholarly-work.html' title='Access to scholarly work'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-3412890951690764244</id><published>2011-09-20T07:00:00.000-07:00</published><updated>2011-09-20T07:18:42.526-07:00</updated><title type='text'>Druids in Oxford?</title><content type='html'>A colleague recently pointed me to the "Ancient Lives" project allowing members of the general public to view and transcribe papyri from the vast Oxyrhynchus collection that remains largely unpublished more than a century after its discovery.  &lt;br /&gt;&lt;br /&gt;I imagined something like the ground-breaking &lt;a href="http://www.stoa.org/sol/"&gt;Suda Online&lt;/a&gt; (or, SOL) -- an astonishingly successful project that has now translated for the first time ever more than 29,000 of the 30,000-odd articles in a Byzantine encyclopedia of the ancient world.&lt;br /&gt;&lt;br /&gt;Instead, when I followed the link on copyright on the "Ancient Lives" website, I read:&lt;br /&gt;&lt;blockquote&gt; Images may not be copied or offloaded, and the images and their texts may not be published. &lt;/blockquote&gt;&lt;br /&gt;This reminded me not of the forward-looking SOL, but of Julius Caesar's description of Gallic Druids.  From Caesar, &lt;em&gt;Gallic Wars&lt;/em&gt;, 6.14:&lt;br /&gt;&lt;blockquote&gt;Neque fas esse existimant ea litteris mandare ... Id mihi ... instituisse videntur, quod neque in vulgum disciplinam efferri velint ..&lt;/blockquote&gt;&lt;br /&gt;&lt;blockquote&gt;"They consider it wrong to commit these [sacred texts] to writing.  I believe that they have established this practice because they do not want their professional knowledge to be published to the common people." &lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-3412890951690764244?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/3412890951690764244/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=3412890951690764244' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3412890951690764244'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3412890951690764244'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2011/09/druids-in-oxford.html' title='Druids in Oxford?'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-489347366549946241</id><published>2011-01-18T05:34:00.000-08:00</published><updated>2011-01-18T05:59:11.519-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='chrome'/><title type='text'>Chrome apps and extensions</title><content type='html'>&lt;div&gt;You follow good practices, archive your priceless material in standard formats, and make it accessible through network services.   Now where do you build applications to use those services?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In digital scholarship, your time is your single most valuable resource, so the cost of a full-blown cross-platform desktop application is usually prohibitively high.  As CSS 3 and javascript give Web browsers better and better support for visually rich interactive applications, the browser becomes more attractive as a platform for applications, rather than just a viewer for documents.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;HTML5 clearly understands the browser this way.  (If you're not familiar with HTML5 yet, see &lt;a href="http://www.html5rocks.com/"&gt;http://www.html5rocks.com&lt;/a&gt;/) The blogosphere is full of comments about HTML5's audio and video capabilities, but its integration of local and networked resources raises more interesting architectural questions.  HTML5 can give you access to local file storage or locally persistent databases as well as providing networked communication with remote processes.  With an HTML 5 platform, the browser is more like a local application with good access to remote data.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Google has taken this idea a step further with the definition of extensions and apps for its Chrome browser.  They are written in HTML, javascript and CSS:  it is perfectly possible to write build a "Chrome app" that is nothing more than a web page displaying equally well in any browser.  (For a web developer, the road to "Hello, world" has never been shorter.)  The Chrome app is defined in a simple JSON manifest file.  The manifest can define permissions for access to remote resources (no more work-arounds to deal with restrictions on unsafe cross-site scripting).  It also defines how your app or extension is integrated into Chrome.  A single line in the manifest can tie your app to a button permanently available on your toolbar, accessible from your choice of user action (button click, key-press combination, etc).  Full Chrome apps can be distributed through Google's App Store where any Chrome user could install the app locally.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If you've ever looked at developing browser-based web applications, or have ever thought about extending Firefox with its extension mechanism, you owe it to yourself to take a quick look at&lt;/div&gt;&lt;div&gt;&lt;a href="http://code.google.com/chrome/extensions/index.html"&gt;http://code.google.com/chrome/extensions/index.html&lt;/a&gt;.  That's all it takes to get started.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-489347366549946241?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/489347366549946241/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=489347366549946241' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/489347366549946241'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/489347366549946241'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2011/01/you-follow-good-practices-archive-your.html' title='Chrome apps and extensions'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-771856504174744959</id><published>2010-10-11T09:39:00.000-07:00</published><updated>2010-10-11T09:47:59.717-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='citation'/><category scheme='http://www.blogger.com/atom/ns#' term='CTS'/><title type='text'>Thinking about citation</title><content type='html'>In digital scholarship, citation is the most fundamental form of ontology.   Identifying an entity in a recognized form of citation allows scholars to agree that the object exists, while leaving room to disagree over how to represent it.  Once we agree that there is an object I call "the Parthenon in Athens," and can unambiguously cite that object, we allow software to recognize that the same object might be represented in a GIS with spatial data, in a photo gallery with a collection of photographs, or in an architectural database with structured fields of textual data.&lt;br /&gt;&lt;br /&gt;For digital scholars, it's hard to avoid talking or even thinking about scholarly reference systems without getting bogged down in technology-specific tar pits.  Take the W3C's &lt;a href="http://www.w3.org/TR/rdf-concepts/"&gt;Resource Description Framework&lt;/a&gt; as an example.  Its triplet model is brilliantly simple and powerful:  whether you think of it as "subject-verb-object" or "object-directed link-object", it is general, extensible, and lends itself readily to both abstract graph models and real machine implementations.  Its syntax is expressed in terms of URIs, which could equally well be URLs (real addresses in the "http" schema), or URNs (abstract names in the "urn" schema).   (See the W3C's document "&lt;a href="http://www.w3.org/TR/uri-clarification/"&gt;URI Clarification&lt;/a&gt;" or this discussion "&lt;a href="http://www.ibm.com/developerworks/xml/library/x-urlni.html#N10150"&gt;Untangle URIs, URLs, and URNs&lt;/a&gt;" to sort out the acronyms.)  But when was the last time you saw a scholarly project using RDF to describe relations among objects identified by abstract name, rather than by address?  In an RDF graph, "http" is a good URI scheme for an application retrieving material on the internet, but a poor choice for a description of relations among persistent and immutable objects, a case where the "urn" scheme would be more appropriate.&lt;br /&gt;&lt;br /&gt;For canonically citable texts, work at the Center for Hellenic Studies on has led to:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;identifying abstract properties of canonical texts&lt;/li&gt;&lt;li&gt;developing a human readable and machine actionable notation for citation that expresses these properties (the CTS URN notation)&lt;/li&gt;&lt;li&gt;defining a service for identifying and retrieving texts identified by this notation (the Canonical Text Services protocol)&lt;/li&gt;&lt;/ol&gt;I've written a bit about this in a dry article on "&lt;a href="http://lccn.loc.gov/2010001406"&gt;Digital infrastructure and the Homer Multitext Project&lt;/a&gt;" but here I would just note that this three-tiered hierarchy has been very useful both in trying to think about citation outside of any particular technological context, and for defining machine-actionable tests to evaluate implementations of text services.  Only level [3] deals with specific technologies on the internet.   If you don't want to interact with a Canonical Text Service, the CTS URN notation [2] can still be used by any application referring to canonically cited texts.  If you don't like the CTS URN notation, you can still evaluate any alternative notation by seeing whether it implements the abstract properties of [1].  And if you are dissatisfied with the identification of those properties, you can start from scratch and redefine what you think a canonically citable text really is.&lt;br /&gt;&lt;br /&gt;I think this graded distinction of abstract property, reference notation, and application could be equally valuable in citing other kinds of material.  In a following series of posts, I'll look successively at each level to analyze how we might cite uniquely identified objects.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-771856504174744959?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/771856504174744959/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=771856504174744959' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/771856504174744959'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/771856504174744959'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2010/10/thinking-about-citation.html' title='Thinking about citation'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-5357346421075476111</id><published>2010-08-28T04:01:00.000-07:00</published><updated>2010-08-28T04:13:11.740-07:00</updated><title type='text'>Is 2010 "the year of open data" in Classics?</title><content type='html'>Tim Berners-Lee, the creator of the World Wide Web, has called for "raw data now"; in a TED talk this spring, he showed examples of what can happen when people have access to openly licensed and freely reusable data sets.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The American Philological Association thinks the internet is a gated community.  The lead story on the APA's website is the continuing effort to raise funds for a "portal" that will help members find resources available only to subscribers.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Compare Berners-Lee's talk (freely licensed so I can legally embed it in this blog post), with the APA's video presentation of its campaign (from the APA website either in &lt;a href="http://apaclassics.org/index.php/support_the_APA/campaign_videos/quicktime_video"&gt;Quicktime&lt;/a&gt; or &lt;a href="http://apaclassics.org/index.php/support_the_APA/campaign_videos/windows_media_video"&gt;Windows Media&lt;/a&gt; format).  Which vision of sharing scientific and scholarly data do you see as the future of Classics?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt; &lt;/div&gt;&lt;div&gt;&lt;object width="446" height="326"&gt;&lt;param name="movie" value="http://video.ted.com/assets/player/swf/EmbedPlayer.swf"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="transparent"&gt;&lt;param name="bgColor" value="#ffffff"&gt; &lt;param name="flashvars" value="vu=http://video.ted.com/talks/dynamic/TimBerners-Lee_2010U-medium.flv&amp;amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/TimBerners-Lee-2010U.embed_thumbnail.jpg&amp;amp;vw=432&amp;amp;vh=240&amp;amp;ap=0&amp;amp;ti=788&amp;amp;introDuration=15330&amp;amp;adDuration=4000&amp;amp;postAdDuration=830&amp;amp;adKeys=talk=tim_berners_lee_the_year_open_data_went_worldwide;year=2010;theme=what_s_next_in_tech;theme=a_taste_of_ted2010;theme=the_rise_of_collaboration;event=TED2010;&amp;amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;"&gt;&lt;embed src="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" pluginspace="http://www.macromedia.com/go/getflashplayer" type="application/x-shockwave-flash" wmode="transparent" bgcolor="#ffffff" width="446" height="326" allowfullscreen="true" allowscriptaccess="always" flashvars="vu=http://video.ted.com/talks/dynamic/TimBerners-Lee_2010U-medium.flv&amp;amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/TimBerners-Lee-2010U.embed_thumbnail.jpg&amp;amp;vw=432&amp;amp;vh=240&amp;amp;ap=0&amp;amp;ti=788&amp;amp;introDuration=15330&amp;amp;adDuration=4000&amp;amp;postAdDuration=830&amp;amp;adKeys=talk=tim_berners_lee_the_year_open_data_went_worldwide;year=2010;theme=what_s_next_in_tech;theme=a_taste_of_ted2010;theme=the_rise_of_collaboration;event=TED2010;"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-5357346421075476111?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/5357346421075476111/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=5357346421075476111' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/5357346421075476111'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/5357346421075476111'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2010/08/is-2010-year-of-open-data-in-classics.html' title='Is 2010 &quot;the year of open data&quot; in Classics?'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-1028020657663725031</id><published>2010-08-27T02:24:00.000-07:00</published><updated>2010-08-27T02:32:05.689-07:00</updated><title type='text'>What's the difference?</title><content type='html'>&lt;div&gt;How do we compare two texts?  For line-by-line comparison of electronic files, Unix systems have had a &lt;code&gt;diff&lt;/code&gt; command since time immemorial.  (A version had already been around long enough for &lt;a href="http://www.cs.dartmouth.edu/~doug/diff.ps"&gt;Hunt and McIlroy to write an article about it in 1976&lt;/a&gt;, before I had ever laid eyes on a computer, and before any of my students were born.)   For XML documents, &lt;a href="http://xmlunit.sourceforge.net"&gt;XMLUnit&lt;/a&gt; is a java library that can describe differences not only in the text nodes of two documents, but in their XML structure as well.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Both &lt;code&gt;diff&lt;/code&gt; and XMLUnit describe differences between electronic documents in specific formats, and report on differences in terms of that format (line-by-line structure from &lt;code&gt;diff&lt;/code&gt;; XML structure from XMLUnit).  Neither solution is adequate for the Homer Multitext project.  Its collection of texts is not defined in terms of specific document formats:  instead, texts are managed and manipulated by the more abstract references of canonical citation&lt;/div&gt;&lt;div&gt;(e.g., &lt;a href="http://homericpapyri.appspot.com"&gt;Homeric papyri&lt;/a&gt;; and &lt;a href="http://hmt-cts.appspot.com"&gt;texts from Byzantine manuscripts&lt;/a&gt;).  How would we compare two passages of texts identified by canonical citations?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Traditional Homeric scholarship suggests an approach.  Homerists refer to "vertical" vs. "horizontal" variation across versions of the &lt;i&gt;Iliad&lt;/i&gt;.  "Vertical variation" refers to entire lines that are present in one version but not another;  "horizontal variation" refers to differences across versions in "the same" line (that is, a line that is canonically cited by the same reference).  This can be generalized to any canonically citable text, if we think of vertical variation as the differences not just of Iliadic lines, but of the citation units of any text.  (By this way of thinking, manuscripts of the &lt;i&gt;Gospel of John&lt;/i&gt; that include the longer ending would show "vertical" variation compared to manuscripts that stop after the shorter ending.)  Horizontal variation then would be a description of the variation within a single citation unit (whether that is a line of the &lt;i&gt;Iliad&lt;/i&gt; or verse of &lt;i&gt;John&lt;/i&gt;).  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We could then reduce vertical difference to an operation on two ordered lists of citation values.  Analogously, if we can tokenize the textual content of a citable passage, we could treat horizontal difference as an operation on two ordered lists of tokens.  (Tokenizing texts has its own challenges, but I'll blog on that topic some other time.)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This is a happy result.  First, we can acquire the data for a comparison with existing requests in the CTS protocol, to determine citation values, or to find textual content of a passage.  Second, we can apply a single algorithm to both vertical/structural variation, and horizontal/intra-node variation.  Third, comparing two ordered sequences of tokens is one of the best studied problems in computer science (since applications like comparing DNA sequences have attracted a lot more attention than differences in Homeric manuscripts).  I googled up &lt;a href="http://www.cs.princeton.edu/introcs/96optimization/LCS.java.html"&gt;this site&lt;/a&gt; and had a basic collation class running in a few minutes (but a good programmer would be able to do this far more easily than my stumbling efforts).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This raises the far more interesting question:  what exactly do we want to know about the vertical and horizontal differences between two passages?  Some of the things we can determine include:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;what tokens are unique to passage A?&lt;/li&gt;&lt;li&gt;what tokens are unique to passage B?&lt;/li&gt;&lt;li&gt;what is the longest common sequence of the two lists?&lt;/li&gt;&lt;li&gt;what is the complete ordered union of the two lists, and the status of each token in the list (A only, B only, in both, in both but in a different position) ?&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div&gt;I've blogged a very minimal example &lt;a href="http://homermultitext.blogspot.com/2010/08/comparing-two-manuscripts-with-cts.html"&gt;comparing two manuscripts of the &lt;i&gt;Iliad&lt;/i&gt;&lt;/a&gt;.  There's lots more to do.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-1028020657663725031?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/1028020657663725031/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=1028020657663725031' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1028020657663725031'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1028020657663725031'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2010/08/whats-difference.html' title='What&apos;s the difference?'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-8988102885676770087</id><published>2010-08-24T03:07:00.000-07:00</published><updated>2010-08-24T04:02:53.072-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='google books'/><category scheme='http://www.blogger.com/atom/ns#' term='perseus'/><category scheme='http://www.blogger.com/atom/ns#' term='alpheios'/><title type='text'>Who does Classics?  Where?</title><content type='html'>&lt;div&gt;&lt;a href="http://vitruviandesign.blogspot.com/2010/08/greek-latin-arabic.html"&gt;David Bamman's presentation&lt;/a&gt; was only one of several high points at last week's meeting at Tufts University on "Greek, Latin, Arabic."  We heard from &lt;a href="http://alpheios.net/"&gt;the Alpheios project&lt;/a&gt; about recent development of their language learning tools.  I'm thrilled to be using alpheios this fall both as a teacher of intermediate Latin and a student of first-semester Arabic, but what continues to impress me most about the project is the thoughtfulness of its architecture.  The lexica (such as Liddell-Scott-Jones for Greek, and Lewis-Short for Latin) and linguistic information (very comprehensive morphological analyses, and for some sets of texts, syntactic tree banks of the kind David Bamman's research uses) are cleanly organized as services that are accessible over the internet.  If there is another project in the digital humanities that has grasped this fundamental architectural principle as clearly as the Alpheios project has, I'm not familiar with it.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Also in attendance was Google's Will Brockman, who was able to comment on the recent &lt;a href="http://www.google.com/googlebooks/ancient-greek-and-latin.html"&gt;public release of scans of over 500 Greek and Latin texts&lt;/a&gt;.   (Six copies from three different editions of Pomponius Mela!  Can you do that in your home library?)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A &lt;a href="http://nlp.perseus.tufts.edu/lexicon/"&gt;dynamically constructed lexicon&lt;/a&gt;;  network services exposing Greek and Latin lexical and linguistic information to the internet ;  a corpus of freely available texts — individually, these are major contributions to the study of Classics.  Collectively, they really do lay the foundations for a radically altered discipline — and they exist today.  If I wasn't constantly hearing from fellow classicists that our discipline is in crisis, I would think that there has never been a better time to study Greek and Latin.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Oh, and who does this work?  David Bamman is a senior researcher at the Perseus project, not a member of an academic department.  The Alpheios project is independent of any academic institutional affiliation.  Google — well, you've heard of Google.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I was reminded of &lt;a href="http://booksearch.blogspot.com/2010/06/google-releases-500-scans-of-ancient.html"&gt;Brockman's blog post&lt;/a&gt; when he first announced Google's free release of Greek and Latin texts in June.  He cites three examples of studying ancient texts that excite him: reading a Latin text in the Perseus project's interactive edition;  reading an article about Sophocles in English from the Suda On Line;  and consulting the high-resolution photography of the Venetus A manuscript from the Homer Multitext project.  His selection caught my eye, because I've been involved in all three projects, and know some of the back stories.  None of the junior members of the original Perseus project were tenured at their original home institutions:  all moved to other jobs, or left the field altogether.  When an external review committee visited the University of Kentucky in the 1990s, after an extensive presentation about the Stoa prominently including the Suda On Line, a classicist asked the late Ross Scaife, "In what way does any of this constitute scholarship?"  (A curious question about the first effort ever to translate into &lt;i&gt;any&lt;/i&gt; language the rich and complex text of the Suda.)  The Homer Multitext project has not faced such overt hostility, but it is interesting to note that it originates from the Center for Hellenic Studies (a branch of Harvard University in Washington, D.C., independent of the Department of the Classics), and that the two editors and two project architects all hold academic positions at institutions that do not grant PhDs in Classics. (Speaking for myself, I couldn't be happier about that.)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Connect the dots however you like.  I draw two conclusions:  first, that the study of classics is far too important to leave to classicists;  and second that the study of Greek and Latin is still exciting enough to attract brilliant contributions from committed scholars who are not shackled with a title like "Professor of Classics."  In 2010, I'm starting to envy my students, and wish I had a few more decades to continue this work.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-8988102885676770087?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/8988102885676770087/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=8988102885676770087' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8988102885676770087'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8988102885676770087'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2010/08/who-does-classics-where.html' title='Who does Classics?  Where?'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-3589367294035466631</id><published>2010-08-23T17:52:00.000-07:00</published><updated>2010-08-23T17:59:47.640-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='machine actionable'/><category scheme='http://www.blogger.com/atom/ns#' term='lexicon'/><category scheme='http://www.blogger.com/atom/ns#' term='Greek'/><category scheme='http://www.blogger.com/atom/ns#' term='corpus linguistics'/><title type='text'>Greek, Latin, Arabic</title><content type='html'>&lt;div&gt;Last week, I attended a meeting on "Greek, Latin, Arabic" at Tufts University.  Some of the most stimulating discussion focused on how it's possible to exploit more or less limited corpora of structured digital texts to find valuable information in a larger, less structured morass (think:  the internet).  Lots of interesting research worth blogging about (although you won't be likely to hear about any of it if you go to an APA convention), but I want to comment briefly on David Bamman's presentation, because I think his work is as significant as any research I've seen in classics in the past 30 years.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For some time, Bamman has been pursuing interesting work in two distinct areas:  (1) automatic alignment of texts in different languages, and (2) using dependency treebanks to represent the syntax of Greek and Latin.  I've followed his progress for a couple of years, but last week was the first occasion when I've begun to realize how he can weave these two strands of work together.  (That's only a comment on my own obtuseness.) If you want to jump over the methods, and go straight to an astonishing result, follow &lt;a href="http://nlp.perseus.tufts.edu/lexicon/"&gt;this link&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;It's a dynamically induced lexicon.  Stop, and reread that sentence.  It's &lt;b&gt;a dynamically induced lexicon&lt;/b&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I've read this (preprint of an) article about &lt;a href="http://www.perseus.tufts.edu/publications/jcdl27-bamman.pdf"&gt;projecting markup across translations&lt;/a&gt; a couple of times.  That's not really enough, and I'll probably reread it tomorrow, but let me reduce one result to this summary:  at the level of individual words, Bamman achieves about a 70% success rate aligning roughly five million words of Greek with seven million words of English.  That in itself is fairly astonishing, but Bamman also leverages his work building treebanks to model the syntax of Greek and Latin texts.  Crossing the syntactic models from his treebanking with information gleaned by aligning versions texts in different languages, Bamman builds a dynamic lexicon that can take a Greek term and trace how translations in a language like English render that term in different sets of texts, including recognizing the syntactic constructions in which the term appears;  or conversely, he can take an English term, give the most closely corresponding Greek terms, and from there, again lead you through the history of the Greek term, as glossed or explicated (automatically) by its English translations.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In his presentation, as in the written publications I have seen to date, Bamman's work simultaneously shows the general implications of his research for computational linguistics, and how the Latin and Greek case studies he has chosen are distinctive.  That tension between generality and specificity is, I think, often the hallmark of really great scholarship — a category David Bamman's work clearly falls into, in my view.  If there are any lovers of classical languages or literature who still doubt whether they are computational linguists, Bamman should persuade you that in 2010 we all are all computational linguists.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-3589367294035466631?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/3589367294035466631/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=3589367294035466631' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3589367294035466631'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3589367294035466631'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2010/08/greek-latin-arabic.html' title='Greek, Latin, Arabic'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-1563006240524143079</id><published>2009-03-26T11:26:00.000-07:00</published><updated>2009-03-26T11:39:14.470-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open access'/><title type='text'>MIT faculty mandate open access to publication</title><content type='html'>Blogged &lt;a href="http://www.bitsbook.com/2009/03/mit-adopts-an-open-access-policy/ "&gt;here&lt;/a&gt; by Hal Abelson, chair of the committee that composed the resolution:  the MIT faculty have voted &lt;strong&gt;unanimously&lt;/strong&gt; to adopt a resolution that includes these phrases:&lt;br /&gt;&lt;blockquote&gt; [E]ach Faculty member grants to MIT a nonexclusive, irrevocable, paid-up, worldwide license to exercise any and all rights under copyright relating to each of his or her scholarly articles, in any medium &lt;/blockquote&gt;&lt;br /&gt;and&lt;br /&gt;&lt;blockquote&gt;The policy is to take effect immediately&lt;/blockquote&gt;&lt;br /&gt;Think that's clear enough for any of the hair-splitting legalists out there?&lt;br /&gt;&lt;br /&gt;I believed there was time when humanists set the moral direction for the academy.  We're lucky that our scientists' pursuit of truth seems to be generating enough of a draft to pull us along.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-1563006240524143079?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/1563006240524143079/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=1563006240524143079' title='40 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1563006240524143079'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1563006240524143079'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2009/03/mit-faculty-mandate-open-access-to.html' title='MIT faculty mandate open access to publication'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>40</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-3237567595319018011</id><published>2008-11-26T05:17:00.000-08:00</published><updated>2008-11-26T05:24:10.553-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='LSJ'/><category scheme='http://www.blogger.com/atom/ns#' term='Greek'/><category scheme='http://www.blogger.com/atom/ns#' term='Morpheus'/><title type='text'>The vocabulary of ancient Greek</title><content type='html'>What is the vocabulary of ancient Greek?  That is, what set of words, or lexical entities, actually occur in our extant texts?&lt;br /&gt;&lt;br /&gt;The First Thousand Years of Greek project (announced &lt;a href="http://chs.harvard.edu"&gt;here&lt;/a&gt;) aims to simplify posing such straightforward questions, but we need more than online texts to talk unambiguously about words.  One essential piece of infrastructure is an inventory of &lt;em&gt;uniquely identified&lt;/em&gt; lexical entities in Greek.  In print publications, lexical entities have traditionally been identified by a word's lemma form.  While lemmata are valuable labels, they are potentially ambiguous.  Instead, basic principles of information design dictate that arbitrary identifiers guaranteed to be unique should be associated with lemma strings, so that references to a lexical entity can be unambiguously machine processed (using the identifier), and remain intelligible to human readers (using the labelling lemma string).&lt;br /&gt;&lt;br /&gt;The Perseus project has given classicists two monumental resources that must be coordinated with an inventory of lexical entities:  the digital LSJ lexicon of Greek, and the Morpheus morphological parsing system that can associate surface forms of words with a lemma.  Taken together with the invaluable list Peter Heslin has created by running Perseus' morphological parser over the word list of the TLG project's E disk, they provide an obvious starting point for an inventory of Greek lexical entities would be to compare these two resources.  &lt;br /&gt;&lt;br /&gt;The digital LSJ has already been provided with unique identifiers for each entry, and each entry includes a lemma string.  Perseus' morphological analyses identify entities by lemma.  Where there is a one-to-one mapping between the parser's lemma and the LSJ lemma (normalized so that LSJ's markings of long and short vowels are removed), we can fairly assume that they represent the same entity, and could simply adopt the LSJ identifier to refer to the more general notion of the lexical entity — an unambiguous reference that could be associated with an entry in the lexicon, with morphological analyses, or with any other information.&lt;br /&gt;&lt;br /&gt;While this simple (and easily automated) task takes care of the vast majority of the vocabulary in both the LSJ and in the parser's output, there are several categories of problematic cases.  They include:&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;&lt;li&gt;entities where LSJ's orthography differs from the parser's orthography.  This is actually a large group with several subcategories, some of which can probably be reliably resolved automatically. For example, LSJ and Morpheus sometimes disagree on whether the lemma form of a verb should be active or middle/passive voice:  a a careful script could accommodate that kind of variation, but human intervention would be necessary when LSJ and Morpheus use alternate forms of the lemma.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;entities that appear in the parser's list of lemmata, but not in LSJ.  This occurs frequently with compound verbs that are not given separate articles in LSJ.  In these cases, since there is no LSJ identifier to reuse, we would, obviously, need to create new identifiers for those entities not in LSJ.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;"ghost entities."  For reasons that are not clear to me, LSJ routinely lists verbal adjectives in -τέον as distinct entities, unconnected to the verb from which they are formed.  (E.g., the adjective λυτέον is a distinct entry, unrelated to the verb λύω.)  Whatever the reasoning, in a digital environment, this is the wrong taxonomy:  the morphological analysis should allow applications to distinguish verbal adjectives from other forms deriving from the same verbal root, while the identifier for the lexical entity should recognize verbal adjectives and conjugated forms of a verb alike as forms of the same entity.  Mapping these LSJ and Morpheus lemmata to the correct verbal lemmata will be a relatively straightforward task, but again will need human supervision for some common cases (e.g., δοτέον &lt; δίδωμι).&lt;/li&gt;&lt;br /&gt;&lt;li&gt;entities in LSJ but not in the list of lemmata generated by running the parser over the TLG E word list.  Presumably, these result from the contributors to LSJ covering texts that are beyond the scope of the TLG E disk's corpus.  As a basic principle, we should make absolutely explicit what digital corpus of texts an inventory of lexical entities is based on.  Since our first pass is working from Heslin's analysis of the TLG E corpus, we should not enter these LSJ IDs into our inventory — at least, not yet.  As the inventory is checked against further texts, new vocabulary may appear, and at that time new candidates for addition to the inventory will need to be checked in both LSJ and Morpheus.&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;br /&gt;That is a substantial, but I think manageable, list of tasks.  One easy way to begin would be to limit the scope of coverage further, and rather than beginning from the entire TLG E word list, start with a word list created from a specified corpus of texts.  As lemmatized word indices for the First Thousand Years of Greek are released, we will guarantee that all surface forms of a word are resolved to a uniquely identified lexical entity.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-3237567595319018011?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/3237567595319018011/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=3237567595319018011' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3237567595319018011'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3237567595319018011'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/11/vocabulary-of-ancient-greek.html' title='The vocabulary of ancient Greek'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-8581471061383850703</id><published>2008-10-30T05:09:00.000-07:00</published><updated>2008-10-31T06:59:57.908-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='archimedes'/><category scheme='http://www.blogger.com/atom/ns#' term='dml'/><category scheme='http://www.blogger.com/atom/ns#' term='diagrams'/><title type='text'>Beyond text</title><content type='html'>If you are interested in the architecture of scholarly resources, run, don't walk, to Gabe Weaver's new sourceforge site, &lt;a href="http://episteme.sourceforge.net/index.html"&gt;episteme&lt;/a&gt;.  &lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_72-VxQGtGLY/SQmni8fEfUI/AAAAAAAAABI/9SFUJc6APSw/s1600-h/diagram_tour-small.png"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 200px; height: 106px;" src="http://3.bp.blogspot.com/_72-VxQGtGLY/SQmni8fEfUI/AAAAAAAAABI/9SFUJc6APSw/s200/diagram_tour-small.png" alt="" id="BLOGGER_PHOTO_ID_5262921858352971074" border="0" /&gt;&lt;/a&gt;  The nascent site (opened to coincide with the public release of "digital product" from the &lt;a href="http://archimedespalimpsest.net/"&gt;Archimedes Palimpsest project&lt;/a&gt;) documents his work representing and manipulating information encoded as mathematical diagrams.&lt;br /&gt;&lt;br /&gt;There's already a lot to think about here, but one intriguing aspect is that entities in figures are referred to with identifiers that can be coordinated with canonical references to passages of textual content from the same document.  (Short-term consequence for me personally — urgent need to re-think my presentation for the "Text and Graphics" panel at next week's &lt;a href="http://www.cch.kcl.ac.uk/cocoon/tei2008/programme/index.html"&gt;TEI meeting in London&lt;/a&gt;.  Ouch.)&lt;br /&gt;&lt;br /&gt;Oh, and if you just want to enjoy some beautiful drawings, there's an Easter egg with a larger display of the image above — a collage of figures from book 1 of Archimedes' treatise &lt;em&gt;On Floating Bodies&lt;/em&gt;.  You can see it  &lt;a href="http://episteme.sourceforge.net/img/diagram_tour.png"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;(Updated Oct. 31:  Episteme now includes &lt;a href="http://episteme.sourceforge.net/dmlnav.html"&gt;interactive eye candy&lt;/a&gt;, too.)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-8581471061383850703?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/8581471061383850703/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=8581471061383850703' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8581471061383850703'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8581471061383850703'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/10/beyond-text.html' title='Beyond text'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_72-VxQGtGLY/SQmni8fEfUI/AAAAAAAAABI/9SFUJc6APSw/s72-c/diagram_tour-small.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-5846894043075962036</id><published>2008-08-06T13:31:00.000-07:00</published><updated>2008-08-06T13:57:04.200-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='transcoding transformer'/><category scheme='http://www.blogger.com/atom/ns#' term='tools'/><category scheme='http://www.blogger.com/atom/ns#' term='Greek'/><category scheme='http://www.blogger.com/atom/ns#' term='Epidoc'/><title type='text'>Epidoc transcoding transformer bats 1.000</title><content type='html'>Hugh Cayless's transcoding transformer library (available from the Epidoc project's sourceforge site &lt;a href="http://sourceforge.net/projects/epidoc/"&gt; here&lt;/a&gt;) is indispensable for anyone working with ancient Greek texts in java or groovy.  How reliable is it?&lt;br /&gt;&lt;br /&gt;I decided to test it against two significant lists of unique Greek strings.  For each list, I converted the TLG's beta code word to UTF-8, then converted the resulting UTF-8 back to beta code, and compared that result to the original.  (For an overview of the TLG's beta code conventions, see &lt;a href="http://www.tlg.uci.edu/BetaCode.html"&gt;this guide&lt;/a&gt;.)&lt;br /&gt;&lt;br /&gt;The first list was composed of 858715 words excluding proper names.  The transcoder round tripped to its starting point in  858709 cases.  Six failures doesn't sound bad (99.999% success rate).  But look more closely: in five of the six failures, the TLG entry in fact breaks the TLG's encoding rules about order of accents, breathings and iota subscripts, while the transcoder correctly follows the rules with the consequence that its conversion back to beta code actually &lt;em&gt;corrects&lt;/em&gt; a data entry error in the TLG!  The sixth case is a sequence found only in a papyrus fragment.  The beta code series &lt;code&gt;&lt;bold&gt;o(=&lt;/bold&gt;&lt;/code&gt; should represent an omicron with rough breathing and circumflex – an accentuation that is not possible in Greek. &lt;br /&gt;&lt;br /&gt;The second word list I tried was composed of proper names, including the tricky sequences beta code introduces in its conventions for capitalization.  Out of 53167 capitalized words, the transcoder round tripped perfectly in all but one – again, an error in the TLG data entry that the transcoder corrected!&lt;br /&gt;&lt;br /&gt;That's a total  of 911882 unique strings.  (That's going way beyond carefully chosen unit tests!)  Remarkably, the transcoder had a 100% success rate in correctly formed words.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-5846894043075962036?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/5846894043075962036/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=5846894043075962036' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/5846894043075962036'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/5846894043075962036'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/08/epidoc-transcoding-transformer-bats.html' title='Epidoc transcoding transformer bats 1.000'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-3314130796917264961</id><published>2008-07-10T05:51:00.001-07:00</published><updated>2008-07-10T11:20:43.245-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='tlg'/><category scheme='http://www.blogger.com/atom/ns#' term='Greek'/><title type='text'>Half empty or half full?</title><content type='html'>I frequently assert that classicists, along with biblical scholars, share the distinction of using logical citation schemes to refer to the works they study. This practice is important, since it means that references can apply to any version of a work, in print or digital form. (Briefly, in an &lt;a href="http://vitruviandesign.blogspot.com/2008/04/citation-schemes-empty-content-elements.html"&gt;earlier post&lt;/a&gt;.)&lt;br /&gt;&lt;br /&gt;I have made this claim so often, that I decided it would be a good idea to find out if it were true.&lt;br /&gt;&lt;br /&gt;The TLG offers the largest corpus of ancient Greek, so one way to evaluate how classicists cite their works would be simply to count and summarize the citation schemes used in the TLG.  Sadly, athough this would have been possible until 2000 when the TLG distributed data to its licensees, there is in 2008 no way around the preconceived query interface of the TLG web site.  (The fact that such a simple question as "what citation schemes are used?" is now out of reach illustrates the catastropic consequences for classical studies of the TLG's decision to reverse its decades-old policy of distributing data, in favor of selling access to predetermined user interfaces.)&lt;br /&gt;&lt;br /&gt;As in an earlier post &lt;a href="http://vitruviandesign.blogspot.com/2008/03/first-thousand-years-of-greek.html"&gt;estimating the size of the surviving Greek corpus by period&lt;/a&gt;, we can still use the 2000 version of the TLG Canon distributed on the TLG E disk to get an impression of classicists' citation practice, however.&lt;br /&gt;&lt;br /&gt;As in that post, we'll want to limit ourselves to works transmitted by manuscript copying.  I'll take the simplest approach possible:  count the number of "works" that use each citation scheme.  I won't attempt to normalize in any way the definition of a work:  the five-line &lt;em&gt;Homeric Hymn to the Dioscuri&lt;/em&gt; is one work, as is the entire &lt;em&gt;Iliad&lt;/em&gt;.  With that caveat in mind, let's look at the results.&lt;br /&gt;&lt;br /&gt;The TLG E canon includes 3810 works transmitted by manuscript and having defined citation schemes.  (Note that the Canon includes works not in the E disk; 584 of these works did not yet have a defined citation scheme at the time of the E disk's publication, so I exclude them from our results.)  These 3810 works are represented by an astonishing 194 distinct citation schemes!&lt;br /&gt;&lt;br /&gt;As we might expect, however, the distribution of these schemes is very uneven:  104 citation schemes are used for a single work; only 16 citation schemes are used for more than 13 works.  Let's look more closely at these top 16 citation schemes, which cover 3426 (90%) of the works surveyed.&lt;br /&gt;&lt;br /&gt;&lt;table&gt;&lt;tr&gt;&lt;th style="text-align:left"&gt;Citation scheme&lt;/th&gt;&lt;th style="text-align:left"&gt;Number&lt;/th&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td style="color:red"&gt;volume/page/line&lt;/td&gt;&lt;td&gt;1014&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; section/line&lt;/td&gt;&lt;td&gt;710&lt;/td&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;td  style="color:red"&gt; page/line&lt;/td&gt;&lt;td&gt;517&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; line&lt;/td&gt;&lt;td&gt;348&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; chapter/section/line&lt;/td&gt;&lt;td&gt;334&lt;/td&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;td  style="color:red"&gt; stephanus page/section/line&lt;/td&gt;&lt;td&gt;114&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; book/chapter/section/line&lt;/td&gt;&lt;td&gt;75&lt;/td&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;td  style="color:red"&gt; jebb page/line&lt;/td&gt;&lt;td&gt;54&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; book/section/line&lt;/td&gt;&lt;td&gt;49&lt;/td&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;td  style="color:red"&gt; bekker page/line&lt;/td&gt;&lt;td&gt;44&lt;/td&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;td  style="color:red"&gt; kuehn volume/page/line&lt;/td&gt;&lt;td&gt;39&lt;/td&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;td  style="color:red"&gt; harduin page/section/line&lt;/td&gt;&lt;td&gt;32&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; epistle/section/line&lt;/td&gt;&lt;td&gt;25&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; chapter/line&lt;/td&gt;&lt;td&gt;25&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; epistle/line&lt;/td&gt;&lt;td&gt;23&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt; scholion/line&lt;/td&gt;&lt;td&gt;23&lt;/td&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;th style="color:red"&gt;Total physical schemes&lt;/th&gt;&lt;th style="text-align:left; color:red"&gt;1814 (53%)&lt;/th&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;th&gt;Total logical schemes&lt;/th&gt;&lt;th style="text-align:left"&gt;1612 (47%)&lt;/th&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;th&gt;Grand total&lt;/th&gt;&lt;th style="text-align:left"&gt;3426&lt;/th&gt;&lt;/tr&gt;&lt;/table&gt;&lt;br /&gt;The overall results are not encouraging.  The entries in black are logical schemes:  they total only 47% of the 3426 works.  The entries in red refer instead to physical artifacts like book pages, 53% of the group.  It's small consolation that the numbers are a worst-case scenario:  some works may be cited by both logical and physical reference;  where the TLG uses a logical reference, we can be sure that a logical scheme exists, but where the TLG uses a physical reference system, we can't always exclude the possibility that an alternative logical scheme is available.  For example, the 44 works cited by Bekker page are, of course, the Aristotelian corpus:   many of these have alternative citation schemes by chapter or section.&lt;br /&gt;&lt;br /&gt;If we break the numbers down further by the chronological period of the original text, however, the picture changes.  With the notable exception of Plato, where Stephanus' great edition became the standard for citation, citation by logical scheme is much more prevalent in works of the classical period.  The following table breaks out from the previous listing works dating before about 300 BC.  &lt;br /&gt;&lt;br /&gt;&lt;table&gt;    &lt;caption&gt;&lt;strong&gt;Citation schemes in works of classical date&lt;/strong&gt;&lt;/caption&gt;    &lt;tbody&gt;        &lt;tr&gt;            &lt;td&gt;section/line&lt;/td&gt;            &lt;td&gt;229&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;line&lt;/td&gt;            &lt;td&gt;98&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;bekker page/line&lt;/td&gt;            &lt;td&gt;43&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;stephanus page/section/line&lt;/td&gt;            &lt;td&gt;38&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;chapter/section/line&lt;/td&gt;            &lt;td&gt;20&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;volume/page/line&lt;/td&gt;            &lt;td&gt;18&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;page/line&lt;/td&gt;            &lt;td&gt;16&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;book/chapter/section/line&lt;/td&gt;            &lt;td&gt;11&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;fable/line&lt;/td&gt;            &lt;td&gt;9&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;book/line&lt;/td&gt;            &lt;td&gt;5&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;ode/line&lt;/td&gt;            &lt;td&gt;4&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;book/section/line&lt;/td&gt;            &lt;td&gt;4&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;tetralogy/section/line&lt;/td&gt;            &lt;td&gt;3&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;demonstratio/line&lt;/td&gt;            &lt;td&gt;3&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;epistle/section/line&lt;/td&gt;            &lt;td&gt;3&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;book/demonstratio/line&lt;/td&gt;            &lt;td&gt;2&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;thevenot page/line&lt;/td&gt;            &lt;td&gt;2&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;epistle/line&lt;/td&gt;            &lt;td&gt;2&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;idyll/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;page+column/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;sententia/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;lexical entry/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;proverb/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;folio/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;fable/version/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td&gt;exordium/section/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;        &lt;tr&gt;            &lt;td style="color:red"&gt;usener page/line&lt;/td&gt;            &lt;td&gt;1&lt;/td&gt;        &lt;/tr&gt;      &lt;tr&gt;&lt;th style="color:red"&gt;Total physical schemes&lt;/th&gt;&lt;th style="text-align:left; color:red"&gt;120 (23%)&lt;/th&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;th&gt;Total logical schemes&lt;/th&gt;&lt;th style="text-align:left"&gt;399 (77%)&lt;/th&gt;&lt;/tr&gt;    &lt;tr&gt;&lt;th&gt;Grand total&lt;/th&gt;&lt;th style="text-align:left"&gt;519&lt;/th&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;&lt;br /&gt; The 519 works are cited in 27 different citation schemes.  We could think of that as an "average density" of about 19-20 works per citation scheme, essentially the same as for the overall corpus (194 schemes for 3810 works is also a density of about 19-20 works per citation scheme).   But in this listing, only 23% (120) of the classical works use physical reference systems.  The corpora of Plato and Aristotle constitute the bulk of this material (81 works);  apart from the two great philosophical corpora, only 39 works of the classical period are cited in the TLG by physical reference system – about 8%.&lt;br /&gt;&lt;br /&gt;It's probably the height of political incorrectness to suggest that the most traditional canon of work has been the object of better quality scholarly study (although it's plausible enough that more scholarship should produce better results), but by the single, one-dimensional yardstick of how a work is cited, editors of classical texts have done a far better job capturing the logical structure of their texts than have editors of ancient Greek overall. &lt;br /&gt;&lt;br /&gt;So for classicists interested in creating a digital corpus of Greek, the "news" is mixed.  Roughly half the works in the TLG E Canon already depend on logical reference systems, so we already have a good standard in place for many of our texts.  The classical period is in markedly better shape.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-3314130796917264961?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/3314130796917264961/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=3314130796917264961' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3314130796917264961'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3314130796917264961'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/07/half-empty-or-half-full.html' title='Half empty or half full?'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-1390042865347785259</id><published>2008-04-11T14:53:00.000-07:00</published><updated>2008-04-11T15:03:04.262-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='TEI'/><category scheme='http://www.blogger.com/atom/ns#' term='XML'/><title type='text'>Citation schemes: empty content elements considered harmful</title><content type='html'>Classicists have, by and large, relied on standard, logical citation schemes to cite works of ancient literature.  In the scheme of the &lt;a href="http://en.wikipedia.org/wiki/FRBR"&gt;Functional Requirements for Bibliographic Records&lt;/a&gt; (or FRBR), we could say that classicists have cited notional works using references that could then be applied to any manifestation or expression of that work.&lt;br /&gt;&lt;br /&gt;In the print world, this practice has made it possible for scholars to apply a reference to different printed editions or translations of a work.  As the internet becomes our library, this practice can turn references into machine-actionable entry points to the library (whether the reference is automatically discovered, or manually cited by a scholars).  It is therefore a vital prerequisite that digital editions encode standard, logical citation data such as the book/chapter/section divisions of Thucydides, or the book/line divisions of the Iliad.&lt;br /&gt;&lt;br /&gt;The TEI Guidelines (as so often) offer more than one way to approach the problem.  It is valid TEI to encode citation values as attributes on containing elements that define the logical structure of a document.  Book/chapter/section in Thucydides might be represented by a successive hierarchy of TEI &lt;code&gt;div&lt;/code&gt; elements, for example, or book/line in the Iliad by &lt;code&gt;div&lt;/code&gt; elements containing &lt;code&gt;l&lt;/code&gt; elements;  the citation values could be placed in the &lt;code&gt;@n&lt;/code&gt; attribute of each container.&lt;br /&gt;&lt;br /&gt;Alternatively, since the earliest work of the TEI in the 1980s, the Guidelines have included empty elements (such as the &lt;code&gt;milestone&lt;/code&gt;) that could be used to mark transitional points in a document.  It is easy to find examples of scholarly texts using such empty elements to mark the beginning of a new unit like a chapter or section.&lt;br /&gt;&lt;br /&gt;Arguably, there was little difference between these two approaches in SGML.  In XML, however, scholars should avoid using empty elements to encode citation data.  &lt;br /&gt;&lt;br /&gt;A host of supporting and related technologies have developed around XML in its first decade.  One of the most important is XPath, a notation for referring to parts of an XML document by the document's structure.  Higher-level technologies such as XSLT or implementations of the DOM model in many programming languages in turn support XPath expressions.  The result is that programmers working in many environments can succinctly retrieve a unit like "book 2, chapter 5" of Thucydides with a simple XPath expression like &lt;blockquote&gt;&lt;code&gt;/TEI.2/text/body/div[@type='book' and @n='2']/div[@type='chapter' and @n='5']&lt;/code&gt;&lt;/blockquote&gt;  Content between empty elements, on the other hand, cannot be addressed directly with XPath expressions.  &lt;br /&gt;&lt;br /&gt;Placing citation data on empty elements cuts programmers off from a galaxy of technologies they can use when citation data is kept on containing elements.  Empty citation elements should never be necessary if the citation scheme is in fact a logical hierarchy:  if it is not, consider whether there is a problem either with your choice of citation scheme or with your design of the rest of the document's structure.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-1390042865347785259?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/1390042865347785259/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=1390042865347785259' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1390042865347785259'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1390042865347785259'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/04/citation-schemes-empty-content-elements.html' title='Citation schemes: empty content elements considered harmful'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-45587163046448340</id><published>2008-04-11T14:26:00.000-07:00</published><updated>2008-04-11T15:04:52.008-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='TEI'/><category scheme='http://www.blogger.com/atom/ns#' term='XML'/><title type='text'>Separation of concerns applies to document content, too</title><content type='html'>Twenty years ago, before the internet was open to the public, the print publishing industry was a leader in SGML document markup, and scholarly markup projects tended to think of "documents" as the content bound between a pair of covers.  This heritage is clearly reflected in the TEI Guidelines' thorough inventory of elements to identify "front" and "back" material of documents, or a variety of groupings or collections of texts.&lt;br /&gt;&lt;br /&gt;The major syntactic differences between XML and SGML — insistence on a single hierarchy of elements, each with explicitly marked end — were introduced in part to adapt markup to the needs of a very different environment:  a network of computers exchanging information dynamically.   The already well-understood distinction between semantic markup and presentational markup certainly contributed to the articulation of "separation of concerns" in the design of network applications.  Individuals with different skills could apply appropriate technologies to the different parts of a network application, so in creating an application to run in a web browser, programmers might write the controlling code in javascript, and design specialists define its appearance with CSS.  In a network of semantically structured content, XML plays the vital roles of defining the data structure (explicitly via a schema or DTD, or implicitly in the case of well-formed XML), and of providing a format for data exchange.  The question of what this XML should look like — the kind of question the TEI has considered since the 1980s — had to be rethought.  Humanists might rephrase Sun Microsystem's famous slogan, "The network is the computer," as "The network is the library."&lt;br /&gt;&lt;br /&gt;When applications can exchange structured content, it is straightforward to create compound documents.  Asymmetrically, it can be more difficult to disaggregate a complex document into component parts, since an application then needs a more detailed knowledge of the internal structure of a necessarily more complex document.  An application could easily juxtapose a document in original language with a document in translation, or weave together a commentary with a text associated through a common citation system, for example, but disentangling interleaved translation or commentary from a complex document is more problematic.&lt;br /&gt;&lt;br /&gt;I've been thinking about this in designing a set of TEI documents to represent the multiple texts of the famous Venetus A manuscript of the Iliad.  There are four distinct sets of scholia, in addition to the manuscript's text of the Iliad.  I chose to treat each set as an independent document, and as I am now reaching the stage of putting together applications drawing on those documents, I am glad that I did:  cleanly separated, discrete documents are making that job much easier than it otherwise would be.&lt;br /&gt;&lt;br /&gt;I expect that I will never use the &lt;a href="http://www.tei-c.org/release/doc/tei-p5-doc/en/html/PH.html#PHFAX"&gt;elaborate TEI mechanisms to document the relation of a transcribed document to graphic images&lt;/a&gt;.  In keeping with the guiding principle of separate, discrete documents, I'm associating images of the manuscript with ranges of text through external indices:  here, too, the standoff markup of a separate, simple (non-TEI) document is easy to marshall together with the TEI document of the transcribed text.&lt;br /&gt;&lt;br /&gt;In many ways, TEI P5, with its support for XML namespaces, is nudging scholars towards this kind of document organization.  But we need to push harder:  it's time to move away from monolithic TEI replicas of print or even manuscript sources.  In editing scholarly texts for use on the internet, let each logical component stand alone.&lt;br /&gt;&lt;br /&gt;Coordinating separate documents in a networked library requires a common understanding of how to cite them.  I'll follow up with a note on &lt;a href="http://vitruviandesign.blogspot.com/2008/04/citation-schemes-empty-content-elements.html"&gt;how editors of TEI texts should think about that part of their markup&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-45587163046448340?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/45587163046448340/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=45587163046448340' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/45587163046448340'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/45587163046448340'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/04/separation-of-concerns-applies-to.html' title='Separation of concerns applies to document content, too'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-1862658194711086854</id><published>2008-03-05T07:22:00.000-08:00</published><updated>2008-03-05T12:43:30.022-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='diogenes'/><category scheme='http://www.blogger.com/atom/ns#' term='tlg'/><category scheme='http://www.blogger.com/atom/ns#' term='Greek'/><title type='text'>The first thousand years of Greek</title><content type='html'>How much Greek survives from the classical period?  From the Hellenistic period?&lt;br /&gt;&lt;br /&gt;Those questions were impossible to quantify when I was an undergraduate.  It still might be difficult to get a very precise answer if we wanted to consider inscriptions and papyri, but if we limit ourselves to ancient Greek transmitted to us by manuscript copying, we can get a pretty satisfactory answer for the first thousand years or so of ancient Greek very quickly using the Canon from the TLG E disk.&lt;br /&gt;&lt;br /&gt;The data in the Canon can be systematically manipulated using the &lt;a href="http://www.dur.ac.uk/p.j.heslin/Software/Diogenes/"&gt;Diogenes perl library&lt;/a&gt;.  For each work in the TLG, the Canon contains three fields of information that are of special interest for this question:  one indicates the method of transmission; another contains the word count of the TLG's on line text; and a third field contains a date description.  The method of transmission is important, because the TLG includes "works" that are known only through testimonia or citation — "fragments," as classicists misleadingly call them — where we instead want to estimate how much Greek actually exists.  (We don't care about geographic "fragments" of Hipparchus that are really passages of Strabo.  To get an idea of how much of the TLG is made up of this doubling of content, the TLG E disk contains roughly 75-76 million words;  over 4 million words — roughly 5% of the whole TLG E disk — are quoted "fragments" or testimonia!)  &lt;br /&gt;&lt;br /&gt;While it would be possible to write perl code to query the TLG Canon directly via the Diogenes API, most people would probably find it easier to transform the contents of the Canon into some format where they can use standard technologies.  (I have created both a hierarchical XML version of the Canon, and a normalized relational database version;  possible topics for another blog entry perhaps.)&lt;br /&gt;&lt;br /&gt;The word counts are integer values;  the methods of transmission are indicated by a controlled vocabulary:  manuscript transmission is either 'Cod' or 'cod'.  The only challenge is parsing the Canon's quasi-regular strings describing dates, but there are only a little over 100 unique strings, so scripting a little text munging in your favorite language that supports regular expressions is pretty straightforward.&lt;br /&gt;&lt;br /&gt;The Canon's dates are to a precision of a century, so I interpret all dates as ranges.  A date of "first century AD" could be interpreted as a range of 1-100 AD, and a date of "third or second century BC" could be interepreted as a range of 299 - 100 BC, for example.&lt;br /&gt;&lt;br /&gt;At this point, it's time to let the computer do the counting.  Here are some results to consider:  through 300 AD, the TLG contains over 20 million words, but their chronological distribution is very uneven:&lt;br /&gt;&lt;table &gt;&lt;tr&gt;&lt;th&gt;For works dated after or equal to...&lt;/th&gt; &lt;th&gt;... but before&lt;/th&gt; &lt;th&gt;Number of words&lt;/th&gt; &lt;th&gt;Running total&lt;/th&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;Earliest Greek writing&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;500 BC&lt;/td&gt;&lt;td style="margin : 0; padding : 0; text-align; text-align : right; border: solid black 1px"&gt;384528&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;384528&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;500 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;400 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;2251766&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;2636294&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;400 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;300 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;1762944&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;4399238&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;300 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;200 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;921255&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;5320493&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;200 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;100 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;178655&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;5499148&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;100 BC&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;1 AD&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;1745320&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;7244468&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;1 AD&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;200 AD&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;7583759&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;14828227&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;200 AD&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;300 AD&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;5373095&lt;/td&gt;&lt;td  style="margin : 0; padding : 0; text-align; text-align : right; border : solid black 1px"&gt;20201322&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;&lt;br /&gt;        &lt;br /&gt;&lt;h2&gt;Caveats&lt;/h2&gt;&lt;br /&gt;Roughly 10% of the contents of the TLG E corpus (7680878 words) have dates given as "INCERTUM" or "VARIA":  these are completely omitted from the counts. We can't really know how Greek is distributed beyond the period of the TLG E Canon's coverage, because the TLG project no longer makes the Canon available, except through its "one-size-fits-all" interface (or to answer the questions raised here, "one size fits none").  This is the more troubling since the TLG's online corpus is now a third again as large as it was in 2000 when the E disk was prepared (by the estimate of the TLG website, 99 million words vs. 76 million words for the E disk).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-1862658194711086854?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/1862658194711086854/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=1862658194711086854' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1862658194711086854'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1862658194711086854'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/03/first-thousand-years-of-greek.html' title='The first thousand years of Greek'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-2818975069960240227</id><published>2008-02-22T05:14:00.000-08:00</published><updated>2008-02-22T05:29:22.335-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='diogenes'/><title type='text'>Prediction confirmed:  ubiquitous Diogenes</title><content type='html'>In January, when I blogged a &lt;a href="http://vitruviandesign.blogspot.com/2008/01/looking-for-honest-man-on-linux-ppc.html"&gt;short note about running Diogenes on Linux PPC&lt;/a&gt;, I called Diogenes "ultra portable," and ended by asking, "Diogenes on your XO laptop, iPhone, or other device, anyone?"&lt;br /&gt;&lt;br /&gt;Bruce Robertson has now answered.  He emails, "I thought you'd like to know that I've been running diogenes quite happily on a Nokia N800 palmtop computer running OS2008 for the past two days."&lt;br /&gt;&lt;br /&gt;Another great example of what happens when scholarship and software are open:  other people can (in this instance, quite literally!) take it places the original author probably never imagined.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-2818975069960240227?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/2818975069960240227/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=2818975069960240227' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/2818975069960240227'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/2818975069960240227'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/02/prediction-confirmed-ubiquitous.html' title='Prediction confirmed:  ubiquitous Diogenes'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-6489498070386423299</id><published>2008-02-19T03:44:00.001-08:00</published><updated>2008-04-11T15:04:04.903-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='TEI'/><category scheme='http://www.blogger.com/atom/ns#' term='XML'/><title type='text'>Scholarly markup in XML's second decade</title><content type='html'>XML is now ten years old.  (For those interested in an insider's view of how that all happened, Tim Bray has republished &lt;a href="http://www.tbray.org/ongoing/When/200x/2008/02/10/XML-People"&gt;XML People&lt;/a&gt;.)  For scholarly projects involving semantically structured texts, it is practically a given that they will rely on XML. &lt;br /&gt;&lt;br /&gt;But in actual practice, texts produced by current projects often don't look very different from scholarship based on SGML in the 1980s.  In the next postings on this blog, I want to discuss three suggestions based on my experience with XML over the last decade, and how it contrasts with my experience of SGML in the preceding decade.  In each case, I'll focus on how to follow these suggestions using the &lt;a href="http://www.tei-c.org"&gt;Text Encoding Initiative&lt;/a&gt;'s guidelines.&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;br /&gt;&lt;li&gt;Separation of concerns applies to document content, too. (Now &lt;a href="http://vitruviandesign.blogspot.com/2008/04/separation-of-concerns-applies-to.html"&gt;here&lt;/a&gt;.)&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Citation schemes:  empty content elements considered harmful (Now &lt;a href="http://vitruviandesign.blogspot.com/2008/04/citation-schemes-empty-content-elements.html"&gt;here&lt;/a&gt;.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;What's the diff? Rethinking the critical apparatus.&lt;/li&gt;&lt;br /&gt;&lt;/ol&gt;&lt;br /&gt;&lt;br /&gt;Stay tuned.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-6489498070386423299?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/6489498070386423299/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=6489498070386423299' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/6489498070386423299'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/6489498070386423299'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/02/scholarly-markup-in-xmls-second-decade.html' title='Scholarly markup in XML&apos;s second decade'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-3193201999124840327</id><published>2008-01-09T07:45:00.001-08:00</published><updated>2008-01-09T08:35:31.659-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='diogenes'/><category scheme='http://www.blogger.com/atom/ns#' term='xul'/><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><title type='text'>Looking for an honest man — on Linux PPC</title><content type='html'>Peter Heslin's &lt;a href="http://www.dur.ac.uk/p.j.heslin/Software/Diogenes/index.php"&gt;Diogenes 3.1 &lt;/a&gt; is extremely cleanly  designed, and ultra portable. The server functionality is written in perl, and the new user interface is a &lt;a href="http://www.mozilla.org/projects/xul/"&gt;XUL application&lt;/a&gt;.  One result is that Heslin can provide simple binary installations for Mac OS X, various Windows operating systems, and Linux on x86 architecture. &lt;br /&gt; &lt;br /&gt;This design also makes it easy to install and run Diogenes on any operating system with perl and a XUL application environment.  Using Ubuntu Linux 7.04 on a PPC system, for example, after you download and install the Linux version of Diogenes, you can run Diogenes at least three different ways:&lt;br /&gt;&lt;div style="background-color : #DDDDDD; padding : 1em; padding-top : 0em;"&gt;&lt;br /&gt;1) &lt;span style="color: black"&gt;use &lt;code&gt;xulrunner&lt;/code&gt; to run the graphic interface&lt;/span&gt;&lt;br /&gt;&lt;div style="padding-left: 0.5em; padding-top: 0em;"&gt;&lt;br /&gt;If &lt;code&gt;xulrunner&lt;/code&gt; is not already installed on your system, use Synaptic or &lt;code&gt;apt-get&lt;/code&gt; to install it.  (&lt;code&gt;xulrunner&lt;/code&gt; is in the Development section of the Ubuntu universe repository.) You can now start Diogenes from a terminal with the command &lt;br /&gt;&lt;pre&gt;   xulrunner /usr/local/diogenes/application.ini&lt;/pre&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_72-VxQGtGLY/R4T0NLPvgWI/AAAAAAAAAAM/HgL8nlESNYg/s1600-h/menu-config.png"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;" src="http://2.bp.blogspot.com/_72-VxQGtGLY/R4T0NLPvgWI/AAAAAAAAAAM/HgL8nlESNYg/s320/menu-config.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5153512380812984674" /&gt;&lt;/a&gt;&lt;br /&gt;Better still, edit the properties for the Diogenes menu item that was created by the Diogenes installer.  In the Launcher Properties, enter the command to start &lt;code&gt;xulrunner&lt;/code&gt; as illustrated  here.  Now you can run diogenes from the menu selection.&lt;/div&gt;&lt;br /&gt;2) &lt;span style="color: black"&gt;use Firefox 3 to run the graphic interface&lt;/span&gt;&lt;br /&gt;&lt;div style="padding: 0.5em"&gt;&lt;br /&gt;Version 3 of Firefox includes a full XUL environment that can run external XUL programs like Diogenes.  Beta version 2 of FF3 was released in December;  when a stable release version appears, look for it to show up as an upgrade to Firefox in your Ubuntu repository.  When Firefox 3 is installed on your system, you may alternatively start Diogenes with the command&lt;br /&gt;&lt;pre&gt;   firefox -app /usr/local/diogenes/application.ini&lt;/pre&gt;&lt;br /&gt;As with option 1, you can edit the Diogenes menu item to run this command.&lt;br /&gt;Technically inclined users who are eager to play with the beta version can download source code for the beta release, and follow the very clear instructions &lt;a href="http://developer.mozilla.org/en/docs/Build_Documentation"&gt;here&lt;/a&gt; to install it.  All the prerequisites are standard libraries that are available in Ubuntu repositories.&lt;/div&gt;&lt;br /&gt;3) &lt;span style="color: black"&gt;Browse and search texts from the command line&lt;/span&gt;&lt;br /&gt;&lt;div style="padding: 0.5em"&gt;&lt;br /&gt;The command line user program (named &lt;code&gt;dio&lt;/code&gt;) works just as it does on any other Linux.  Run &lt;code&gt;dio&lt;/code&gt; with no arguments to see its various options.&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;The importance of this flexibility is not that it opens up Diogenes to a vast number of Greek scholars using Linux PPC, Solaris, or some other particular operating system.  Its importance is rather that it keeps Diogenes open to &lt;em&gt;any&lt;/em&gt; platform meeting its simple requirements — including future platforms.&lt;br /&gt;&lt;br /&gt;Diogenes on your XO laptop, iPhone, or other device, anyone?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-3193201999124840327?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/3193201999124840327/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=3193201999124840327' title='8 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3193201999124840327'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3193201999124840327'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2008/01/looking-for-honest-man-on-linux-ppc.html' title='Looking for an honest man — on Linux PPC'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_72-VxQGtGLY/R4T0NLPvgWI/AAAAAAAAAAM/HgL8nlESNYg/s72-c/menu-config.png' height='72' width='72'/><thr:total>8</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-6489032235693808408</id><published>2007-12-27T08:34:00.000-08:00</published><updated>2007-12-27T08:39:08.076-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='grumpy old man'/><category scheme='http://www.blogger.com/atom/ns#' term='open access'/><category scheme='http://www.blogger.com/atom/ns#' term='apa'/><title type='text'>Open access to federally funded research</title><content type='html'>Linked from Slashdot today: tucked into the appropriations act just signed by President Bush, a requirement that &lt;a href="http://www.sciencecodex.com/public_access_mandate_made_law"&gt;the NIH must provide online access to research it has funded&lt;/a&gt;.  This is a tremendous precedent, the first time that the US federal government has made open online access a condition of receiving federal funding for research.&lt;br /&gt;&lt;br /&gt;The NIH is the focus, not the NEH, in part because people understand that medical research matters (as the respective budgets of the NIH and NEH also show). But the NIH was also in the Congressional spotlight because of the sustained advocacy of leading scientists, such as the open letter to Congress signed by &lt;a href="http://www.taxpayeraccess.org/nobelists2004.html"&gt;25 Nobel laureates in 2004&lt;/a&gt; and by &lt;a href="http://www.taxpayeraccess.org/bof.html"&gt;26 Nobel winners in 2007&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Meanwhile, the American Philological Association, the professional organization that purports to represent classical studies, has inaugurated &lt;a href=""&gt;a multimillion dollar fundraising campaign to establish a "Digital Portal" centered on subscription-based access to a bibliography of print publications&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;em&gt;fuimus Troes, fuit Ilium&lt;/em&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-6489032235693808408?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/6489032235693808408/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=6489032235693808408' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/6489032235693808408'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/6489032235693808408'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2007/12/open-access-to-federally-funded.html' title='Open access to federally funded research'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-8618675781537033329</id><published>2007-12-27T07:17:00.000-08:00</published><updated>2007-12-27T07:22:27.779-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='images'/><category scheme='http://www.blogger.com/atom/ns#' term='lycian'/><category scheme='http://www.blogger.com/atom/ns#' term='citation'/><category scheme='http://www.blogger.com/atom/ns#' term='homer multitext'/><category scheme='http://www.blogger.com/atom/ns#' term='google maps'/><title type='text'>Zoom!</title><content type='html'>In the Ur-web of the early 1990s, images came in fixed sizes. You might get a thumbnail-sized image, a smaller version or a larger version, but generally what appeared in your browser was a full, one-to-one view of a distinct image as it was delivered to you from a Web server.&lt;br /&gt;&lt;br /&gt;Today, it's increasingly common for server- and client-side applications to manipulate what is, at least notionally, a single image that a user can navigate through.  Google defined the current state of the art in browser-based image navigation when it introduced Google Maps in 2005.  Its clever use of AJAX to load adjacent tiles at appropriate scales creates the illusion of continuous navigation of the whole earth.&lt;br /&gt;&lt;br /&gt;The same technology can be applied to any image.  At University College, London, the Centre for Advanced Spatial Analysis has developed "&lt;a href="http://www.casa.ucl.ac.uk/software/googlemapimagecutter.asp"&gt;The Google Maps Image Cutter&lt;/a&gt;," an application to generate from any digital image the image tiles required by a Google maps-style web application.&lt;br /&gt;&lt;br /&gt;A couple of projects I'm working on apply this technique to browse images that cannot be displayed in full detail in a single view because of their high resolution or awkward shape. The Center for Hellenic Studies' Homer Multitext Project has &lt;a href="http://chs75.harvard.edu/manuscripts/"&gt;Google-mapped high-resolution photographs of Iliadic manuscripts&lt;/a&gt;.  I've recently &lt;a href="http://aristarchus.holycross.edu/trmmili/documentation/gmappedImages"&gt;Google-mapped drawings and photographs of several dozen inscriptions in the Lycian language&lt;/a&gt;.  &lt;br /&gt;&lt;br /&gt;This is an easily implemented and effective way to let users explore an image.  It comes at the cost of one tiny little white lie:  we have to pretend to Google that the coordinate space of our rectangular image works like a Mercator projection of a spheroid (the earth).  &lt;br /&gt;&lt;br /&gt;This is innocent enough, if we recognize what we're doing, but it should provoke more serious reflection about how we use images and cite them in scholarly work.  We need to define recognizable ways of referring to parts of an image independently of the state of a user's panning and zooming.  I'll post more on that topic before long.  For now, enjoy the pictures.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-8618675781537033329?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/8618675781537033329/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=8618675781537033329' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8618675781537033329'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8618675781537033329'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2007/12/zoom.html' title='Zoom!'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-4841955462893445877</id><published>2007-12-11T04:36:00.000-08:00</published><updated>2007-12-11T04:41:52.514-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='perseus'/><category scheme='http://www.blogger.com/atom/ns#' term='living dinosaur'/><title type='text'>Vingt ans après</title><content type='html'>Tonight, several of the Perseus project's original musketeers are gathering to observe the twentieth anniversary of the grant proposal to the Annenberg Foundation that jump-started the project.  I'm sure that gray hair, sagging waist lines and altered career paths will prompt private reflections, but here's the fact that grabs me now:  the Perseus project is older than three quarters of the undergraduates I teach.&lt;br /&gt;&lt;br /&gt;My current students were still toddlers when the first public version of Perseus was released on CD.  I doubt any of them have heard of, much less remember, Apple's HyperCard;  it will be hard for them to imagine how exciting it was when a hypertext system first became available on personal computers.&lt;br /&gt;&lt;br /&gt;They were learning to read or just beginning elementary school when Perseus made its astonishingly rapid transition to a Web delivery system.  They probably are unaware that the internet was not always open to commercial use, and have little experience that would help them appreciate the importance of design decisions early in the history of Perseus.  Can they grasp how the choice of SGML for markup of texts made it possible to generate both HyperCard stacks and Web pages from a single source?&lt;br /&gt;&lt;br /&gt;Now they are in college, and the Perseus project has open-sourced both its code and key data including all its ancient texts (as I observed on &lt;a href="http://vitruviandesign.blogspot.com/2007/11/thanksgiving.html"&gt;Thanksgiving&lt;/a&gt;).  Will they understand how this opens up to them unprecedented opportunities to build on the work of their predecessors, or have we conditioned them to see themselves only as passive consumers? &lt;br /&gt;&lt;br /&gt;Are we raising up a new generation to join in the hard work ahead of us?   All for one, and one for all!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-4841955462893445877?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/4841955462893445877/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=4841955462893445877' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/4841955462893445877'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/4841955462893445877'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2007/12/vingt-ans-aprs.html' title='Vingt ans après'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-8205748197726393802</id><published>2007-11-24T17:42:00.000-08:00</published><updated>2007-11-24T17:52:15.394-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='diogenes'/><category scheme='http://www.blogger.com/atom/ns#' term='grumpy old man'/><category scheme='http://www.blogger.com/atom/ns#' term='perseus'/><title type='text'>Thanksgiving</title><content type='html'>If you study ancient Greek, you can be thankful in 2007.  This fall, two of our discipline's most important scholarly instruments have gone through extraordinary metatmorphoses.  First, Peter Heslin released version 3 of Diogenes (&lt;a href="http://www.dur.ac.uk/p.j.heslin/Software/Diogenes/"&gt;http://www.dur.ac.uk/p.j.heslin/Software/Diogenes/&lt;/a&gt;);  then this month, the Perseus project (&lt;a href="http://www.perseus.tufts.edu/hopper"&gt;http://www.perseus.tufts.edu/hopper&lt;/a&gt;)&lt;br /&gt;announced that source code and text data are being made available under open licenses. &lt;br /&gt;&lt;br /&gt;Diogenes now directly integrates automated morphological analyses of ancient Greek from the Perseus project's morphological parser.  The Perseus project's new open licenses guarantee that Peter Heslin will not be the last scholar to draw on the rich resources created at Perseus over the past two decades.&lt;br /&gt;&lt;br /&gt;Perhaps these developments would be unremarkable in disciplines where contributions through collaborative work and critical assessment of evidence are valued more highly than career advancement.  In the humanities, they stand out against a bleak landscape of subscription services and other forms of restrictions on access to scholarly work.&lt;br /&gt;&lt;br /&gt;Taken together, Diogenes and Perseus illustrate the kind of cross-pollination that is possible when reuse of digital scholarly works is not outlawed.  If enough classicists notice, we may have more good Thanksgivings ahead of us in the future.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-8205748197726393802?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/8205748197726393802/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=8205748197726393802' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8205748197726393802'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8205748197726393802'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2007/11/thanksgiving.html' title='Thanksgiving'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-1755996254369989707</id><published>2007-11-01T05:38:00.000-07:00</published><updated>2007-11-24T17:54:12.297-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='diogenes'/><category scheme='http://www.blogger.com/atom/ns#' term='tlg'/><category scheme='http://www.blogger.com/atom/ns#' term='perseus'/><title type='text'>Remembering Ted Brunner</title><content type='html'>This summer I read the Washington &lt;em&gt;Post&lt;/em&gt;'s lengthy obituary of Ted Brunner.  Few classical scholars are made the subject of so many column inches in a national paper, so I was surprised this fall to discover that none of my Classics students knew who Ted Brunner was.  The same quite serious majors who recognized the authors of eminently forgettable footnotes on Greek or Latin texts had apparently never heard of the director of one of the later twentieth century's most influential digital projects in the humanities.  We classicists really have a lot of teaching to undo.&lt;br /&gt;I leave it to others who knew Ted better than I to eulogize or analyze him.  I offer only two observations from first-hand experience.&lt;br /&gt;First, he remained always relentlessly focussed on &lt;strong&gt;data&lt;/strong&gt;.  The TLG was not about producing software:  if you wanted software, Ted's attitude was that you should write your own.  (Dinosaurs like myself will recall how far he could take this position.  In the early years of the TLG, the project's at best arcane, in many ways bizarre data formats were almost aggressively undocumented:  you got a nine-track tape, and if you wanted to understand the data, you were welcome to reverse-engineer the format as best you could.) In his own way, Ted Brunner was an early advocate of separation of concerns, and his view has been validated by the range of software developed over the past two decades for using the TLG's data. Most recently, Peter Heslin's release of &lt;a href="http://www.dur.ac.uk/p.j.heslin/Software/Diogenes/"&gt;version 3.x of Diogenes&lt;/a&gt; is a stunning piece of work (and deserves far more recognition than it has received).  It integrates the TLG data with output from the Perseus project's morphological parser — a piece of software that in turn would probably  never have been developed if the TLG had not existed.  What a pity that since Ted's retirement the TLG has turned its back on this principle, and permits access to material digitized since 2000 only through its own, one-size-fits-all web interface.&lt;br /&gt;Second, however sharply he could react to people he saw as threatening the TLG's work, he was extremely generous with his time to anyone interested in the TLG, no matter how unimportant.  When I was a very lowly graduate student at Berkeley, I had a chance to visit the TLG project at Irvine, and Ted set aside an entire morning to give me a personal tour and answer my questions.  (I am sure that I am not the only visitor to the TLG to come away with a vivid memory of Ted  starting the standard pre-recorded TLG slide show and proudly pointing out that the narrator's incredible bass voice was none other than the voice of Tony the Tiger.)&lt;br /&gt;So two small points — he focused on his data, and was generous to people who could not obviously or immediately help him.&lt;br /&gt;I hope someone could remember as much about me after reading my obituary.&lt;br /&gt;&lt;span style="font-variant: small-caps;font-family:'Trebuchet MS',Trebuchet,Arial,Verdana,Sans-serif;font-size:78%;color:gray;"   &gt;The Feast of All Saints, 2007.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-1755996254369989707?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/1755996254369989707/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=1755996254369989707' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1755996254369989707'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/1755996254369989707'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2007/11/remembering-ted-brunner.html' title='Remembering Ted Brunner'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-8578632694580957106</id><published>2007-10-31T05:49:00.000-07:00</published><updated>2007-10-31T05:58:37.389-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dnid'/><title type='text'>Identifying discrete objects using the dnid scheme</title><content type='html'>I've been working with Sebastian Heath at the &lt;a href="http://numismatics.org/"&gt;American Numismatic Society&lt;/a&gt; on a scheme for referring to discrete objects with unique identifiers.  &lt;br /&gt;The fundamental idea is very simple.  Since internet domain names already provide a means of uniquely identifying a namespace (think of XML namespaces), we can apply domain names as qualifiers to ensure the uniqueness of existing, stable IDs for many kinds of materials that humanists cite.&lt;br /&gt;I'll have more to post on this topic in the future, but for now, you can see where we're headed at &lt;a href="http://www.dnid-community.org"&gt;http://www.dnid-community.org&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-8578632694580957106?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/8578632694580957106/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=8578632694580957106' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8578632694580957106'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/8578632694580957106'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2007/10/identifying-discrete-objects-using-dnid.html' title='Identifying discrete objects using the dnid scheme'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7419001987588063203.post-3963943091276242368</id><published>2007-10-31T05:23:00.000-07:00</published><updated>2007-10-31T05:45:47.947-07:00</updated><title type='text'>Vitruvian design</title><content type='html'>&lt;blockquote&gt;All people, not only architects, are able to appreciate what is good work.  The difference between architects and uneducated people is that the uneducated person cannot understand what the work will be unless he has seen it completed;  whereas the architect, as soon as he has built it in his mind, but before he has actually begun, has a complete vision  of what kind of work it will be in respect to the elegance, the efficiency, and the correctness of its design.&lt;/blockquote&gt;&lt;br /&gt;That is my rendering of Vitruvius 6.8.10:&lt;br /&gt;&lt;blockquote&gt;namque omnes homines, non solum architecti, quod est bonum, possunt probare, sed inter idiotas et eos hoc est discrimen, quod idiota, nisi factum viderit, non potest scire, quid sit futurum, architectus autem, simul animo constituerit, antequam inceperit, et venustate et usu et decore quale sit futurum, habet definitum.&lt;/blockquote&gt;&lt;br /&gt;Today we are desperately short of architects for scholarship in the era of the internet.  In this blog I will occasionally comment on some of my own work, and on other digital scholarship in the  humanities that, in my view, is contributing to the construction of a better edifice.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7419001987588063203-3963943091276242368?l=vitruviandesign.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vitruviandesign.blogspot.com/feeds/3963943091276242368/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7419001987588063203&amp;postID=3963943091276242368' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3963943091276242368'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7419001987588063203/posts/default/3963943091276242368'/><link rel='alternate' type='text/html' href='http://vitruviandesign.blogspot.com/2007/10/vitruvian-design.html' title='Vitruvian design'/><author><name>Neel Smith</name><uri>http://www.blogger.com/profile/10590621399352493304</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
