<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-9916290</id><updated>2012-02-16T09:46:49.090-05:00</updated><title type='text'>Blog Droppings</title><subtitle type='html'>Always leaving behind a tell-tale trail of info-nuggets...</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://blog-droppings.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://blog-droppings.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Dan Lentz</name><uri>http://www.blogger.com/profile/04898791309856994697</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://photos2.flickr.com/2889006_e72da475be_s.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>5</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-9916290.post-110815956398165272</id><published>2005-02-11T17:06:00.000-05:00</published><updated>2006-10-12T21:45:44.656-04:00</updated><title type='text'>The Journal of Machine Learning Research</title><content type='html'>&lt;p&gt;Microtome Publishing provides excellent online open-content, including every issue of JMLR (in cooperation with MIT cSAIL).
&lt;/p&gt;
&lt;p&gt;&lt;a href="http://www.jmlr.org"&gt;The Journal of Machine Learning Research&lt;/a&gt;
&lt;table&gt;&lt;tr&gt;&lt;td&gt;&lt;a href="http://jmlr.csail.mit.edu/images/cover.jpg"&gt;&lt;img src="http://jmlr.csail.mit.edu/images/cover.jpg" height="100" width="70" border="0" align="left" hspace="4" vspace="4" alt="JMLR Cover Image" title="JMLR Cover Image" /&gt;&lt;/a&gt;&lt;/td&gt;
&lt;td&gt;&lt;blockquote&gt;&lt;span style="font-family:Sans;font-size:0.8em;"&gt;The Journal of Machine Learning Research (JMLR) provides an international forum for the electronic and paper publication of high-quality scholarly articles in all areas of machine learning. JMLR has a commitment to rigorous yet rapid reviewing. Final versions are  &lt;/span&gt;&lt;span style="font-family:Sans;font-size:0.8em;"&gt;&lt;a href="http://jmlr.csail.mit.edu/papers"&gt;published electronically&lt;/a&gt;&lt;/span&gt;&lt;span style="font-family:Sans;font-size:0.8em;"&gt; (ISSN 1533-7928) immediately upon receipt. Until the end of 2004, paper volumes (ISSN 1532-4435) were published 8 times annually and sold to libraries and individuals by  &lt;/span&gt;&lt;span style="font-family:Sans;font-size:0.8em;"&gt;&lt;a href="http://mitpress.mit.edu/JMLR"&gt;the MIT Press&lt;/a&gt;&lt;/span&gt;&lt;span style="font-family:Sans;font-size:0.8em;"&gt;. Paper volumes (ISSN 1532-4435) are now published and sold by &lt;/span&gt;&lt;span style="font-family:Sans;font-size:0.8em;"&gt;&lt;a href="http://www.mtome.com/Publications/jmlr.html"&gt;Microtome Publishing&lt;/a&gt;&lt;/span&gt;&lt;span style="font-family:Sans;font-size:0.8em;"&gt;.&lt;/span&gt;
&lt;/blockquote&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;&lt;/p&gt;
&lt;p&gt;
Some recent publications include:
&lt;ul&gt;
&lt;li&gt;&lt;a href="http://jmlr.csail.mit.edu/papers/special/colt02.html" title="JMLR Special Issue on Computational Learning Theory"&gt;JMLR Special Issue on Computational Learning Theory&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://jmlr.csail.mit.edu/papers/special/ilp03.html" title="JMLR Special Issue on Inductive Logic Programming"&gt;JMLR Special Issue on Inductive Logic Programming&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://jmlr.csail.mit.edu/papers/v5/" title="JMLR Volume 5"&gt;JMLR Volume 5&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://jmlr.csail.mit.edu/papers/v6/" title="JMLR Volume 6"&gt;JMLR Volume 6&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/P&gt;
&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9916290-110815956398165272?l=blog-droppings.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://blog-droppings.blogspot.com/feeds/110815956398165272/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=9916290&amp;postID=110815956398165272&amp;isPopup=true' title='60 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110815956398165272'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110815956398165272'/><link rel='alternate' type='text/html' href='http://blog-droppings.blogspot.com/2005/02/journal-of-machine-learning-research.html' title='The Journal of Machine Learning Research'/><author><name>Dan Lentz</name><uri>http://www.blogger.com/profile/04898791309856994697</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://photos2.flickr.com/2889006_e72da475be_s.jpg'/></author><thr:total>60</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9916290.post-110744979142983034</id><published>2005-02-03T11:56:00.000-05:00</published><updated>2006-10-12T21:45:44.600-04:00</updated><title type='text'>ROR Metadata - “Divide and Describe”</title><content type='html'>&lt;p&gt;
The major complaint that I have about currently implementable semantic web technologies is the seemingly endless pit of complexity combined with the meager resulting functionality.  You can spend literally days on end going learning XML vocabularies, starting with a general study of RDF, through the Ontology Swamps (&lt;a href="http://www.w3.org/TR/owl-guide/" title="OWL Guide"&gt;OWL&lt;/a&gt;, &lt;a href="http://www.daml.org/" title="DAML.org"&gt;DAML&lt;/a&gt;, &lt;a href="http://www.daml.org/language/" title="DAML+OIL"&gt;DAML+OIL&lt;/a&gt;, &lt;a href="http://www.daml.org/services/owl-s/1.1/" title="OWL-S 1.1 Release"&gt;OWL-S&lt;/a&gt;, &lt;a href="http://www.cs.umd.edu/projects/plus/SHOE/" title="SHOE"&gt;SHOE&lt;/a&gt;, ... ) and then on to the specific vocabulary applications which are only now (slowly) gaining recognition.  Some of the more useful ones that are showing up these days include: &lt;a href="http://www.foaf-project.org/" title="FOAF"&gt;FOAF&lt;/a&gt;, &lt;a href="http://usefulinc.com/doap" title="DOAP: Description of a Project"&gt;DOAP&lt;/a&gt;, and &lt;a href="http://www.rddl.org/" title="XML Resource Directory Description Language (RDDL)"&gt;RDDL.&lt;/a&gt;
&lt;/p&gt;

&lt;p&gt;&lt;a href="http://www.rorweb.com/" title="ROR - Resources of a Resource"&gt;ROR - Resources of a Resource&lt;/a&gt; is a new vocabulary, however, that is immediately implementable, and which strikes me as being nicely balanced between the general abstraction required to be useful in a range of use-cases, and the grounded specificity that allows it to be easily understood, implemented, and (hopefully) made use of in the real world.  Refer to the &lt;a href="http://www.rorweb.com/spec.htm" title="ROR Spec"&gt;ROR Specification&lt;/a&gt; for details, but in many cases the ROR metadata can (probably &lt;em&gt;should&lt;/em&gt;) be automatically generated, and provides a clear first-cut at semantically mapping the available components of an online resource.  In addition to an &lt;a href="http://www.rorweb.com/template.htm" title="Get your ROR file!"&gt;implementation template&lt;/a&gt;, the author provides a nifty browser-based &lt;a href="http://www.rorweb.com/servlet/rorex" title="ROR Explorer"&gt;ROR Explorer&lt;/a&gt; application, which highlights some basic examples of ROR metadata in action. 
&lt;/p&gt;

&lt;p&gt;Of course the chicken and egg conundrum is still out there -- in order to be useful, a large number of sites must implement this technology, which won't be compelling until they do.  However, this vocabulary seems both easy to use and easy to make substantial use &lt;em&gt;of&lt;/em&gt;, and that, at least, is a big step in the right direction.&lt;/p&gt;
&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9916290-110744979142983034?l=blog-droppings.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://blog-droppings.blogspot.com/feeds/110744979142983034/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=9916290&amp;postID=110744979142983034&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110744979142983034'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110744979142983034'/><link rel='alternate' type='text/html' href='http://blog-droppings.blogspot.com/2005/02/ror-metadata.html' title='ROR Metadata - &amp;#8220;Divide and Describe&amp;#8221;'/><author><name>Dan Lentz</name><uri>http://www.blogger.com/profile/04898791309856994697</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://photos2.flickr.com/2889006_e72da475be_s.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9916290.post-110739166074173045</id><published>2005-02-02T19:47:00.000-05:00</published><updated>2006-10-12T21:45:44.544-04:00</updated><title type='text'>Porter Stemming</title><content type='html'>&lt;p&gt;
The English language is a fairly inflexible algorithmic domain, which makes porter-stemming all-the-more useful an algorithm, and impressive in its accuracy.  From the author's &lt;a href="http://www.tartarus.org/~martin/PorterStemmer/"&gt;official web page&lt;/a&gt;, which also contains useful starter-implementations in various modern languages:
&lt;/p&gt;&lt;blockquote&gt;
The Porter stemming algorithm (or &amp;#8216;Porter stemmer&amp;#8217;) is a process for removing the commoner morphological and inflexional endings from words in English. Its main use is as part of a term normalisation process that is usually done when setting up Information Retrieval systems.
&lt;/blockquote&gt;&lt;p&gt;
Essentially, this means that English-language words are reduced to their stem forms, for example:
&lt;/p&gt;&lt;ul&gt;
&lt;li&gt; tags =&amp;gt; tag&lt;/li&gt;
&lt;li&gt; nicely =&amp;gt; nice&lt;/li&gt;
&lt;li&gt; drawers =&amp;gt; drawer&lt;/li&gt;
&lt;li&gt; usefulness =&amp;gt; use&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;
And it gets far more complicated from there, due to the bizarre contortions required for spelling in English, whose vocabulary draws so widely on other languages.  As the Internet becomes increasingly an information retrieval application, and as English-language content spreads throughout the Internet as the new dataset of choice, semantically intelligent processing will nearly always require this type of stemming in order to normalize text into a canonical form. And so Porter Stemming will become more and more ubiquitous.
&lt;/p&gt;&lt;p&gt;
An excellent online example of how useful this can be is &lt;a href="http://www.hackdiary.com/" title="hackdiary"&gt;hackdiary's&lt;/a&gt; utility for stem-checking the category tags used for a  &lt;a href="http://del.icio.us/" title="del.icio.us"&gt;del.icio.us&lt;/a&gt; account: &lt;a href="http://www.hackdiary.com/stemtags/" title="del.icio.us tag stemmer"&gt;del.icio.us tag stemmer&lt;/a&gt;
&lt;/p&gt;&lt;p&gt;
For the very serious student of stemming, Porter's latest work is the essentially domain specific language &lt;a href="http://snowball.tartarus.org/" title="Snowball"&gt;Snowball&lt;/a&gt;, which is distributed with English, French, Spanish, Portuguese, Italian, German, Dutch, Swedish, Norwegian, Danish, Russian, and Finnish stemming code.  This is the site he directs readers toward for future enhancements and research in stemming algorithms.
&lt;/p&gt;&lt;p&gt;
&lt;strong&gt;Quick access to the code:&lt;/strong&gt; &lt;a href="http://www.tartarus.org/~martin/PorterStemmer/commonlisp.txt"&gt;Common Lisp version&lt;/a&gt;, &lt;a href="http://www.tartarus.org/~martin/PorterStemmer/perl.txt"&gt;Perl version&lt;/a&gt;, &lt;a href="http://www.tartarus.org/~martin/PorterStemmer/python.txt"&gt;Python version&lt;/a&gt;, &lt;a href="http://www.tartarus.org/~martin/PorterStemmer/ruby.txt"&gt;Ruby version&lt;/a&gt;, and &lt;a href="http://www.tartarus.org/~martin/PorterStemmer/js.txt"&gt;Javascript version&lt;/a&gt;.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9916290-110739166074173045?l=blog-droppings.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://blog-droppings.blogspot.com/feeds/110739166074173045/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=9916290&amp;postID=110739166074173045&amp;isPopup=true' title='75 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110739166074173045'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110739166074173045'/><link rel='alternate' type='text/html' href='http://blog-droppings.blogspot.com/2005/02/porter-stemming.html' title='Porter Stemming'/><author><name>Dan Lentz</name><uri>http://www.blogger.com/profile/04898791309856994697</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://photos2.flickr.com/2889006_e72da475be_s.jpg'/></author><thr:total>75</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9916290.post-110490126875126851</id><published>2005-01-05T00:01:00.000-05:00</published><updated>2006-10-12T21:45:44.484-04:00</updated><title type='text'>Why flickr is such a definitive step forward</title><content type='html'>&lt;style type="text/css"&gt;.flickr-photo { border: solid 1px #000000; }.flickr-frame {	float: right; text-align: center; margin-left: 15px; margin-bottom: 15px; }.flickr-caption { font-size: 0.8em; margin-top: 0px; }&lt;/style&gt;&lt;div class="flickr-frame"&gt;	&lt;a href="http://www.flickr.com/photos/81991766@N00/2889025/" title="photo sharing"&gt;&lt;img src="http://photos3.flickr.com/2889025_64fb7853a4_t.jpg" class="flickr-photo" alt="2002_0820_200212AA" /&gt;&lt;/a&gt;&lt;br /&gt;	&lt;span class="flickr-caption"&gt;		&lt;a href="http://www.flickr.com/photos/81991766@N00/2889025/"&gt;2002_0820_200212AA&lt;/a&gt;,&lt;br /&gt; originally uploaded by &lt;a href="http://www.flickr.com/people/81991766@N00/"&gt;danlentz&lt;/a&gt;.	&lt;/span&gt;&lt;/div&gt;Digital photography is compelling for any number of reasons, but just about all of the sundry "photo album" sites have been kind of lacking (I don't even remember the password to get back into half of the albums I've left hanging around in various dark corners of the internet.  Flickr combines all of the expected niceties of a modern, internet enabled photo album, with the dynamic character and unpredictability that comes from social software architecture -- and it seems to be an effective conceptual "twist" that makes digital albums as compelling as digital photography has turned out to be.&lt;br /&gt;&lt;br /&gt;If course, it also provides an ideal platform for integration with blogs and other syndicated content -- an ideal publishing medium that can now be leveraged with one's unlimited supply of personal digital image content.  Feeds for my Flickr PhotoStreams: &lt;a href="http://www.flickr.com/services/feeds/photos_public.gne?id=81991766@N00&amp;format=rss_200"&gt;RSS 2.0&lt;/a&gt;  &lt;a href="http://www.flickr.com/services/feeds/photos_public.gne?id=81991766@N00&amp;format=atom_03"&gt;Atom&lt;/a&gt;&lt;br clear="all" /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9916290-110490126875126851?l=blog-droppings.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://blog-droppings.blogspot.com/feeds/110490126875126851/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=9916290&amp;postID=110490126875126851&amp;isPopup=true' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110490126875126851'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110490126875126851'/><link rel='alternate' type='text/html' href='http://blog-droppings.blogspot.com/2005/01/why-flickr-is-such-definitive-step.html' title='Why flickr is such a definitive step forward'/><author><name>Dan Lentz</name><uri>http://www.blogger.com/profile/04898791309856994697</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://photos2.flickr.com/2889006_e72da475be_s.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-9916290.post-110477620439970203</id><published>2005-01-03T13:16:00.000-05:00</published><updated>2006-10-12T21:45:44.418-04:00</updated><title type='text'></title><content type='html'>&lt;h2&gt;&lt;em&gt;"HitMaps"&lt;/em&gt; are one-up on the old-fashioned Hit-Counters&lt;/h2&gt;

&lt;a href='http://hitmaps.open.ac.uk/stats/blog-droppings.blogspot.com-/map-world.html' id='hitMapsLink'&gt;
&lt;img src='http://hitmaps.open.ac.uk/counter/index2.php?url=http://blog-droppings.blogspot.com/' border=0 alt='Locations of visitors to this page' onError="this.onError=null; this.src='http://kmi.open.ac.uk/projects/hitmaps/imgs/hitmaps-soon.jpg'; document.getElementById('hitMapsLink').href='http://kmi.open.ac.uk/projects/hitmaps/'"&gt;&lt;br&gt;
Where are visitors to this page?&lt;br&gt;
(Auto-update daily since 03-JAN-05)&lt;/a&gt;


The &lt;a href="http://kmi.open.ac.uk/projects/hitmaps/"&gt;HitMap&lt;/a&gt; project, essentially a combination of Geo-encoded URL semantics with a nice clustering algorithm, is one of the more mundane projects going on at &lt;a href="http://www.open.ac.uk/"&gt;The Open University&lt;/a&gt; in the UK.  There are a very wide variety of SemanticWeb and Social Software projects under way, many of which seem to be producing tangible results.  The reasearch is a component of the &lt;a href="http://kmi.open.ac.uk/projects/buddyspace/home-body.html"&gt;BuddySpace&lt;/a&gt; projects, hosted by the &lt;a href="http://kmi.open.ac.uk/"&gt;Knowledge Media Institute&lt;/a&gt; group.  Really fun stuff.
&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/9916290-110477620439970203?l=blog-droppings.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://blog-droppings.blogspot.com/feeds/110477620439970203/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=9916290&amp;postID=110477620439970203&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110477620439970203'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9916290/posts/default/110477620439970203'/><link rel='alternate' type='text/html' href='http://blog-droppings.blogspot.com/2005/01/hitmaps-are-one-up-on-old-fashioned.html' title=''/><author><name>Dan Lentz</name><uri>http://www.blogger.com/profile/04898791309856994697</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://photos2.flickr.com/2889006_e72da475be_s.jpg'/></author><thr:total>0</thr:total></entry></feed>
