<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Terrell Russell: This Old Network &#187; flickr</title>
	<atom:link href="http://weblog.terrellrussell.com/tag/flickr/feed/" rel="self" type="application/rss+xml" />
	<link>http://weblog.terrellrussell.com</link>
	<description>Ideas on interconnections, identity, and information from all sides.</description>
	<lastBuildDate>Thu, 22 Dec 2011 15:31:03 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
		<item>
		<title>Flickr Commons adds tags to Library of Congress images</title>
		<link>http://weblog.terrellrussell.com/2008/01/flickr-commons-adds-tags-to-library-of-congress-images/</link>
		<comments>http://weblog.terrellrussell.com/2008/01/flickr-commons-adds-tags-to-library-of-congress-images/#comments</comments>
		<pubDate>Wed, 16 Jan 2008 19:58:04 +0000</pubDate>
		<dc:creator>Terrell Russell</dc:creator>
				<category><![CDATA[Default]]></category>
		<category><![CDATA[flickr]]></category>
		<category><![CDATA[flickrcommons]]></category>
		<category><![CDATA[libraryofcongress]]></category>
		<category><![CDATA[loc]]></category>
		<category><![CDATA[PowerOfMany]]></category>
		<category><![CDATA[SocialTagging]]></category>

		<guid isPermaLink="false">http://weblog.terrellrussell.com/2008/01/flickr-commons-adds-tags-to-library-of-congress-images/</guid>
		<description><![CDATA[Just announced this morning &#8211; a fantastic partnership between Flickr and the Library of Congress. Flickr Commons The Library of Congress Pilot Project The Library of Congress has a Prints and Photographs Online Catalog comprised of over 1 million images (and growing) that have been available online for over 10 years. Back in June of [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.loc.gov/blog/?p=233">Just announced this morning</a> &#8211; <a href="http://blog.flickr.com/en/2008/01/16/many-hands-make-light-work/">a fantastic partnership between Flickr and the Library of Congress</a>.</p>
<p><a href="http://flickr.com/commons">Flickr Commons</a></p>
<blockquote><p>The Library of Congress Pilot Project</p>
<p>The Library of Congress has a Prints and Photographs Online Catalog comprised of over 1 million images (and growing) that have been available online for over 10 years.</p>
<p>Back in June of 2007, we began our first collaboration with a civic institution to facilitate giving people a voice in describing the content of a publicly-held photography collection.</p>
<p>The key goals of this pilot project are to firstly give you a taste of the hidden treasures in the huge Library of Congress collection, and secondly to how your input of a tag or two can make the collection even richer.</p>
<p>You&#8217;re invited to help describe photographs in the Library of Congress&#8217; collection on Flickr, by adding tags or leaving comments.*</p>
<p>*Any Flickr member is able to add tags or comment on these collections. If you&#8217;re a dork about it, shame on you. This is for the good of humanity, dude!!</p></blockquote>
<p>I&#8217;m very excited about this and will be participating.  Just look at all that good old-fashioned well-formed library data in each photo&#8217;s description&#8230;</p>
<p>However, I think there&#8217;s a missed opportunity here to leverage some of the extra power in having many people tag.</p>
<p>At Flickr&#8217;s sister site, del.icio.us, we&#8217;ve seen wonderful growth and understanding around how communities of users tag collectively.  They&#8217;re not necessarily collaborating, which is why del.icio.us holds some special properties we do not see in the tagging at Flickr.  However, I think Flickr should expose the identities/usernames along with the tags associated with a photo.  Most photos are only tagged by the owner &#8211; it&#8217;s a safe assumption that this will continue to occur into the future.  However, when the tagger is NOT the owner/uploader of the photo, this information is currently lost and not passed along in the Flickr interface.</p>
<p>Please, Flickr, expose the &#8216;who&#8217; part of the tagging triumvirate (see <a href="http://www.vanderwal.net/folksonomy.html">last paragraph of Vander Wal&#8217;s definition</a>).  Especially now that we&#8217;ll have such rich data around our collective history.</p>
<blockquote><p>This is still a strong belief the three tenets of a folksonomy: 1) tag; 2) object being tagged; and 3) identity, are core to disambiguation of tag terms and provide for a rich understanding of the object being tagged.</p></blockquote>
<p>Another interesting note about this pilot &#8211; this is the first time we&#8217;ve seen a distinction of &#8216;no known copyright&#8217;:</p>
<blockquote><p>Can anyone use &#8220;no known copyright restrictions?&#8221;<br />
For the time being on Flickr this new usage is being contained to the Library of Congress account. If the pilot works &#8211; or, when it works! &#8211; we&#8217;ll look to allow other interested cultural institutions the opportunity to extend the application of &#8220;no known restrictions&#8221; to their catalogues.</p></blockquote>
<p>Hooray, Library of Congress + Flickr!</p>
]]></content:encoded>
			<wfw:commentRss>http://weblog.terrellrussell.com/2008/01/flickr-commons-adds-tags-to-library-of-congress-images/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Clean and store your raw tags like Flickr</title>
		<link>http://weblog.terrellrussell.com/2007/06/clean-and-store-your-raw-tags-like-flickr/</link>
		<comments>http://weblog.terrellrussell.com/2007/06/clean-and-store-your-raw-tags-like-flickr/#comments</comments>
		<pubDate>Tue, 19 Jun 2007 04:07:16 +0000</pubDate>
		<dc:creator>Terrell Russell</dc:creator>
				<category><![CDATA[Default]]></category>
		<category><![CDATA[flickr]]></category>
		<category><![CDATA[SocialTagging]]></category>

		<guid isPermaLink="false">http://weblog.terrellrussell.com/2007/06/clean-and-store-your-raw-tags-like-flickr/</guid>
		<description><![CDATA[Today I was working through how a new application would be handling tags and realized that I strongly believe Flickr has the most robust method of storing and querying tags. I think they do it well and wanted to copy their lead. The main reason I feel they&#8217;ve got the best system is how they [...]]]></description>
			<content:encoded><![CDATA[<p>Today I was working through how a new application would be handling tags and realized that I strongly believe Flickr has the most robust method of storing and querying tags.  I think they do it well and wanted to copy their lead.</p>
<p>The main reason I feel they&#8217;ve got the best system is how they handle their &#8216;raw&#8217; tags and their &#8216;clean&#8217; tags.</p>
<p>When a photo is tagged at Flickr, the tag itself is saved in two different formats &#8211; raw and clean.   If you were to tag a photo with &#8220;St. Patrick&#8217;s Day&#8221;, that&#8217;s what would remain in your list of tags, visible on screen.  But how Flickr encodes and cleans up that tag results in &#8220;stpatricksday&#8221;.  This is a subtle, but powerful model.</p>
<p>It keeps the original tagger happy (&#8220;I know how I want to tag things, darn it&#8221;) and it makes the tags more functional in terms of finding things later (both for the tagger, and everyone else).  The clean tag is what is used in URLs, in the tagclouds, and wherever aggregation is important for statistics.  The tags &#8220;N.Y.C.&#8221; and &#8220;NYC&#8221; and &#8220;nyc&#8221; are all &#8216;cleaned&#8217; down to the same thing (&#8220;nyc&#8221;) so when a query comes in for nyc, all three original photos would be presented in the results.</p>
<p>I wanted that cleaning function for myself.  I looked everywhere today, and couldn&#8217;t find it detailed in any one place any more than <a href="http://www.flickr.com/services/api/misc.tags.html">on the Flickr API pages themselves</a>.</p>
<blockquote><p> raw:   The &#8216;raw&#8217; version of the tag &#8211; as entered by the user. This version can contain spaces and punctuation.</p>
<p>tag-body: The &#8216;clean&#8217; version of the tag &#8211; as processed by Flickr. This version is used for constructing urls.</p></blockquote>
<p><strong>Flickr-Like Tag Cleaning Regular Expression</strong></p>
<p>Please let me know if you find any errors, or if this gets out of date.  I&#8217;ll try to keep it current over time.</p>
<p>The function can be described fairly simply &#8211; &#8220;remove spaces and punctuation, then lowercase&#8221;.  I wrote a one line ruby regular expression to do this:</p>
<pre>clean_tag = raw_tag.gsub(/[\\s"!@#\\$\\%^&amp;*():\\-_+=\\'\\/.;`&lt;&gt;\\[\\]?\\\\]/,"").downcase</pre>
<p>The rest of my test code is included below.  As far as I could test today, these are the &#8216;punctuation&#8217; that Flickr is scrubbing from your raw tags:</p>
<pre>#!/usr/local/bin/ruby -w

require 'cgi'

# removes whitespace
# downcases A-Z
# removes 27 different punctuation characters
  # quotation marks
  # exclamation point
  # at symbol
  # pound sign
  # dollar sign
  # percent sign
  # carat
  # ampersand
  # asterisk
  # open parenthesis
  # close parenthesis
  # colon
  # hyphen
  # underscore
  # plus sign
  # equals sign
  # apostrophe
  # forward slash
  # period
  # semicolon
  # backtick
  # open angle bracket
  # close angle bracket
  # open square bracket
  # close square bracket
  # question mark
  # backslash
# does not affect other characters (you should safely CGI.escape these)
  # curly brackets
  # tilda
  # pipe
  # british pound
  # euro symbol
  # chinese characters
def clean_tag(raw_tag)
  clean_tag = raw_tag.gsub(/[\\s"!@#\\$\\%^&amp;*():\\-_+=\\'\\/.;`&lt;&gt;\\[\\]?\\\\]/,"").downcase
end

tags = [
  # should remove the offending characters
  "\\"double\\" quotes",          # quotation marks                     doublequotes
  "!excited!iam!",              # exclamation point                   excitediam
  "test@example.com",           # at symbol                           testexamplecom
  "pound#it",                   # pound sign                          poundit
  "$ave on everyThing",         # dollar sign                         aveoneverything
  "i feel 30% better",          # percent sign                        ifeel30better
  "carats^aretasty",            # carat                               caratsaretasty
  "and&#038;this&#038;and&#038;that",          # ampersand                           andthisandthat
  "maris*61",                   # asterisk                            maris61
  "i think (maybe)",            # open and close parentheses          ithinkmaybe
  "F:ooBar",                    # colon                               foobar
  "hyphen-ated",                # hyphen                              hyphenated
  "under_my_score",             # underscore                          undermyscore
  "1+1=2",                      # plus and equals                     112
  "Saint Patrick's Day",        # apostrophe                          saintpatricksday
  "/leaning/forward/ish",       # forward slash                       leaningforwardish
  "Mrs. Jones",                 # period                              mrsjones
  "semi;automatic;parsing",     # semicolon                           semiautomaticparsing
  "back`tick`here",             # backtick                            backtickhere
  "open&lt;and&gt;close",             # open and close angle brackets       openandclose
  "don't[be]square",            # open and close square brackets      dontbesquare
  "you?sure",                   # question mark                       yousure
  "back\\\\slash",                # backslash                           backslash
  # should only encode the rest of these
  "crab|vs|pipe",               # pipe                                crab%7Cvs%7Cpipe
  "東京",                         # chinese characters                  %E6%9D%B1%E4%BA%AC
  "£",                          # british pound                       %C2%A3
  "nice {curly} brackets",      # curly brackets                      nice%7Bcurly%7Dbrackets
  "Mötley Crüe",                # umlauts                             m%C3%B6tleycr%C3%BCe
  "Tōkyō"                       # long o                              t%C5%8Dky%C5%8D
].each do |t|
  print t
  print "\\n\\tcleaned  --&gt;  "
  print clean_tag(t)
  print "\\n\\tescaped  --&gt;  "
  print CGI.escape(clean_tag(t))
  print "\\n"
end
</pre>
]]></content:encoded>
			<wfw:commentRss>http://weblog.terrellrussell.com/2007/06/clean-and-store-your-raw-tags-like-flickr/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

