<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-19005369</id><updated>2011-07-14T17:42:00.236-07:00</updated><title type='text'>Improving Web-based Image Search Via Content Based Clustering</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>21</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-19005369.post-114685814645023312</id><published>2006-05-05T12:37:00.000-07:00</published><updated>2006-05-05T12:42:26.466-07:00</updated><title type='text'>SLAM 2006 in association with CVPR!</title><content type='html'>We submitted a paper to a workshop called SLAM in association with CVPR, and got accepted.  If you would like to read the paper, click &lt;a href="http://www.cs.ucsd.edu/%7Esjb/slam06.pdf"&gt;here&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;We are currently not working on this project due to heavy course load and other factors.  Stay tuned for more info in the future!  This was definitely a fun project.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114685814645023312?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114685814645023312/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114685814645023312' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114685814645023312'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114685814645023312'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/05/slam-2006-in-association-with-cvpr.html' title='SLAM 2006 in association with CVPR!'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114247471942074934</id><published>2006-03-15T17:56:00.000-08:00</published><updated>2006-03-15T18:05:26.110-08:00</updated><title type='text'></title><content type='html'>We are currently in the process of writing up a paper on our findings.  Until then, here are two things worth checking out:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/7761/1872/1600/flow2.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/7761/1872/200/flow2.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;A new &amp;amp; improved flow chart of our algorithm (click to see bigger version)&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Also, &lt;a href="ieng9.ucsd.edu/%7Ebbabenko/presentation.ppt"&gt;click here&lt;/a&gt; to view our powerpoint slides from the talk we gave in CSE 190A. It includes more examples of our results. We will post a final draft of our report soon.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114247471942074934?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114247471942074934/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114247471942074934' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114247471942074934'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114247471942074934'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/03/we-are-currently-in-process-of-writing.html' title=''/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114160595068646167</id><published>2006-03-05T16:42:00.000-08:00</published><updated>2006-03-05T16:45:50.700-08:00</updated><title type='text'>Finding objects in unlabelled images</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/6919/1299/1600/imageblob00004.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/6919/1299/320/imageblob00004.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/6919/1299/1600/image_00004.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/6919/1299/320/image_00004.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;This was the top rank picture on the query "orange" for a set of 10 unlabelled photos on my computer.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114160595068646167?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114160595068646167/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114160595068646167' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114160595068646167'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114160595068646167'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/03/finding-objects-in-unlabelled-images.html' title='Finding objects in unlabelled images'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114160412925838556</id><published>2006-03-05T16:13:00.000-08:00</published><updated>2006-03-05T16:15:29.273-08:00</updated><title type='text'></title><content type='html'>Here are some more examples (in a new and improved layout):&lt;br /&gt;&lt;br /&gt;&lt;a href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/examples/cokecan.xml"&gt;Coke can&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/examples/jersey.xml"&gt;Lakers Jersey&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114160412925838556?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114160412925838556/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114160412925838556' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114160412925838556'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114160412925838556'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/03/here-are-some-more-examples-in-new-and.html' title=''/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114089356182015487</id><published>2006-02-25T10:47:00.000-08:00</published><updated>2006-02-25T14:00:05.196-08:00</updated><title type='text'>Interesting day at Eureka</title><content type='html'>Here are some pictures from the "konference."  We got one of the top 3 "best poster" awards (&lt;a href="http://www.jacobsschool.ucsd.edu/events/eureka/"&gt;http://www.jacobsschool.ucsd.edu/events/eureka/&lt;/a&gt;)!&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/6919/1299/1600/IMG_0821.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/6919/1299/320/IMG_0821.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/6919/1299/1600/IMG_0812.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/6919/1299/320/IMG_0812.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/6919/1299/1600/IMG_0809.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/6919/1299/320/IMG_0809.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/6919/1299/1600/IMG_0808.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/6919/1299/320/IMG_0808.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114089356182015487?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114089356182015487/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114089356182015487' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114089356182015487'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114089356182015487'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/interesting-day-at-eureka.html' title='Interesting day at Eureka'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114072963885049697</id><published>2006-02-23T13:18:00.000-08:00</published><updated>2006-02-23T13:20:38.863-08:00</updated><title type='text'>Strawberries</title><content type='html'>In the past we were doing the re-ranking step on only 100 images.  We realized that if we include more images, then there will be more correct images at the top of the list, so we tried the query "strawberries" and performed the re-ranking on 500 images.  The results look great!  &lt;a href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/strawberries/output.xml"&gt;Click here&lt;/a&gt; to see the output.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114072963885049697?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114072963885049697/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114072963885049697' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114072963885049697'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114072963885049697'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/strawberries.html' title='Strawberries'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114058028494533903</id><published>2006-02-21T19:45:00.000-08:00</published><updated>2006-02-21T19:52:10.140-08:00</updated><title type='text'>Eureka!</title><content type='html'>This Friday we will be presenting a poster at the 7th annual Eureka (Engineering Undergrad Research "K"onnference &amp;amp; Assembly). You can check out a list of other presenters and their title/abstracts at &lt;a href="http://www.jacobsschool.ucsd.edu/events/eureka/"&gt;http://www.jacobsschool.ucsd.edu/events/eureka/&lt;/a&gt; . Please come and support us and look very impressed when we talk to you :)&lt;br /&gt;&lt;br /&gt;Here is a link to the poster: &lt;a href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/eureka%20poster.ppt"&gt;http://ieng9.ucsd.edu/~bbabenko/joogle/eureka%20poster.ppt&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114058028494533903?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114058028494533903/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114058028494533903' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114058028494533903'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114058028494533903'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/eureka.html' title='Eureka!'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114020848663372770</id><published>2006-02-17T10:45:00.000-08:00</published><updated>2006-02-17T12:34:47.416-08:00</updated><title type='text'>L A B histograms</title><content type='html'>The past few days I was spending trying to understand how LAB color histograms were constructed in Blobworld.  I think I have a good grasp on how the histograms get constructed and I was able to implement LAB histograms in our code.  The results of LAB are almost exactly what we wanted.  Visually similar blobs are closer in distance than visually dissimilar blobs.  Of course there are still some strange cases we need to look into, but for the most part the results are much better than other color spaces we used.  The strange thing is that it's hard to get the main cluster when using meanshift with LAB histograms.  Maybe we haven't gotten the parameters right for this colorspace yet, but it seems like it either gets a very small cluster that is good, or a large cluster that is very bad, depending on the parameters.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114020848663372770?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114020848663372770/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114020848663372770' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114020848663372770'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114020848663372770'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/l-b-histograms.html' title='L A B histograms'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-114006437544811630</id><published>2006-02-15T20:25:00.000-08:00</published><updated>2006-02-15T20:32:55.476-08:00</updated><title type='text'>More flexible histograms</title><content type='html'>One shortcoming with histograms is in the case when the blobs you compare are extremely flat in texture.  Here is an example of the problem:&lt;br /&gt;&lt;br /&gt;Let's say there are three colored blobs, dark red, blue, and medium red.  Also, let's assume 100 pixels and 3 bins per color channel.  The histograms would look like this (in RGB):&lt;br /&gt;dark red: (&lt;span style="color: rgb(204, 0, 0);"&gt;0,1,0&lt;/span&gt;,&lt;span style="color: rgb(51, 204, 0);"&gt;0,0,0&lt;/span&gt;,&lt;span style="color: rgb(0, 0, 153);"&gt;0,0,0&lt;/span&gt;)&lt;br /&gt;medium red:  (&lt;span style="color: rgb(204, 0, 0);"&gt;0,0,1&lt;/span&gt;,&lt;span style="color: rgb(51, 204, 0);"&gt;0,0,0&lt;/span&gt;,&lt;span style="color: rgb(0, 0, 153);"&gt;0,0,0&lt;/span&gt;)&lt;br /&gt;blue:  (&lt;span style="color: rgb(204, 0, 0);"&gt;0,0,0&lt;/span&gt;,&lt;span style="color: rgb(51, 204, 0);"&gt;0,0,0&lt;/span&gt;,&lt;span style="color: rgb(0, 0, 153);"&gt;0,0,1&lt;/span&gt;)&lt;br /&gt;&lt;br /&gt;Clearly the distance between any two histograms is equal.  This is an undesired effect since we would hope that medium red would be much closer to dark red, than blue.  To make things worse, the problem is not resolved by switching colorspaces to hsv or lab - if the blobs have uniform color, the problem is still present.  We are currently working to fix this by convolving all the histograms with a gaussian filter.  Unfortunately this will introduce a new parameter (the sigma of the filter) which will have to be tweaked.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-114006437544811630?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/114006437544811630/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=114006437544811630' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114006437544811630'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/114006437544811630'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/more-flexible-histograms.html' title='More flexible histograms'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113926436111372139</id><published>2006-02-06T13:51:00.000-08:00</published><updated>2006-02-06T14:19:21.133-08:00</updated><title type='text'>Nice Diagram and info on the Voted Perceptron</title><content type='html'>Here is a diagram of our program, to make things a little bit more clear.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/7761/1872/1600/diagram.png"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/7761/1872/320/diagram.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Also, to those who are not familiar with the Voted Perceptron algorithm, here is a snapshot of the pseudocode from the original paper (Freund, Y. &amp; Schapire, R. (1999). Large Margin Classification using the Perceptron Algorithm. In Machine Learning, 37(3), 277–296.)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://photos1.blogger.com/blogger/7761/1872/1600/voted_perc_alg.png"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://photos1.blogger.com/blogger/7761/1872/320/voted_perc_alg.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113926436111372139?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113926436111372139/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113926436111372139' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113926436111372139'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113926436111372139'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/nice-diagram-and-info-on-voted.html' title='Nice Diagram and info on the Voted Perceptron'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113926410514093743</id><published>2006-02-06T13:48:00.000-08:00</published><updated>2006-02-06T14:15:05.913-08:00</updated><title type='text'>Classifiers</title><content type='html'>Our goal initially was to create a small training set from the first 20 results using the meanshift algorithm, and then to create a classifier that goes through the rest of the pages and collects "similar" images.  We came upon a major problem.  Our training set consisted only of positive examples.  In order to get negative examples we used the first 200 images from the query "things."  We then ran a simple k nearest neighbor classifier to classify each new blob as being either belonging to the class (e.g. "oranges") or not belonging.  Our results were not impressive, knn only picked up about three more images at most and this was not good enough.&lt;br /&gt;&lt;br /&gt;We also tried running the voted perceptron algorithm on the training data and got mediocre results as well.&lt;br /&gt;&lt;br /&gt;After discussing our attempts with Serge, we realized that we may be approaching this the wrong way.  We were trying classify images as either belonging to the set, or not belonging to the set.  Perhaps instead, we can provide a simple way to "re-rank" the image results. &lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Plans for this week.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;First, let me define the sets&lt;br /&gt;Let A = the first X images retrieved from an image search (X=20 in our case)&lt;br /&gt;Let B = the X+1 .... N images retrieved from an image search (We plan on getting 100 or so more images)&lt;br /&gt;&lt;br /&gt;After we get the biggest cluster from set A using meanshift, we can compute the distance from each blob in set B to the mean of the biggest cluster.  Now we can sort the blobs in set B by distance and re rank the corresponding images based on the order of the blobs.&lt;br /&gt;&lt;br /&gt;For our distance function we will use euclidian distance, and chi-squared, and compare the two methods. &lt;br /&gt;&lt;br /&gt;We are also planning to add the hack that takes blobs with white color on the edges out of consideration.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113926410514093743?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113926410514093743/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113926410514093743' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113926410514093743'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113926410514093743'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/classifiers.html' title='Classifiers'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113895160471985873</id><published>2006-02-02T23:24:00.000-08:00</published><updated>2006-02-02T23:26:44.733-08:00</updated><title type='text'>Progress</title><content type='html'>We have made progress on training a classifier, and we'll talk about this in the next post.  Here are some notes we have on our progress.&lt;br /&gt;&lt;br /&gt;compare rgb with hsv histogram performance&lt;br /&gt;ask serge about:&lt;br /&gt;    best classifier for our data points.  (the main problem is that we don't have negative examples, only positive ones).&lt;br /&gt;    should we use chi-squared in our knn function?&lt;br /&gt;    block segmentation? (Boris is really into trying this)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113895160471985873?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113895160471985873/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113895160471985873' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113895160471985873'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113895160471985873'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/02/progress.html' title='Progress'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113813748011962751</id><published>2006-01-24T13:17:00.000-08:00</published><updated>2006-01-24T13:34:52.316-08:00</updated><title type='text'>More Screenshots</title><content type='html'>After fixing our histogram bugs, our results are starting to look a lot more promising.. here are some examples we tried:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;"oranges"&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/oranges1.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/oranges1.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/oranges2.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/oranges2.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;"us flag"&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/usflag1.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/usflag1.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/usflag2.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/usflag2.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Then there were some examples, where not only the largest cluster of blobs looked significant, but also the second largest.. we'll have to figure out how to deal with that..&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/banana1.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/banana1.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/banana2.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/banana2.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/banana3.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/banana3.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;and finally...&lt;br /&gt;&lt;span style="color: rgb(204, 0, 0);font-size:130%;" &gt;&lt;span style="font-size:180%;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span style="font-weight: bold;"&gt;&lt;span style="font-size:180%;"&gt;WATCH OUT ROBART III, WE'RE HOT ON YOUR TRAIL!!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/sprite1.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/sprite1.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/sprite2.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/sprite2.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/sprite3.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/sprite3.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/robart.jpg"&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/robart.jpg" alt="" border="0" width="500" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113813748011962751?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113813748011962751/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113813748011962751' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113813748011962751'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113813748011962751'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/01/more-screenshots.html' title='More Screenshots'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113813758371118595</id><published>2006-01-24T12:57:00.000-08:00</published><updated>2006-01-26T11:16:26.733-08:00</updated><title type='text'>Issues/ Ideas</title><content type='html'>Here are some issues were currently dealing with:&lt;br /&gt;&lt;br /&gt;- black and white images don't work - bug in code&lt;br /&gt;-white blobs touching edges should be excluded from being "significant clusters"&lt;br /&gt;-talk to serge about how to add features such as texture and shape&lt;br /&gt;-lab histograms, read blobworld paper&lt;br /&gt;-playing with minimum blob area to include in clustering- seems pretty sensitive.  We will post examples soon.&lt;br /&gt;-in calculating the biggest cluster, count the number of IMAGES in the cluster, not the number of blobs.  Multiple blobs may be present from the same image.  We want the cluster with the most images, not blobs&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113813758371118595?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113813758371118595/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113813758371118595' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113813758371118595'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113813758371118595'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/01/issues-ideas.html' title='Issues/ Ideas'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113797528883542826</id><published>2006-01-22T16:10:00.000-08:00</published><updated>2006-01-23T14:08:09.466-08:00</updated><title type='text'>Debugging</title><content type='html'>We re-traced our steps to check our code. To get a sanity check we created a small toy example with red blobs, and that seemed to work well:&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/toy_example.jpg" alt="" border="0" width="400" /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/toy_example2.jpg" alt="" border="0" width="400" /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Next we used a "united states flag" example, and just did average L*a*b for each blob, so the blob space had 3 dimensions. The results looked good, here is the biggest blob cluster:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/usflag_average_lab.jpg" width="500" /&gt;&lt;br /&gt;But when we looked at all the blobs in the 3-D blob space, it looked kind of peculiar and we are not sure if it is correct:&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/LAB_graph.jpg" width="800" /&gt;&lt;br /&gt;&lt;br /&gt;For some reason the points always look like they are in a straight line... which does not really make sense.&lt;br /&gt;&lt;br /&gt;Next we tried to debug our histogram code. We had a huge a bug in our code where we were building the histograms. We wanted to bring this up becuase it's something that other people might get caught on. It has to do with the "hist" function in Matlab... We were calling it like this:&lt;br /&gt;&lt;br /&gt;hist(points, 15) - to put things into 15 bins&lt;br /&gt;&lt;br /&gt;However, this does not define the start and end bins of the histogram so it centers the whole thing. This is very bad! Instead you should do:&lt;br /&gt;&lt;br /&gt;hist(points, [1:15]).&lt;br /&gt;&lt;br /&gt;Nevertheless, we are still having problems with the histograms that we don't really get. Here is an example of a picture and the histogram that our code computes.. we are confused as to whether this is correct or not:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/hist5.jpg" jpg="" width="300" /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle/joogle/hist6.jpg" jpg="" width="700" /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113797528883542826?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113797528883542826/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113797528883542826' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113797528883542826'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113797528883542826'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/01/debugging.html' title='Debugging'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113753972925909481</id><published>2006-01-17T15:12:00.000-08:00</published><updated>2006-01-17T15:15:29.270-08:00</updated><title type='text'>Color spaces and histograms</title><content type='html'>We are playing with the mean shift algorithm and different color spaces / historgrams to cluster groups of images.  Our main focus is simple objects such as bananas and strawberries.  We have tried clustering using rgb averages, rgb histograms, hsv averages, hsv histograms, and l*a*b* averages.  Our next step is to try l*a*b* histograms but we need to understand it before we go on.  We plan on reading up on these color spaces and then trying to cluster with lab histograms.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113753972925909481?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113753972925909481/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113753972925909481' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113753972925909481'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113753972925909481'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/01/color-spaces-and-histograms.html' title='Color spaces and histograms'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113684191228684223</id><published>2006-01-09T13:18:00.000-08:00</published><updated>2006-01-09T13:42:29.300-08:00</updated><title type='text'></title><content type='html'>Some more screenshots:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle7.jpg" alt="" border="0" /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle8.jpg" alt="" border="0" /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113684191228684223?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113684191228684223/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113684191228684223' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113684191228684223'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113684191228684223'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/01/some-more-screenshots.html' title=''/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113684208096649481</id><published>2006-01-09T13:02:00.000-08:00</published><updated>2006-01-09T13:28:00.980-08:00</updated><title type='text'>Retrieving and segmenting images</title><content type='html'>So after looking at the available tools, we have decided to use Yahoo! image search api to retrieve our images (Google has no image search api).  After retrieving the images, we need to segment them into "blobs."  We will be using &lt;a href="http://people.cs.uchicago.edu/%7Epff/"&gt;Pedro Felzenszwalb's&lt;/a&gt; image segmenting program to do this.&lt;br /&gt;&lt;br /&gt;Since the segment program takes input files in ppm format, we need to convert the jpg files retrieved from yahoo to ppm format.  Matlab makes that easy by just letting you specify what format to read in an image using imread, and what format to output a new image using imwrite.&lt;br /&gt;&lt;br /&gt;So the process will be like this:&lt;br /&gt;&lt;br /&gt;-The input query is sent to an image search engine&lt;br /&gt;-First several pages of thumbnail images are downloaded to a folder&lt;br /&gt;-Convert all jpg images to ppm format&lt;br /&gt;-Run the segment program on every converted image&lt;br /&gt;-Separate into "blobs" and cluster&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113684208096649481?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113684208096649481/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113684208096649481' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113684208096649481'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113684208096649481'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2006/01/retrieving-and-segmenting-images.html' title='Retrieving and segmenting images'/><author><name>Naydav</name><uri>http://www.blogger.com/profile/00782594825927856590</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113498042776230377</id><published>2005-12-19T00:11:00.000-08:00</published><updated>2005-12-19T00:20:27.776-08:00</updated><title type='text'></title><content type='html'>While Nadav has been studying for the GREs, I couldn't help my curiosity and did some preliminary testing of our idea.  I used &lt;a href="http://vision.ucsd.edu/%7Epdollar/toolbox/doc/index.html"&gt;Piotr Dollar's&lt;/a&gt; Matlab toolbox (which has a working implementation of the meanshift clustering algorithm). The first thing I did was split each image into blocks (rather than actually segmenting it into meaningful blobs) and clustered these blobs using 3 features - the average RGB. I then took the largest cluster and found the original images these blobs belonged to. Ironically, the very first example I tried worked very well.. unfortunately that wasn't the case for most of the other examples, but here are two that worked well:&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle.jpg" /&gt;&lt;br /&gt;&lt;br /&gt;Here is the biggest cluster of blocks:&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle2.jpg" /&gt;&lt;br /&gt;&lt;br /&gt;And another example:&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle3.jpg" /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle4.jpg" /&gt;&lt;br /&gt;&lt;br /&gt;Lastly, I tried segmenting the images in a slightly less naive way than into square blocks - I used the built in kmeans function and used only color to segment. Here is a cool example of a banana:&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle5.jpg" /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="http://ieng9.ucsd.edu/%7Ebbabenko/joogle6.jpg" /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113498042776230377?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113498042776230377/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113498042776230377' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113498042776230377'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113498042776230377'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2005/12/while-nadav-has-been-studying-for-gres.html' title=''/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113381593365641391</id><published>2005-12-05T12:46:00.000-08:00</published><updated>2005-12-05T12:52:14.433-08:00</updated><title type='text'>Starting Out</title><content type='html'>To get a start on the project I have been concentrating on learning two important tools - LATeX and Matlab:&lt;br /&gt;1) LATeX is a typesetting system that most researchers in computer science use to write papers.  It seems to be very useful especially when writing various mathy things.  I had trouble finding an interface for windows, but a friend recommended MikTex and WinEDT, so I have installed the two and started to play around with them. &lt;br /&gt;2) My first real exposure to Matlab was in CSE 166 this quarter (Math 20 had assignments in Matlab, but they were very far from programming.  I also borrowed a book from Serge called "Matlab for Engineers" by Chapman, to start looking over more involved Matlab features such as cell arrays and structs (since I'm quite used to Object Oriented design).&lt;br /&gt;&lt;br /&gt;That is all for now.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113381593365641391?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113381593365641391/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113381593365641391' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113381593365641391'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113381593365641391'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2005/12/starting-out.html' title='Starting Out'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19005369.post-113329994299555817</id><published>2005-11-29T13:30:00.000-08:00</published><updated>2005-11-29T13:32:22.996-08:00</updated><title type='text'>First Post</title><content type='html'>Welcome to our 'CSE 190: Topics in Vision and Learning' blog for Winter 2006.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19005369-113329994299555817?l=joogle5000.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://joogle5000.blogspot.com/feeds/113329994299555817/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19005369&amp;postID=113329994299555817' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113329994299555817'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19005369/posts/default/113329994299555817'/><link rel='alternate' type='text/html' href='http://joogle5000.blogspot.com/2005/11/first-post_29.html' title='First Post'/><author><name>Boris</name><uri>http://www.blogger.com/profile/01177828772052167022</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry></feed>
