<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-4220755233477842379</id><updated>2012-02-12T12:25:14.400-08:00</updated><category term='semantic'/><category term='data mining'/><category term='admin'/><category term='web'/><category term='php'/><category term='security'/><category term='perl'/><category term='3com'/><category term='shopping'/><category term='search engine'/><category term='Cisco'/><category term='recherche'/><category term='big data'/><category term='resume'/><category term='moteur de recherche'/><category term='nlp'/><category term='jquery'/><category term='Wikipedia'/><category term='job'/><category term='programmation'/><category term='réseau'/><category term='python'/><category term='Linux'/><category term='neo4j'/><category term='natural language'/><category term='script'/><category term='windows'/><category term='shop'/><category term='semantics'/><category term='web analysis'/><category term='intranet'/><category term='extjs'/><category term='pentest'/><title type='text'>Christophe Boudet</title><subtitle type='html'>Computer Science Engineer, searching for a job</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>8</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-2130438259762768367</id><published>2012-02-08T16:36:00.000-08:00</published><updated>2012-02-08T16:58:48.350-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='web analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='nlp'/><category scheme='http://www.blogger.com/atom/ns#' term='semantics'/><category scheme='http://www.blogger.com/atom/ns#' term='natural language'/><category scheme='http://www.blogger.com/atom/ns#' term='shopping'/><category scheme='http://www.blogger.com/atom/ns#' term='data mining'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic'/><category scheme='http://www.blogger.com/atom/ns#' term='python'/><category scheme='http://www.blogger.com/atom/ns#' term='big data'/><category scheme='http://www.blogger.com/atom/ns#' term='shop'/><category scheme='http://www.blogger.com/atom/ns#' term='web'/><category scheme='http://www.blogger.com/atom/ns#' term='search engine'/><title type='text'>Futur of online shopping, natural language shopping</title><content type='html'>&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;When you bought a product such as TV on internet, you spend lot of time to choose the good model, you need to search the perfect one which owns all your requirements, and on your budget.&lt;br /&gt;So you will select a range of prices, a size and then read the specifications of each models.&lt;br /&gt;That's a really waste of time, reading specifications of tens product. Will you spend spend 30 minutes to choose a TV ? and for a radio at 30$ ?&lt;br /&gt;Not really, but you have not choice.&lt;br /&gt;&lt;br /&gt;But not for long, imagine a world where you will just ask what you want, what you expect, and get the perfect products directly. Imagine the Internet simpler&lt;br /&gt;&lt;br /&gt;Futur of search engine pass by NLP and semantic shopping&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Examples&lt;/span&gt; :&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-4aH_DSy2DK8/TzMPjwEzppI/AAAAAAAAAb8/xh_eacteDwA/s1600/Image+32.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="316" src="http://2.bp.blogspot.com/-4aH_DSy2DK8/TzMPjwEzppI/AAAAAAAAAb8/xh_eacteDwA/s640/Image+32.png" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Maybe you have an idea more accurate of your need ?&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-LH84vrCyn7s/TzMQT-8HqzI/AAAAAAAAAcE/TPbT22G2NrM/s1600/Image+28.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="318" src="http://4.bp.blogspot.com/-LH84vrCyn7s/TzMQT-8HqzI/AAAAAAAAAcE/TPbT22G2NrM/s640/Image+28.png" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Or maybe you're a professional and need something really specific ?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-r585-AGk3aU/TzMQVvePdJI/AAAAAAAAAcM/1brX8oGDr0k/s1600/Image+30.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="316" src="http://4.bp.blogspot.com/-r585-AGk3aU/TzMQVvePdJI/AAAAAAAAAcM/1brX8oGDr0k/s640/Image+30.png" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;b&gt;Reviews&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Great you have a list of corresponding products, but how to choose the good one ? how to be sure of his quality ? of the sound perfection ?&lt;br /&gt;Rating score isn't adequate for these questions, it's perfect to perfom a first step of sorts, but not enough to decide which product to buy.&lt;br /&gt;Are you ready to read 50 reviews for each model ? are you interesting by the whole life of other buyer or just by the essential, the pros and the cons ?&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-Wmk-aspl3rA/TzMQWrRRFUI/AAAAAAAAAcU/_yFg5Jjx5RU/s1600/Image+31.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="362" src="http://3.bp.blogspot.com/-Wmk-aspl3rA/TzMQWrRRFUI/AAAAAAAAAcU/_yFg5Jjx5RU/s640/Image+31.png" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;With this search engine, you will be able to choose any products before buy it, without open hundred of pages and search specifications, reviews and more.&lt;br /&gt;Imagine this on a website of used cars, before you had to open a page for :&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;the specification of this model&lt;/li&gt;&lt;li&gt;test of the car&lt;/li&gt;&lt;li&gt;review of different owners&lt;/li&gt;&lt;/ol&gt;And that for each cars on the site. You could save time to do more interesting things, and get better informations for a wide range of products. Just ask, read pros and cons and choose product in some minutes among millions&lt;br /&gt;&lt;br /&gt;Seach engines are made to help your life, and save your time.&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-2130438259762768367?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/2130438259762768367/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=2130438259762768367' title='1 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/2130438259762768367'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/2130438259762768367'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2012/02/futur-of-online-shopping.html' title='Futur of online shopping, natural language shopping'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-4aH_DSy2DK8/TzMPjwEzppI/AAAAAAAAAb8/xh_eacteDwA/s72-c/Image+32.png' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-3607726851300599625</id><published>2012-01-25T15:54:00.000-08:00</published><updated>2012-01-26T12:52:47.454-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='natural language'/><category scheme='http://www.blogger.com/atom/ns#' term='semantics'/><category scheme='http://www.blogger.com/atom/ns#' term='Wikipedia'/><category scheme='http://www.blogger.com/atom/ns#' term='recherche'/><category scheme='http://www.blogger.com/atom/ns#' term='web'/><category scheme='http://www.blogger.com/atom/ns#' term='search engine'/><category scheme='http://www.blogger.com/atom/ns#' term='nlp'/><category scheme='http://www.blogger.com/atom/ns#' term='web analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='neo4j'/><category scheme='http://www.blogger.com/atom/ns#' term='programmation'/><category scheme='http://www.blogger.com/atom/ns#' term='python'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic'/><category scheme='http://www.blogger.com/atom/ns#' term='moteur de recherche'/><title type='text'>Natural Language Search Engine</title><content type='html'>&lt;span style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: #333333; line-height: 19px;"&gt;Here are the evolutions of my &lt;/span&gt;&lt;span class="Apple-style-span" style="color: #333333; line-height: 19px;"&gt;&lt;a href="http://john-bouday.blogspot.com/2010/11/web-analysis.html"&gt;web analysis engine.&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: inherit; line-height: 19px;"&gt;The project aims to analyze web pages (Wikipedia for the moment) in providing a context understanding to the engine. In this way, the system is able to answer to human questions about web content, and to learn new facts by itself.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;Preview :&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-acu7tvO3Pqo/TyCK5ISNgJI/AAAAAAAAAbM/8E7EL-4Y15U/s1600/Image+14.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="316" src="http://3.bp.blogspot.com/-acu7tvO3Pqo/TyCK5ISNgJI/AAAAAAAAAbM/8E7EL-4Y15U/s640/Image+14.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;Question about Obama&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;We could find direct anwsers at the question "Who is Barack Obama" in blue titles, and the original sentences behind.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;New Architecture&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;For this version, I rewrote all the code in Python, and get a huge gain of performance for NLP (from 20s to less than 1s for big pages)&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif; line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-kWMyQBYmTeU/TNdK98pxGuI/AAAAAAAAAZs/hxQtoB3Tis4/s1600/schema.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="240" src="http://2.bp.blogspot.com/-kWMyQBYmTeU/TNdK98pxGuI/AAAAAAAAAZs/hxQtoB3Tis4/s320/schema.png" width="320" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;Architecture&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;I used queue to provide inter-communication between each module and easily scalable architecture, with &lt;a href="http://rabbitmq.com/"&gt;RabbitMQ&lt;/a&gt;. The new database is running under &lt;a href="http://neo4j.org/"&gt;Neo4J&lt;/a&gt;, and stores around 300 nodes for each article issued from Wikipedia. The web server is now &lt;a href="http://tornadoweb.org/"&gt;Tornado&lt;/a&gt;. All these modules and technologies made system impressively fast (for a laptop development platform). I will improve the system with the help of Memcache in some weeks.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;Examples :&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;We could ask simple questions about definition&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-JHtzuIdsPUk/TyCO8sD6PVI/AAAAAAAAAbU/JHxaNSDu4Pg/s1600/Image+21.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="270" src="http://2.bp.blogspot.com/-JHtzuIdsPUk/TyCO8sD6PVI/AAAAAAAAAbU/JHxaNSDu4Pg/s640/Image+21.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;Medical question&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-U9Gt7wU1Oo4/TyCQMUn54_I/AAAAAAAAAb0/s4HAkwY_yPI/s1600/Image+141.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="275" src="http://3.bp.blogspot.com/-U9Gt7wU1Oo4/TyCQMUn54_I/AAAAAAAAAb0/s4HAkwY_yPI/s640/Image+141.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;Physic question&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;About some facts&lt;br /&gt;&lt;br /&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-hwY7dRu2bkw/TyCPEjzqjcI/AAAAAAAAAbk/cPq9jXvKKPw/s1600/Image+25.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="308" src="http://4.bp.blogspot.com/-hwY7dRu2bkw/TyCPEjzqjcI/AAAAAAAAAbk/cPq9jXvKKPw/s640/Image+25.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;Geographie question&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;Or more precise questions&lt;br /&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-ZUY2Yt9HmMk/TyCPDJQOt_I/AAAAAAAAAbc/NHfO4LXA4mg/s1600/Image+23.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="292" src="http://3.bp.blogspot.com/-ZUY2Yt9HmMk/TyCPDJQOt_I/AAAAAAAAAbc/NHfO4LXA4mg/s640/Image+23.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;People question&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;And &amp;nbsp;really more specified &lt;br /&gt;&lt;br /&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-HeDFnMr4tNY/TyCPGdte5eI/AAAAAAAAAbs/IbdZs9FCvAU/s1600/Image+26.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="290" src="http://2.bp.blogspot.com/-HeDFnMr4tNY/TyCPGdte5eI/AAAAAAAAAbs/IbdZs9FCvAU/s640/Image+26.png" width="640" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;Complex question&lt;/span&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 19px;"&gt;&lt;b&gt;Futur steps&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span style="line-height: 19px;"&gt;First of all, I will give more and more articles to the crawler to populate the database with facts.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span style="line-height: 19px;"&gt;Next step will be to improve the question analyser in the goal to anwser at more involved questions.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span style="line-height: 19px;"&gt;Then bring more data, and semantic into the webpage with maps, pictures, video, automatic summary, and why not with interactive graphs, slides.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span style="line-height: 19px;"&gt;Integrate external API to get information with natural language about restaurant, films, music, books, travel etc&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span style="line-height: 19px;"&gt;Include more datas, issued from newspaper's website and semantic plateform.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span style="line-height: 19px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="color: #333333; font-family: 'Helvetica Neue Light', HelveticaNeue-Light, 'Helvetica Neue', Helvetica, Arial, sans-serif;"&gt;&lt;span style="line-height: 19px;"&gt;But, all this work recquires lot of time, and I don't have it for the moment, so it could be a little long before the next version.&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-3607726851300599625?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/3607726851300599625/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=3607726851300599625' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/3607726851300599625'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/3607726851300599625'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2012/01/natural-language-search-engine.html' title='Natural Language Search Engine'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-acu7tvO3Pqo/TyCK5ISNgJI/AAAAAAAAAbM/8E7EL-4Y15U/s72-c/Image+14.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-3588115582951703384</id><published>2011-06-12T18:11:00.000-07:00</published><updated>2011-06-12T18:11:11.670-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='pentest'/><category scheme='http://www.blogger.com/atom/ns#' term='security'/><title type='text'>Pentest</title><content type='html'>This is the report of one of my pentest.&lt;br /&gt;Sorry this is in french.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://dl.dropbox.com/u/1608518/Rapport-8.pdf" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/-iicgG1JzXvc/TfVgRLpwsoI/AAAAAAAAAbA/V1MnSWSMwtA/s1600/application_pdf.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-3588115582951703384?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/3588115582951703384/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=3588115582951703384' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/3588115582951703384'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/3588115582951703384'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2011/06/pentest.html' title='Pentest'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/-iicgG1JzXvc/TfVgRLpwsoI/AAAAAAAAAbA/V1MnSWSMwtA/s72-c/application_pdf.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-8260508434417545041</id><published>2010-11-17T14:23:00.000-08:00</published><updated>2010-11-17T14:23:05.613-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='web analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='semantics'/><category scheme='http://www.blogger.com/atom/ns#' term='perl'/><title type='text'>Web Analysis</title><content type='html'>&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The project aims to analyze web pages in providing a context understanding to the engine. In this way, the system is able to answer human questions about web content.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The project is composed of many modules, or bricks, each one has a precise role, and could easily be scalable on several servers, and is in Perl or in C.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/TNoTO6LDGLI/AAAAAAAAAZ8/jBk1nUz3Ids/s1600/schema.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="240" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/TNoTO6LDGLI/AAAAAAAAAZ8/jBk1nUz3Ids/s320/schema.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The crawler&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The crawler in Perl is able to explore Internet and download the content of pages. It's a simple part of the project for the moment, just designed to provide content for other bricks.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/TNdORyVZBPI/AAAAAAAAAZw/r58_r0Rldyc/s1600/crawler.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="240" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/TNdORyVZBPI/AAAAAAAAAZw/r58_r0Rldyc/s320/crawler.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The Scrapper&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;This brick code in Perl too, split the content&amp;nbsp;automatically extracted from the crawler,&amp;nbsp;without template.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_G2MDq1c2FCo/TNdQtFRsx1I/AAAAAAAAAZ0/iOxSFXPM1-Q/s1600/scrapper.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="240" src="http://3.bp.blogspot.com/_G2MDq1c2FCo/TNdQtFRsx1I/AAAAAAAAAZ0/iOxSFXPM1-Q/s320/scrapper.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;It could be use on different types as :&lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;blog&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;forum&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;personal site&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;corporate&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;...&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Thanks to this module, the system is focusing on useful part on the page, without menus, ads, disclaimers and other.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;We could find each articles on a blog or forum, comments for blog ecommerce, and products for shopping site.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Then the system analyze each article in order to extract :&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;title&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;date of publication&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;user&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;number of comments&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;links&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;keyword&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;categories&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;images&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;text&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;So we could estimate the delay before a new article and adjust our crawler period before a new visit on this site, and find users who influence the community.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Thus for our engine pages becomes articles. If we use ranking algorithms (Hits or Pagerank) on articles and not on pages, we will obtain better pertinence of portion of document. In the case of blog or personal web site, &amp;nbsp;long thread from forum, it could be interesting to obtain accurate ranking.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Example :&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;page view by a web browser&lt;/span&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jH32nyg4I/AAAAAAAAAXE/pT9GwabXWU0/s1600/2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;img border="0" height="320" src="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jH32nyg4I/AAAAAAAAAXE/pT9GwabXWU0/s320/2.jpg" width="120" /&gt;&lt;/span&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;XML generated from the scrapper :&lt;/span&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;lt;page url='http://mostlylisa.com/'&amp;gt;&lt;br /&gt;&amp;lt;article&amp;gt;&lt;br /&gt;&amp;lt;titre&amp;gt;Mostly Macworld Keynote 2009&amp;lt;/titre&amp;gt;&lt;br /&gt;&amp;lt;comment&amp;gt;22&amp;lt;/comment&amp;gt;&lt;br /&gt;&amp;lt;publish&amp;gt;11 January 2009&amp;lt;/publish&amp;gt;&lt;br /&gt;&amp;lt;content&amp;gt;I apologize for my lousy blogging lately. Macworld has been insane for me. I was on my feet from 7am - 3am, running around the expo doing Macbreak interviews, being a guest on Macbreak Weekly, recording TWiP, and looting booths for schwag (the most important thing at MW), and attending a few shindigs.I plan on writing a detailed post on my reflections of Macworld and my top picks of the Expo in a few days. Before I give you my thoughts on the keynote, I’d like to hear yours.Were you disappointed with this year’s Macworld keynote?Like say the fact that they didn’t even mention Snow Leopard or release a new mini or iMac or, like announce something cool other than the ability to DRM-free your previously bought itunes music for $0.30 a pop? 30 x 14GB of music = I don’t know, you do the math.There is a super awesome prize for the person who makes the best comment. So breathe in and let it all out. Please don’t make Steve cry too much. Think about his hormone imbalance. Please.22 Comments ». Tagged in Apple, Geeky Stuff, Tech/Web, Travel, Videos&amp;lt;/content&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;macworld&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;keynote&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;macbreak&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;ability&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;announce&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;link href='http://mostlylisa.com/2009/01/mostly-macworld-keynote/'&amp;gt;Mostly Macworld Keynote 2009&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://www.pixelcorps.tv/macbreak173'&amp;gt;Macbreak interviews,&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://twit.tv/mbw122'&amp;gt;being a guest on Macbreak Weekly,&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://feeds.feedburner.com/mostlylisa/yKBd'&amp;gt;&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;cat&amp;gt;apple&amp;lt;/cat&amp;gt;&lt;br /&gt;&amp;lt;cat&amp;gt;geeky-stuff&amp;lt;/cat&amp;gt;&lt;br /&gt;&amp;lt;cat&amp;gt;techweb&amp;lt;/cat&amp;gt;&lt;br /&gt;&amp;lt;cat&amp;gt;travel&amp;lt;/cat&amp;gt;&lt;br /&gt;&amp;lt;cat&amp;gt;videos&amp;lt;/cat&amp;gt;&lt;br /&gt;&amp;lt;/article&amp;gt;&lt;br /&gt;&amp;lt;article&amp;gt;&lt;br /&gt;&amp;lt;titre&amp;gt;Mostly Macworld 2009&amp;lt;/titre&amp;gt;&lt;br /&gt;&amp;lt;comment&amp;gt;15&amp;lt;/comment&amp;gt;&lt;br /&gt;&amp;lt;publish&amp;gt;6 January 2009&amp;lt;/publish&amp;gt;&lt;br /&gt;&amp;lt;content&amp;gt;Photo by Scott Meizner’s slick Canon 5D Mark II.It’s just after midnight, the day before Macworld keynote ‘09. I can see the glow of the Moscone Center from my hotel room. I can’t quite see the line o’fanboys, but if I crane my neck a wee bit, I can see the twinkle of their MBP and a glint in their eyes. They miss Jobs. Ahh, don’t we all.For those of you not able to come to Macworld, I’ll be covering all of its geeky goodness with the MacBreak crew. So I want to ask you: What Macworld inside scoop would you like hear about? If you think of person, company, or Mac-related product you’d like to learn about, fire a comment here or @lisabettany on twitter or squint your eyes, distort the Space-Time continuum, and leave me a scroll somewhere near the Moscone Center. No guarantees that I’ll get it, but good effort, none-the-less.15 Comments ». Tagged in Apple, Geeky Stuff&amp;lt;/content&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;macworld&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;moscone&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;comment&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;company&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;key&amp;gt;continuum&amp;lt;/key&amp;gt;&lt;br /&gt;&amp;lt;img width='386'&amp;gt;http://farm4.static.flickr.com/3258/3169628089 ea8a1c0931.jpg&amp;lt;/img&amp;gt;&lt;br /&gt;&amp;lt;link href='http://mostlylisa.com/2009/01/mostly-macworld-2009/'&amp;gt;Mostly Macworld 2009&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://www.flickr.com/photos/redpilotmedia/3169628089/'&amp;gt;&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://flickr.com/photos/smeinzer/page4/'&amp;gt;Scott Meizner’s&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://www.pixelcorps.tv/macbreak'&amp;gt;MacBreak &amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://twitter.com/lisabettany'&amp;gt;@lisabettany&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;link href='http://feeds.feedburner.com/mostlylisa/yKBd'&amp;gt;&amp;lt;/link&amp;gt;&lt;br /&gt;&amp;lt;cat&amp;gt;apple&amp;lt;/cat&amp;gt;&lt;br /&gt;&amp;lt;cat&amp;gt;geeky-stuff&amp;lt;/cat&amp;gt;&lt;br /&gt;&amp;lt;/article&amp;gt;&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;....&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;lt;/page&amp;gt;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/i&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Textual analysis&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;This brick in Perl the most recent in the project. It aims to understand english sentences.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_G2MDq1c2FCo/TNoTXYmXD8I/AAAAAAAAAaA/ui9aPz-UT5I/s1600/textanalyse.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="158" src="http://3.bp.blogspot.com/_G2MDq1c2FCo/TNoTXYmXD8I/AAAAAAAAAaA/ui9aPz-UT5I/s320/textanalyse.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;First, it splits each article in sentences and send it to a C server to analyze grammar structure.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Then, it extracts the subject, the verb, the object of each sentence.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;After, it find date person and place by using Perl server.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Examples with&amp;nbsp;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Angelina Jolie's&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;page on&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;wikipedia&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Jolie was estranged from her father for many years&amp;nbsp;&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; subject : &amp;nbsp;Angelina Jolie&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; verb : was &amp;nbsp;estranged &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;[estrange] &amp;nbsp; &amp;nbsp; (past)&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;what : [Angelina Jolie] father&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;[Angelina Jolie] father&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;many years&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;i&gt;&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;i&gt;&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;i&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Jolie also portrayed Margret Legs Sadovsky , one of five teenage girls who &amp;nbsp;&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; subject : &amp;nbsp;Angelina Jolie&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; verb : &amp;nbsp; portrayed &amp;nbsp; &amp;nbsp; [portray] &amp;nbsp; &amp;nbsp; &amp;nbsp;(past)&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;what : Margret Legs&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;OF &amp;nbsp; &amp;nbsp; &amp;nbsp;five teenage girls&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;/i&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Jolie starred as Mariane Pearl in Michael Winterbottom 's documentary-style drama A Mighty Heart ( 2007 ) , about the kidnapping and murder of Wall Street Journal reporter Daniel Pearl in Pakistan&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; subject : &amp;nbsp;Angelina Jolie&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; verb : starred &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; [star] (past)&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;what : Mariane Pearl&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Mariane Pearl &amp;nbsp;&amp;nbsp;[People]&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Michael Winterbottom&amp;nbsp;&amp;nbsp;[People]&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;documentary-style drama&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;A Mighty Heart&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;kidnapping and murder&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Wall Street Journal reporter &amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Daniel Pearl &amp;nbsp;&amp;nbsp;&amp;nbsp;[People]&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Pakistan &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;[Place]&lt;/span&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;she returned to Cambodia for two weeks and later met with Afghan refugees in Pakistan &lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; subject : Angelina Jolie&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; verb : &amp;nbsp;returned &amp;nbsp; &amp;nbsp; &amp;nbsp; [return] &amp;nbsp; &amp;nbsp; &amp;nbsp; (past)&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;adverb : later&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;what : Cambodia&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Cambodia [Place]&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;two weeks&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; met&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Afghan refugees&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;=&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Pakistan &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;[Place]&lt;/span&gt;&lt;br /&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Database&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;For the moment, &amp;nbsp;is programed in&amp;nbsp;&lt;/span&gt;Perl and&amp;nbsp;uses socket. And in the future (in some weeks) it will be graph databases programmed in C and based on KyotoCabinet and libevent.&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The database contains data issues from the textual analysis.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Nowadays the databse could be asked by simple human questions.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Examples&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; :&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;u&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Who is Angelina Jolie ?&lt;/span&gt;&lt;/b&gt;&lt;/u&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Angelina Jolie is american actress&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She is marcheline bertrand daughter jon voight actors&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She is chip taylor goddaughter james haven and maximilian schell jacqueline bisset sister&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She is slovak and german descent&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She is 2005 action-comedy&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She is 20 countries field missions world refugees persons&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She is 2005 world economic forum &amp;nbsp;davos 2006 invited speaker&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She is first recipient united nations correspondents association citizen world award&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;When divorce Angelina Jolie ?&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Angelina Jolie divorce 27 may 2003.&lt;/span&gt;&lt;br /&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;div&gt;&lt;div&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Who adopted Angelina Jolie ?&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Angelina Jolie adopted seven-month-old maddox chivan&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She adopted july 6 2005 zahara marley six-month-old girl ethiopia&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She adopted wide horizons for children addis ababa orphanage&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She adopted zahara&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She adopted boy tam binh orphanage ho chi minh city&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;What promote Angelina Jolie ?&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Angelina Jolie promote humanitarian causes her work refugees good ambassador united nations high commissioner &amp;nbsp;refugees&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She promote humanitarian causes political level&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;She promote humanitarian causes mass media&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Brick in progress&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Semantics :&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Using Freebase and DbPedia to obtain more informations and classifications about ontologies.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Using WordNet to identity sense of common words and synonyms.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Future work&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Database :&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The new database based on graph, with the KyotoCabinet &amp;nbsp;(high performance database library) and libevent (to allow asynchronous socket). With LZMA compression to minimize space in memory and Gzip or Bzip2 compression for persitent storage.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Many reverse index (object, verbe, subject).&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;A better understanding for complicated questions with the help of the brick of textual analysis, and abilities to build sentence answer.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;And after it distributs fault tolerant and highscability.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Auto Classification :&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;In the goal of classified every articles in one or several categories (Film, Person, Place, Car, Commerce, ...). Based on an auto learning with Bayesians Classifier and/or LSI.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Crawler :&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Improves the crawler to export pages interconnections on graph into the database, to allow construction of aggregats to detect community.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Convert of documents as PDF,DOC, ... in text&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Facial detection / recognition :&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Uses facial detection and recognition (with OpenCV) on photos present on page to auto-tagged persons, objects, vehicles.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Graphical interface :&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Realizes web interface with JQuery to present answer at questions and show text with semantics and text annotations (as Powerset).&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Using Google Maps to show location; gallery of picture, tooltips, and graph.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Supervision :&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Helps to detect faillure and to monitor uses of ressource, using SNMP to be easily plugged to other program such as Nagios.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Future steps&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Analysis of all Wikipedia&amp;nbsp;&lt;/span&gt;english pages&amp;nbsp;(more than 3.5 Millions) to constitute a massive knowledge base of the world.&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Monitor articles from newspaper's website and Twitter to detect events.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Adds of&amp;nbsp;&lt;/span&gt;information&amp;nbsp;sources to improove the knowledge based of the program as the CIA world factbook, and others.&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-8260508434417545041?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/8260508434417545041/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=8260508434417545041' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/8260508434417545041'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/8260508434417545041'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2010/11/web-analysis.html' title='Web Analysis'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_G2MDq1c2FCo/TNoTO6LDGLI/AAAAAAAAAZ8/jBk1nUz3Ids/s72-c/schema.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-1342260756605903331</id><published>2010-11-10T21:30:00.000-08:00</published><updated>2012-01-10T12:14:36.538-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='nlp'/><category scheme='http://www.blogger.com/atom/ns#' term='resume'/><category scheme='http://www.blogger.com/atom/ns#' term='python'/><category scheme='http://www.blogger.com/atom/ns#' term='job'/><category scheme='http://www.blogger.com/atom/ns#' term='perl'/><title type='text'>My resume</title><content type='html'>&lt;style type="text/css"&gt;p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 14.5px LMSans12 Regular; color: #3366a6}p.p2 {margin: 0.0px 0.0px 0.0px 0.0px; font: 10.0px LMSans12 Regular}p.p3 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px LMSans12 Regular}p.p4 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px LMSans12 Regular; color: #5a5a5a}span.s1 {font: 11.0px LMSans12 Regular}span.s2 {font: 10.0px LMSans12 Regular}span.s3 {color: #000000}span.s4 {color: #4080c0}&lt;/style&gt;   &lt;br /&gt;&lt;style type="text/css"&gt;p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 14.5px LMSans12 Regular; color: #3366a6}p.p2 {margin: 0.0px 0.0px 0.0px 0.0px; font: 10.0px LMSans12 Regular}p.p3 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px LMSans12 Regular}p.p4 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px LMSans12 Regular; color: #5a5a5a}span.s1 {font: 11.0px LMSans12 Regular}span.s2 {font: 10.0px LMSans12 Regular}span.s3 {color: #000000}span.s4 {color: #4080c0}&lt;/style&gt;   &lt;br /&gt;&lt;div class="p1"&gt;&lt;style type="text/css"&gt;p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 38.0px LMSans17 Regular; color: #a6a6a6}p.p2 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px LMSans17 Regular}p.p3 {margin: 0.0px 0.0px 0.0px 0.0px; font: 14.5px LMSans17 Regular; color: #3366a6}p.p4 {margin: 0.0px 0.0px 0.0px 0.0px; font: 10.0px LMSans17 Regular}p.p5 {margin: 0.0px 0.0px 0.0px 0.0px; font: 11.0px LMSans17 Regular; color: #5a5a5a}span.s1 {color: #737373}span.s2 {font: 10.0px LMSans17 Regular}span.s3 {font: 7.0px LMSans17 Regular}span.s4 {font: 11.0px LMSans17 Regular}span.s5 {color: #000000}span.s6 {color: #4080c0}&lt;/style&gt;   &lt;/div&gt;&lt;div class="p1"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="p2"&gt;&lt;i&gt;Mobile : &lt;/i&gt;(+33) 6.78.69.34.64&lt;/div&gt;&lt;div class="p2"&gt;&lt;i&gt;E-mail : &lt;/i&gt;boudetch[at]gmail.com&lt;/div&gt;&lt;div class="p2"&gt;&lt;i&gt;WWW : &lt;/i&gt;john-bouday.blogspot.com&lt;/div&gt;&lt;div class="p3"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="p3"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="p3"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="color: #3366a6;"&gt;Education&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;b&gt;2007–2011 :&lt;/b&gt; UTC, Université Technologique de Compiegne.&amp;nbsp;Engineer in Computer Science, Major in System and Network &lt;br /&gt;&lt;b&gt;2005–2007 :&lt;/b&gt; BTS Informatique de gestion, best GPA. 2-year computer science technical degree &lt;br /&gt;&lt;b&gt;2005 :&lt;/b&gt; Bac STI génie électronique. French baccalaureat in electronics&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&amp;nbsp; &lt;br /&gt;&lt;div class="p3"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="color: #3366a6;"&gt;Skills&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;b&gt;Web :&lt;/b&gt; PHP, CSS, Javascript, HTML, ExtJS, Jquery, Flash, ActionScript, conception of intranet, and frameworks &lt;br /&gt;&lt;b&gt;Programmation :&lt;/b&gt; Perl, Python, C, ASM, BASH, SQL &lt;br /&gt;&lt;b&gt;Modeling data :&lt;/b&gt; XML, JSON&lt;br /&gt;&lt;b&gt;Linux Administration (Debian, Ubuntu, OpenBSD, Fedora) :&lt;/b&gt; Ldap, Postfix, Dovecot, SSH, MySQL, Apache, Bind, Samba, Snort, DHCP, Iptables, quota, FTP, NFS, SNMP, Clamav &lt;br /&gt;&lt;b&gt;Windows Administration (2003 Server Entreprise) :&lt;/b&gt; Active Directory, GPO, FTP, share, SMTP, POP3, IMAP, DNS, DHCP, IIS &lt;br /&gt;&lt;b&gt;Network Administration (Cisco, 3com) :&lt;/b&gt; router, switch, VLAN, OSPF, RIP2, trunk, 802.1Q, packet tracer, WAN, MAN, LAN &lt;br /&gt;&lt;b&gt;Security :&lt;/b&gt; Penetration tests of servers and sites, corrections of security breaches, IDS, router,&amp;nbsp;firewall, sniffer &lt;br /&gt;&lt;b&gt;Operating system :&lt;/b&gt; Linux, BSD, Apple OS X, Microsoft Windows, Cisco IOS, Solaris &lt;br /&gt;&lt;b&gt;Database :&lt;/b&gt;&amp;nbsp;Elastic Search, Redis, Neo4J, RabbitMQ, LDAP, MySQL, PostgreSQL, Oracle, TokyoCabinet&lt;br /&gt;&lt;b&gt;API :&lt;/b&gt; Freebase, Google Map, Wikipedia, Flickr, Twitter &lt;br /&gt;&lt;b&gt;NLP:&lt;/b&gt; NLTK, LinkParser&lt;br /&gt;&lt;b&gt;Applications :&lt;/b&gt; TEX, LATEX, Photoshop, Office&lt;br /&gt;&lt;b&gt;English :&lt;/b&gt; Reading, writing, speaking (TOEIC : 775)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div class="p4"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="p3"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="color: #3366a6;"&gt;Work experiences&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;20I1 Internship of 6 months&lt;/b&gt;, &lt;a href="http://www.legrand.com/EN/"&gt;Legrand&lt;/a&gt;, Software marketing. Management of project, mock-up and technical expertise on the futur expert system software based on open source technolgy.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;b&gt;2009 Internship of 6 months, &lt;a href="http://www.sgcib.com/"&gt;Bank Société Générale&lt;/a&gt; :&lt;/b&gt;&amp;nbsp;support front and middle office. Development of monitoring tools for financial trades, intranet and deployment for the support team FMO trading. &lt;br /&gt;&lt;br /&gt;&lt;b&gt;Since 2008 SiMDE :&lt;/b&gt;&amp;nbsp;Computer service of the student home of the UTC. Servers management of associations of the Université de Technologie de Compiégne. &lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;a href="http://john-bouday.blogspot.com/2008/10/olympiades-des-metiers.html"&gt;2007 Olympiades des Métiers :&lt;/a&gt;&lt;/b&gt; management of computer network. Rank 5th of National competition « &lt;a href="http://www.worldskills.org/"&gt;Olympiade des métiers&lt;/a&gt; » 2007 categorie management of computer network &lt;br /&gt;&lt;br /&gt;&lt;b&gt;2006 2-month Internship :&lt;/b&gt;&amp;nbsp;Administration of 300 computers and 20 servers at Lycée Turgot. Installation of inventory system of computer (OCSInventory-NG) and SNMP server supervision (Cacti). &lt;br /&gt;&lt;div class="p5"&gt;&lt;span class="Apple-style-span" style="color: black;"&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="p1"&gt;&lt;span class="Apple-style-span" style="color: #3366a6; font-size: 15px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="p1"&gt;&lt;span class="Apple-style-span" style="color: #3366a6;"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Projects&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;b&gt;Snort Plugin :&amp;nbsp;&lt;/b&gt;Development of Perl plugin for the IDS Snort with geolocation, Latex reports of attacks&amp;nbsp;with impacts and advices, auto generation of Iptables rules.&lt;/div&gt;&lt;div class="p1"&gt;&lt;br /&gt;&lt;/div&gt;&lt;b&gt;&lt;a href="http://john-bouday.blogspot.com/2010/04/inrtanet-mde-de-lutc.html"&gt;Intranet MDE UTC :&lt;/a&gt;&lt;/b&gt;&amp;nbsp;Development of the intranet of the MDE of UTC, designed to manage more than&amp;nbsp;120 UTC associations with personnal informations, share and private agenda, web site creator, cloud documents editor, membership management, mailling list administration.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;b&gt;Social Network :&lt;/b&gt;&amp;nbsp;Development of a social network 2.0 based on location, high scability technologies, based on a unique web page which uses custom templates and JSON data with homemade Jquery Modules to save bandwith and provide better user experience.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;b&gt;CNS 2009, Challenge of Network Security :&lt;/b&gt;&amp;nbsp;Creator and technical organization of a one day network security challenge at the UTC with supports of professional security experts and teacher of other schools, creator of 15 level tests, main conference speaker of the ending meeting.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;b&gt;&lt;a href="http://john-bouday.blogspot.com/2010/11/web-analysis.html"&gt;Search engine &lt;/a&gt;:&lt;/b&gt; Development of PERL search engine which extracts content from Wikipedia, realizes context understanding analysis and store the result in homemade graph NoSQL database focus on high performance, to answer at natural language question. With recognition of ontologies and auto-classification by categories. Thanks to semantics API as Freebase it provides more informations on each items, and allow relative content.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;b&gt;NSC Live CD for security :&lt;/b&gt;&amp;nbsp;Creation of a live CD Debian for a security group, embedding collections of security tools designed to learn security through penetration test exercices, and lessons. &lt;br /&gt;UTC Apartment :&amp;nbsp;Creation of web service to allows UTC students to search or rent apartements across the country.&lt;br /&gt;&lt;div class="p2"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="p2"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="p1"&gt;&lt;span class="Apple-style-span" style="color: #3366a6;"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Interest and Activities&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;b&gt;Association :&lt;/b&gt; Active adherent of computer association &lt;a href="http://www.alternatives87.eu.org/"&gt;Alternatives87&lt;/a&gt; for promotion of open source software (member since 2001). &lt;br /&gt;&lt;div class="p4"&gt;&lt;span class="Apple-style-span" style="color: black;"&gt;&lt;span class="Apple-style-span" style="font-size: 7px;"&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-1342260756605903331?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/1342260756605903331/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=1342260756605903331' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/1342260756605903331'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/1342260756605903331'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2010/11/my-resume.html' title='My resume'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-1459185567662178523</id><published>2010-04-04T11:17:00.000-07:00</published><updated>2010-04-04T11:44:15.348-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='jquery'/><category scheme='http://www.blogger.com/atom/ns#' term='intranet'/><category scheme='http://www.blogger.com/atom/ns#' term='extjs'/><category scheme='http://www.blogger.com/atom/ns#' term='php'/><category scheme='http://www.blogger.com/atom/ns#' term='web'/><title type='text'>Intranet MDE de l'UTC</title><content type='html'>Dans le cadre de mes&amp;nbsp;études, je fais parti de l'association SiMDE en charge du&amp;nbsp;fonctionnement&amp;nbsp;de l'infrastructure informatique (serveurs et pc) de la MDE de l'UTC.&lt;br /&gt;J'ai&amp;nbsp;développé&amp;nbsp;un intranet (HTML, CSS, PHP, ExtJS, Jquery), dans le but de simplifier les démarches de&amp;nbsp;chaque&amp;nbsp;association hébergée sur nos serveurs (plus de 90 assos).&lt;br /&gt;&lt;br /&gt;Voici quelques captures&amp;nbsp;d'écrans&amp;nbsp;de cet intranet.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jRg5SCzkI/AAAAAAAAAXM/3D2XhqTY9B8/s1600/Image+6.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;br /&gt;&lt;/a&gt;&lt;br /&gt;&lt;div style="text-align: left;"&gt;authentification (LDAP)&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;table&gt;&lt;tbody&gt;  &lt;/tbody&gt;&lt;/table&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jRg5SCzkI/AAAAAAAAAXM/3D2XhqTY9B8/s1600/Image+6.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jRg5SCzkI/AAAAAAAAAXM/3D2XhqTY9B8/s320/Image+6.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;page&amp;nbsp;d'accueil&amp;nbsp;composée de portlet&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;a href="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jSXjfMe1I/AAAAAAAAAXc/Pyu31Rkey50/s1600/Image+7.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jSXjfMe1I/AAAAAAAAAXc/Pyu31Rkey50/s320/Image+7.png" /&gt;&lt;/a&gt;&lt;a href="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jSXjfMe1I/AAAAAAAAAXc/Pyu31Rkey50/s1600/Image+7.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;br /&gt;&lt;/a&gt;&lt;a href="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jSXjfMe1I/AAAAAAAAAXc/Pyu31Rkey50/s1600/Image+7.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;br /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;Modifications des informations du compte assos&lt;br /&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jUt6R-e5I/AAAAAAAAAXk/1cDWmwUFKD8/s1600/Image+8.png" imageanchor="1" style="clear: left; display: inline !important; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jUt6R-e5I/AAAAAAAAAXk/1cDWmwUFKD8/s320/Image+8.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Réservation de salle&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jVLdZxxwI/AAAAAAAAAXs/OORpio76mLo/s1600/Image+11.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jVLdZxxwI/AAAAAAAAAXs/OORpio76mLo/s320/Image+11.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;Agenda fournisseur&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jV0Zm4CjI/AAAAAAAAAX0/BQX5lxGbpsc/s1600/Image+12.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jV0Zm4CjI/AAAAAAAAAX0/BQX5lxGbpsc/s320/Image+12.png" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;ACL via Drag and Drop&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;a href="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jWFJROGvI/AAAAAAAAAX8/gXgulH4Qq_s/s1600/Image+17.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/_G2MDq1c2FCo/S7jWFJROGvI/AAAAAAAAAX8/gXgulH4Qq_s/s320/Image+17.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;Gestion des membres&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;a href="http://2.bp.blogspot.com/_G2MDq1c2FCo/S7jWVRUeu0I/AAAAAAAAAYE/QPNJZ_lloD0/s1600/Image-14.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/_G2MDq1c2FCo/S7jWVRUeu0I/AAAAAAAAAYE/QPNJZ_lloD0/s320/Image-14.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;Editeur de document à la Google Docs (partage entre utilisateurs ...) ainsi qu'éditeur&amp;nbsp;de site web&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;a href="http://3.bp.blogspot.com/_G2MDq1c2FCo/S7jWuXArc-I/AAAAAAAAAYM/lKnM4umo0WE/s1600/Image+18.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/_G2MDq1c2FCo/S7jWuXArc-I/AAAAAAAAAYM/lKnM4umo0WE/s320/Image+18.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;(Jquery from scratch)&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;a href="http://3.bp.blogspot.com/_G2MDq1c2FCo/S7jW4wXpg6I/AAAAAAAAAYU/LDTW3OaTVWw/s1600/Image+20.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="267" src="http://3.bp.blogspot.com/_G2MDq1c2FCo/S7jW4wXpg6I/AAAAAAAAAYU/LDTW3OaTVWw/s640/Image+20.png" width="640" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;a href="http://3.bp.blogspot.com/_G2MDq1c2FCo/S7jXFldKUxI/AAAAAAAAAYc/wUkNM-tGuXA/s1600/Image+21.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="100" src="http://3.bp.blogspot.com/_G2MDq1c2FCo/S7jXFldKUxI/AAAAAAAAAYc/wUkNM-tGuXA/s320/Image+21.png" width="300" /&gt;&lt;/a&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jXIrmPaAI/AAAAAAAAAYk/MYc2PXS21Ys/s1600/Image+22.png" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jXIrmPaAI/AAAAAAAAAYk/MYc2PXS21Ys/s320/Image+22.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-1459185567662178523?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/1459185567662178523/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=1459185567662178523' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/1459185567662178523'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/1459185567662178523'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2010/04/inrtanet-mde-de-lutc.html' title='Intranet MDE de l&apos;UTC'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jRg5SCzkI/AAAAAAAAAXM/3D2XhqTY9B8/s72-c/Image+6.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-1169052108006198103</id><published>2010-04-04T10:28:00.000-07:00</published><updated>2010-04-04T10:47:21.732-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data mining'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic'/><category scheme='http://www.blogger.com/atom/ns#' term='recherche'/><category scheme='http://www.blogger.com/atom/ns#' term='perl'/><category scheme='http://www.blogger.com/atom/ns#' term='web'/><category scheme='http://www.blogger.com/atom/ns#' term='script'/><category scheme='http://www.blogger.com/atom/ns#' term='moteur de recherche'/><title type='text'>Modélisation d'un système d'analyse de page web</title><content type='html'>&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: 21.25pt;"&gt;Notre projet tente de modifier la manière dont une page internet est analysée et dont les mots clefs sont extraits.&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-size: 19px;"&gt;&lt;u&gt;&lt;br /&gt;&lt;/u&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;h2 style="margin-left: 54.0pt; mso-list: l0 level2 lfo1; text-indent: -18.0pt;"&gt;&lt;span style="font-family: 'Times New Roman'; font-size: 14pt;"&gt;&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;Conception.&lt;u&gt;&lt;span style="font-family: 'Times New Roman'; font-size: 14pt;"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/u&gt;&lt;/h2&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-size: 19px;"&gt;&lt;u&gt;&lt;br /&gt;&lt;/u&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: 21.25pt;"&gt;Notre projet se compose de plusieurs briques logiciel permettant de concevoir un moteur de recherche, pour le moment nous n’avons qu’un nombre réduit de briques, mais dans le futur nous pourrions avoir toutes les briques nécessaire.&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Chacune des parties est programmée en Perl ou en C. Plus tard, l’ensemble des parties sera porté en C pour augmenter les performances. Le Perl sert à faciliter le maquettage des applications.&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Notre architecture se compose de programmes en mode clients/serveurs communicants les uns avec les autres. Cela permet une conception en cluster sur un ensemble de machines en réseau. Nous pouvons donc avoir une puissance de calcul évolutives et à des coûts inférieurs à des supercalculateurs.&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-size: 19px;"&gt;&lt;u&gt;&lt;br /&gt;&lt;/u&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;span class="Apple-style-span" style="font-size: 19px;"&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="-webkit-text-decorations-in-effect: none; font-family: Times; font-size: medium;"&gt;&lt;u&gt;&lt;span style="font-family: 'Times New Roman'; font-size: 14pt;"&gt;&lt;o:p&gt;&lt;span style="text-decoration: none;"&gt;&lt;span class="Apple-style-span" style="-webkit-text-decorations-in-effect: none; font-family: Times; font-size: medium;"&gt;&lt;u&gt;&lt;span style="font-family: 'Times New Roman'; font-size: 14pt;"&gt;&lt;o:p&gt;&lt;span style="text-decoration: none;"&gt;&lt;/span&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/u&gt;&lt;/span&gt;&lt;/span&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/u&gt;&lt;/span&gt;&lt;/u&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;u&gt;&lt;u&gt;&lt;u&gt;&lt;/u&gt;&lt;/u&gt;&lt;/u&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;u&gt;&lt;u&gt;&lt;u&gt;&lt;/u&gt;&lt;/u&gt;&lt;/u&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;u&gt;&lt;u&gt;&lt;u&gt;&lt;h2 style="display: inline !important; margin-left: 54pt; text-indent: -18pt;"&gt;&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Schéma du système&lt;/span&gt;&lt;/span&gt;&lt;/h2&gt;&lt;/u&gt;&lt;/u&gt;&lt;/u&gt;&lt;/span&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_G2MDq1c2FCo/S7jF9kytVjI/AAAAAAAAAW8/6EMM6cr_HSs/s1600/a.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://2.bp.blogspot.com/_G2MDq1c2FCo/S7jF9kytVjI/AAAAAAAAAW8/6EMM6cr_HSs/s640/a.jpg" width="500" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Fonctionnement&lt;/span&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: auto;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;Notre système est pour l’instant décomposé en 5 parties.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: auto;"&gt;&lt;/div&gt;&lt;h3 style="margin-left: 90.0pt; mso-list: l0 level3 lfo1; mso-text-indent-alt: -9.0pt; text-indent: -90.0pt;"&gt;&lt;span style="text-decoration: none;"&gt;&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/span&gt;i.&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Crawler.&lt;/h3&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: 21.25pt;"&gt;Etant au début parti sur la réalisation d’un crawler multithread, nous avions un début de&amp;nbsp; première brique en mesure de se promener de pages en pages pour récupérer du contenue.&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Cette partie a été abandonnée au profit d’une autre brique nous n’avons pas eu le temps d’implémenter des fonctionnalités tel que le focus, et la profondeur.&amp;nbsp;&amp;nbsp; &lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;/div&gt;&lt;h3 style="margin-left: 90.0pt; mso-list: l0 level3 lfo1; mso-text-indent-alt: -9.0pt; text-indent: -90.0pt;"&gt;&lt;span style="text-decoration: none;"&gt;&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;ii.&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;Scrapper.&lt;/h3&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: 21.25pt;"&gt;Le but de cette seconde brique est de récupérer le contenu d’une page web, et en extraire différentes informations (scraping).&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: 21.25pt;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Notre scrapper est en mesure de découper une page en article. Nous n’utilisons pas de modèle, appelé aussi “Template”, ce qui nous permet de pouvoir extraire des articles sur un maximum de sites ayant une conception très différente comme&amp;nbsp;:&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpFirst" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;blog&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;forum&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;site personnel&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;site d’information&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;entreprise&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpLast" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;etc.…&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Cela marche aussi sur un document n’ayant qu’un seul et unique article.&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Les résultats ont montré un score de réussite de près de 90%. &lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Grâce a cela nous pouvons nous concentrer uniquement sur la partie importante et faire abstraction des menus, bannière, publicités, etc.&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none;"&gt;Nous analysons ensuite chacun des articles pour en extraire les informations suivantes :&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpFirst" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;Le titre&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;La date de publication&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;Le nombre de commentaire pour un blog&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;Le texte&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;Les liens&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;Les mots clefs&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpMiddle" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;La catégorie de l’article&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="MsoListParagraphCxSpLast" style="margin-left: 54.0pt; mso-add-space: auto; mso-layout-grid-align: none; mso-list: l1 level1 lfo2; mso-pagination: none; tab-stops: 28.0pt 56.0pt 84.0pt 112.0pt 140.0pt 168.0pt 196.0pt 224.0pt 252.0pt 280.0pt 308.0pt 336.0pt; text-align: justify; text-autospace: none; text-indent: -18.0pt;"&gt;-&lt;span style="font: normal normal normal 7pt/normal 'Times New Roman';"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;Les images&lt;/div&gt;&lt;u&gt;&lt;br /&gt;&lt;/u&gt;&lt;br /&gt;&lt;div class="MsoNormal"&gt;&lt;u&gt; &lt;/u&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;u&gt;&lt;/u&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; font-size: x-large;"&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Ainsi, nous sommes en mesure de juger de la pertinence d’un article en fonction de son nombre de commentaire, de sa catégorie, sa date, ses mots clefs.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;De plus, nous pouvons estimer un délai pour que la page soit de nouveau crawler en fonction de la fréquence de parution de nouveaux articles (cela permet d’économiser les ressources de notre crawler).&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Grâce au nombre de commentaire et à la fréquence des articles nous pouvons reconnaître des posteurs qui influencent la communauté et qui sont actifs.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Suite à ce découpage nous ne parlons plus en terme de page mais d’article. Si on reprend l’algorithme de Hits ou le Pagerank et les appliquons sur des articles, et non plus sur des documents, nous serons en mesure d’avoir un classement par pertinence d’une portion d’un document. Cela est intéressant pour des sites personnels ou des blogs qui traitent de nombreux sujets.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;exemple :&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;page vu par un navigateur :&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jH32nyg4I/AAAAAAAAAXE/pT9GwabXWU0/s1600/2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/S7jH32nyg4I/AAAAAAAAAXE/pT9GwabXWU0/s640/2.jpg" width="241" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Voici un exemple d’une sauvegarde en XML de cette page:&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;page url='http://mostlylisa.com/'&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;article&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;titre&amp;gt;Mostly Macworld Keynote 2009&amp;lt;/titre&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;comment&amp;gt;22&amp;lt;/comment&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;publish&amp;gt;11 January 2009&amp;lt;/publish&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;content&amp;gt;I apologize for my lousy blogging lately. Macworld has been insane for me. I was on my feet from 7am - 3am, running around the expo doing Macbreak interviews, being a guest on Macbreak Weekly, recording TWiP, and looting booths for schwag (the most important thing at MW), and attending a few shindigs.I plan on writing a detailed post on my reflections of Macworld and my top picks of the Expo in a few days. Before I give you my thoughts on the keynote, I’d like to hear yours.Were you disappointed with this year’s Macworld keynote?Like say the fact that they didn’t even mention Snow Leopard or release a new mini or iMac or, like announce something cool other than the ability to DRM-free your previously bought itunes music for $0.30 a pop? 30 x 14GB of music = I don’t know, you do the math.There is a super awesome prize for the person who makes the best comment. So breathe in and let it all out. Please don’t make Steve cry too much. Think about his hormone imbalance. Please.22 Comments ». Tagged in Apple, Geeky Stuff, Tech/Web, Travel, Videos&amp;lt;/content&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;macworld&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;keynote&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;macbreak&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;ability&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;announce&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://mostlylisa.com/2009/01/mostly-macworld-keynote/'&amp;gt;Mostly Macworld Keynote 2009&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://www.pixelcorps.tv/macbreak173'&amp;gt;Macbreak interviews,&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://twit.tv/mbw122'&amp;gt;being a guest on Macbreak Weekly,&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://feeds.feedburner.com/mostlylisa/yKBd'&amp;gt;&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;cat&amp;gt;apple&amp;lt;/cat&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;cat&amp;gt;geeky-stuff&amp;lt;/cat&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;cat&amp;gt;techweb&amp;lt;/cat&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;cat&amp;gt;travel&amp;lt;/cat&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;cat&amp;gt;videos&amp;lt;/cat&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;/article&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;article&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;titre&amp;gt;Mostly Macworld 2009&amp;lt;/titre&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;comment&amp;gt;15&amp;lt;/comment&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;publish&amp;gt;6 January 2009&amp;lt;/publish&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;content&amp;gt;Photo by Scott Meizner’s slick Canon 5D Mark II.It’s just after midnight, the day before Macworld keynote ‘09. I can see the glow of the Moscone Center from my hotel room. I can’t quite see the line o’fanboys, but if I crane my neck a wee bit, I can see the twinkle of their MBP and a glint in their eyes. They miss Jobs. Ahh, don’t we all.For those of you not able to come to Macworld, I’ll be covering all of its geeky goodness with the MacBreak crew. So I want to ask you: What Macworld inside scoop would you like hear about? &amp;nbsp;If you think of person, company, or Mac-related product you’d like to learn about, fire a comment here or @lisabettany on twitter or squint your eyes, distort the Space-Time continuum, and leave me a scroll somewhere near the Moscone Center. No guarantees that I’ll get it, but good effort, none-the-less.15 Comments ». Tagged in Apple, Geeky Stuff&amp;lt;/content&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;macworld&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;moscone&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;comment&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;company&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;key&amp;gt;continuum&amp;lt;/key&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;img width='386'&amp;gt;http://farm4.static.flickr.com/3258/3169628089 ea8a1c0931.jpg&amp;lt;/img&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://mostlylisa.com/2009/01/mostly-macworld-2009/'&amp;gt;Mostly Macworld 2009&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://www.flickr.com/photos/redpilotmedia/3169628089/'&amp;gt;&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://flickr.com/photos/smeinzer/page4/'&amp;gt;Scott Meizner’s&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://www.pixelcorps.tv/macbreak'&amp;gt;MacBreak &amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://twitter.com/lisabettany'&amp;gt;@lisabettany&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;link href='http://feeds.feedburner.com/mostlylisa/yKBd'&amp;gt;&amp;lt;/link&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;cat&amp;gt;apple&amp;lt;/cat&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;  &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;cat&amp;gt;geeky-stuff&amp;lt;/cat&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;lt;/article&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Analyse textuelle.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Cette brique permet de comprendre le sens d’une phrase en anglais.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;La grammaire française étant plus compliquée à modéliser, c’est pour cela que nous avons préféré utiliser l’anglais.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Suite à la partie faite par le scrapper, le contenu textuel est extrait d’un article. Nous allons le découper en phrases puis chaque phrase va être envoyée à un serveur qui va l’analyser et renvoyer les informations suivantes de la phrase :&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;QUI&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;FAIT&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;QUOI&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;OÙ&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;QUAND&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;COMMENT&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;Ainsi, notre phrase est à son tour découpée en ensemble de mots (1 à 6 mots en général).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Nous enlevons ensuite les stopwords, et récupérons des mots clefs composés de plusieurs mots et pouvant être classés en fonction de leur nature (sujet, action, manière, date, lieu ...).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Donc, le mot “maison blanche” sera un mot clef (d’un ensemble de mot) et non décomposé en plusieurs mots clés, on n’aura plus besoin de faire des jointures. On gagne alors en ressource, rapidité, et précision.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Exemple :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;QUI : you&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;FAIT : have&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;QUOI : smaller pets&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;QUI : Everyone&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;FAIT : was leaving&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;QUOI : the park&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Création de mots clefs en catégorie&lt;/span&gt;.&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Cette brique permet &amp;nbsp;la classification automatique en catégorie. Elle extrait &amp;nbsp;les mots clefs provenant d’une définition d’un domaine, puis en filtrant de manière automatique il ne récupère que les mots clés ayant un rapport avec la catégorie demandée.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Ce programme n’est pas encore totalement opérationnel même s’il retourne un grand nombre de mots ayant un rapport direct avec la catégorie, il retourne encore 10-15% de mots sans rapport (des mots de liaison). Ce problème sera résolu avec l’ajout de la lemmatisation, et d’une constitution automatique de stopword.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Exemple :&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Catégorie : finance&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Instrument&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Internationaux&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Locales&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Monétaire&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Monétaires&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Optimisation&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Revenus&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Bourses&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Capitaux&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Collectivités&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Compagnies&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Diversification&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Décisions&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Développement&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Financement&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Finance&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Marchés&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Banques&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Entreprise&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Politique&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Risques&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Gestion&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Institutions&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Économie&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Actions&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Assurances&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Caisses&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Extraction de concept.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Notre dernière brique permet d’extraire des “concept” commun de notre texte et &amp;nbsp;des informations relatives à ce concept.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Tout d’abord le logiciel recherche des mots ou ensemble de mots dont il connaît la définition à travers sa base de données (construite à partir d’un ensemble de données extraites d’internet).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Puis pour chacun de ces mots, il va chercher les catégories dans lesquels ces mots sont inscrits.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Ensuite, il va pouvoir rajouter des informations sur les mots selon les catégories comme:&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;- personne : date de naissance, de décès, nationalité, métier, ...&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;- ville, pays : nombre d’habitants, localisation (lat, long), site internet, ...&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;- entreprise : logo, nombre d’employés, capital, valeur boursière, ...&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;- voiture : constructeur, moteur, taille, poids, ...&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;- logiciel : logo, version, licence&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Pour chacun de ses mots nous allons pouvoir trouver d’autre mots dans les même catégories que l’on pourra proposer dans la partie “Termes associés”, ainsi que des images comme des logos ou des photos.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Exemple :&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;“Les violences se poursuivent à Gaza, mais au Caire, la diplomatie progresse. L'aviation israélienne a frappé mercredi un cimetière de la ville de Gaza, et continué à pilonner des positions du Hamas ainsi que des tunnels servant à la contrebande. Au 19e jour de son offensive, Tsahal a en outre riposté à de nouveaux tirs de roquettes sur le nord d'Israël depuis le Liban qui laissent craindre l'ouverture d'un autre front.”&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Les mots suivant sont reconnus :&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Aviation&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Gaza&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Hamas&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Israël&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Liban&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Roquettes&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Prenons l’exemple du mot « Gaza » et cherchons d’autres informations:&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Tout d’abord la définition :&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;"La ville de Gaza (parfois appelée Gaza City pour la distinguer de la bande de Gaza qui désigne la région dans son ensemble) est la ville principale de la bande de Gaza."&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;Cherchons les catégorie dans lequel le mot Gaza est classé :&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Cities in the Gaza Strip&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Coastal settlements&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Gaza Governorate&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Hebrew Bible cities&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Philistine cities&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1929 Palestine Riots&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Cherchons plus d’informations :&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Nom Local&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;: &lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;قطاع غزة&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Carte :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; map_gaza.png&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Langues :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Arabe&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Statut :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Territoire administré par l'Autorité palestinienne, non reconnu internationalement comme faisant partie d'aucun pays&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Capitale :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Aucune&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Titres Dirigeants :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Présidents de l’Autorité palestinienne&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Noms Dirigeants :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Mahmoud Abbas&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Titres Dirigeants :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Premiers ministres de l’Autorité palestinienne&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Noms Dirigeants :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Salam Fayyad&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Superficie :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; 360&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Population Totale :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; 1 376 289&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Monnaie :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; Shekel&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Fuseau Horaire :&lt;/span&gt;&lt;/b&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; +2&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Date importante : &lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;12 septembre 2005&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;juin 2007&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;15 novembre&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; 2005&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;13 septembre 1993&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;15 août 2005&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;22 août 2005&lt;/span&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;12 septembre 2005&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-tab-span" style="white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;16 septembre 2005&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Ces informations ne sont pas de toute dernière fraicheur mais permettent d’avoir des mots clefs et des informations connexes sur la recherche.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Cette brique n’est pas encore finie mais se montre déjà très prometteuse.&amp;nbsp;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Amélioration du système.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Pour améliorer notre système il faut :&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;Au niveau du crawler :&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Implémenter le focus.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;La profondeur.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;L’analyse du nom de domaine via whois.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;La géo localisation du site.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Gérer la base de donnée (rajouts des urls, calcul du CRC pour éviter de réindexer &amp;nbsp;une page dont le contenue n’a pas changé).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Implémenter d’autre protocole que HTTP et HTTPS tel FTP, NEWS.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;Au niveau du scrapper :&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Ajouter la lemmatisation.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Extraire la langue (fr, us, uk, es, ...).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Le niveau littéraire de l’article (enfant jusqu'à scientifique).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Donner un score des mots en fonction de leur mise en page (h1, ..., h6, u, i ....) et de leurs positions dans le texte.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Trouver la nature du site (blog, forum, site personnel, entreprise, journal, ...).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Lire des fichiers non HTML, comme Word, PDF, Excel, Open Office, Flash, Mp3 et en extraire les informations utiles.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Extraire les auteurs et les adresses email.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Stocker toutes les données dans une base de donnée.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Prendre une miniature de la page afin de la proposer lors de la recherche.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Calculer le ranking des articles/pages/sites/auteurs/images&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;Analyse textuelle :&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Ajouter de nouvelles langues.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Améliorer la recherche de lieux.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Donner un score des mots en fonction de leur valeur péjorative ou non.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;Créations des mots en catégorie :&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Implémenter la lemmatisation.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Donner un score des mots selon leur position.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Stocker les catégories dans la base.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Trouver des sous/sur catégorie&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;Extraction de concept :&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Trouver de nouveaux concepts de manière automatique s’ils ne sont pas déjà présents dans la base.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Coupler les catégories des mots d’un article pour en trouver une signification au niveau de la requête du client.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;b&gt;Futures briques :&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;La base de données.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;La génération de graphes pour les articles, auteurs, catégories, sites.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Découper nos résultats en agrégats.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Une interface graphique (page web).&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Des briques de supervision.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;Haute disponibilité.&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;span class="Apple-style-span" style="font-size: xx-large;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;&lt;span class="Apple-style-span" style="font-size: xx-large;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: x-large;"&gt;Utilisation du projet.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Notre projet peut servir dans différents domaines.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-family: Times;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman';"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Premièrement, tel un moteur de recherche classique (sur l’ensemble du web, ou une communauté), comme Google.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;Un moteur de news (tel Google Actu), on surveille les sites d’actualités et on peut regrouper les articles ensemble en fonction de leur contenu.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;D’analyse stratégique (veille) comme WebFountain. On peut surveiller ce que pense une communauté, une personne, un site, d’un événement particulier, d’un produit commercial de manière très précise. Ainsi, il est possible de cibler une personne sur un produit plus adapté, mais il est aussi possible de connaître très rapidement ce que pense une population sur un nouveau produit avant même sa sortie.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-1169052108006198103?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/1169052108006198103/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=1169052108006198103' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/1169052108006198103'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/1169052108006198103'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2010/04/modelisation-dun-systeme-danalyse-de.html' title='Modélisation d&apos;un système d&apos;analyse de page web'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_G2MDq1c2FCo/S7jF9kytVjI/AAAAAAAAAW8/6EMM6cr_HSs/s72-c/a.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4220755233477842379.post-2754711570896229898</id><published>2008-10-01T07:55:00.000-07:00</published><updated>2008-10-01T08:22:00.721-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='admin'/><category scheme='http://www.blogger.com/atom/ns#' term='Linux'/><category scheme='http://www.blogger.com/atom/ns#' term='réseau'/><category scheme='http://www.blogger.com/atom/ns#' term='windows'/><category scheme='http://www.blogger.com/atom/ns#' term='3com'/><category scheme='http://www.blogger.com/atom/ns#' term='Cisco'/><title type='text'>Olympiades des metiers</title><content type='html'>Résumé du concours des 39eme Olympiades des Métiers.&lt;br /&gt;Suite a ma sélection régional de meilleur admin réseau de moins de 23 ans, j'ai put participer a la finale national du concours du meilleur admin français de moins de 23 ans.&lt;br /&gt;J'ai finit 5eme national, j'ai mal joué j'aurais pus faire mieux si je m'était pas obstiner sur un Samba contrôleur primaire de domaine et que j'avais fais les regles Iptables qui finallement rapporté 8points comparé a mes 2 points du Samba (je penssais que Samba rapporterais plus, ca m'apprendra)&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_G2MDq1c2FCo/SOOTGKHfZZI/AAAAAAAAASM/0Lonyt99yw8/s1600-h/DSCN1818.JPG"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; cursor: pointer;" src="http://1.bp.blogspot.com/_G2MDq1c2FCo/SOOTGKHfZZI/AAAAAAAAASM/0Lonyt99yw8/s320/DSCN1818.JPG" alt="" id="BLOGGER_PHOTO_ID_5252203324448400786" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Mon poste de travail lors de l'entrainement :&lt;br /&gt;&lt;ul&gt;&lt;li&gt;2 ordi Dell (Windows 2003 server et Linux Fedora)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;2 siwtchs Cisco 2960&lt;br /&gt;&lt;/li&gt;&lt;li&gt;2 Routeurs Cisco&lt;/li&gt;&lt;li&gt;Switch 3com 4500&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_G2MDq1c2FCo/SOOSyz8b2qI/AAAAAAAAASE/A_gqftLh2po/s1600-h/DSCN1905.jpg"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; cursor: pointer;" src="http://4.bp.blogspot.com/_G2MDq1c2FCo/SOOSyz8b2qI/AAAAAAAAASE/A_gqftLh2po/s320/DSCN1905.jpg" alt="" id="BLOGGER_PHOTO_ID_5252202992078936738" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Mon poste lors du concours national :&lt;br /&gt;&lt;ul&gt;&lt;li&gt;2 pc dont un assemblé&lt;/li&gt;&lt;li&gt;Switchs Cisco 2960&lt;/li&gt;&lt;li&gt;Routeurs Cisco&lt;/li&gt;&lt;li&gt;Siwtch 3Com&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:180%;"&gt;&lt;span style="font-weight: bold;"&gt;Épreuves :&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold;"&gt;1er jour  :&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt; &lt;/span&gt; &lt;span style="font-style: italic; font-weight: bold;"&gt;1er partie durée 3 heures :&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt; Assemblage d'un poste de travail (carte mere, cpu, carte graph, ram, dd ...)&lt;/li&gt;&lt;li&gt;Installation de MS-DOS 6.22 (trois partitions, modif d'option ....)&lt;/li&gt;&lt;li&gt;Installation de XP (driver, réseau, netbios, trois partitions, SP2, renommage lecteur ...)&lt;/li&gt;&lt;li&gt;Dépannage du pc.&lt;/li&gt;&lt;li&gt; Création de câble réseau (droit et croise).&lt;/li&gt;&lt;li&gt;Installation de Windows 2003 Serveur sur le deuxieme poste (driver, partitions, netbios, reseaux, SP1, user ...)&lt;/li&gt;&lt;li&gt;Configuration switch Cisco (password, VLAN, FD ...)&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;2éme partie durée 3 heures :&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Serveur DHCP (win) (reservation, bail, reseaux ....)&lt;/li&gt;&lt;li&gt;Active Directory  (domaine, DNS ...)&lt;/li&gt;&lt;li&gt;Création groupes et utilisateurs (partage, dossier privee, quota, sécurite, strategie systeme commune, auto-montage ...)&lt;/li&gt;&lt;li&gt;Ghost Entreprise (image via disquette ...)&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-size:130%;" &gt;2éme jour&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;1er partie durée 3 heures :&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Installation Linux Fedora Core 4 (non graphique :), partition, réseau, paquetage, désactivation de service ....)&lt;/li&gt;&lt;li&gt;Installation  de Windows XP Pro (comme dab).&lt;/li&gt;&lt;li&gt;Cable réseau a réaliser (comme dab, faut bien s'occuper).&lt;/li&gt;&lt;li&gt; Configuration switch Cisco (VLAN, TFTP, FD ....)&lt;/li&gt;&lt;li&gt;DHCP (Linux) (réservation, gateway ...)&lt;/li&gt;&lt;li&gt;Virtual PC (encore un XP même config)&lt;/li&gt;&lt;li&gt;DNS (Linux) &lt;/li&gt;&lt;li&gt;Mail (Linux) (SMTP, POP3, user, list, config Outlook ...)&lt;/li&gt;&lt;li&gt;(tout sa en console (non graph) a part Windows)&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-weight: bold;"&gt;2éme partie durée 3 heures :&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Nouvelle partition ext3 Linux.&lt;/li&gt;&lt;li&gt;Apache (nouvelle racine, htaccess, TLD ...)&lt;/li&gt;&lt;li&gt;Webmin (et ui on fait comme windows du graphique ...... )&lt;/li&gt;&lt;li&gt;DNS secondaire sur un Linux virtuel.&lt;/li&gt;&lt;li&gt;FTP (racine, user ...)&lt;/li&gt;&lt;li&gt;Samba (contrôleur primaire, quota, sécurité, user ...)&lt;/li&gt;&lt;li&gt;Iptables (firewall, log ...)&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold;"&gt;3éme jour durée 4 heures :&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Installation XP (la routine du matin)&lt;/li&gt;&lt;li&gt;Configuration switch Cisco (2960) et 3Com (4400) (VLAN, user, FD ...)&lt;/li&gt;&lt;li&gt;Routage stastique sur un routeur Cisco.&lt;/li&gt;&lt;li&gt;Routage dynamique (RIP2).&lt;/li&gt;&lt;li&gt;Installation de IIS sur XP (access liste, site web ...)&lt;/li&gt;&lt;li&gt;Création de VLAN sur les switch.&lt;/li&gt;&lt;li&gt;Routage statique et dynamique&lt;/li&gt;&lt;li&gt;Routage inter VLAN (avec un seul ports 802.1Q)&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4220755233477842379-2754711570896229898?l=john-bouday.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://john-bouday.blogspot.com/feeds/2754711570896229898/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4220755233477842379&amp;postID=2754711570896229898' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/2754711570896229898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4220755233477842379/posts/default/2754711570896229898'/><link rel='alternate' type='text/html' href='http://john-bouday.blogspot.com/2008/10/olympiades-des-metiers.html' title='Olympiades des metiers'/><author><name>John Bouday</name><uri>http://www.blogger.com/profile/16390048419725870067</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_G2MDq1c2FCo/SOOTGKHfZZI/AAAAAAAAASM/0Lonyt99yw8/s72-c/DSCN1818.JPG' height='72' width='72'/><thr:total>0</thr:total></entry></feed>
