{"id":6481,"date":"2014-09-06T03:21:29","date_gmt":"2014-09-06T03:21:29","guid":{"rendered":"https:\/\/dev.railscarma.com\/scraping-of-web-sites-using-mechanize-gem\/"},"modified":"2022-08-30T09:18:09","modified_gmt":"2022-08-30T09:18:09","slug":"scraping-de-sites-web-a-laide-de-mechanize-gem","status":"publish","type":"post","link":"https:\/\/www.railscarma.com\/fr\/blog\/articles-techniques\/scraping-de-sites-web-a-laide-de-mechanize-gem\/","title":{"rendered":"Scraping de sites Web \u00e0 l&#039;aide de Mechanize Gem"},"content":{"rendered":"<div data-elementor-type=\"wp-post\" data-elementor-id=\"6481\" class=\"elementor elementor-6481\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-1355bdca elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"1355bdca\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-65c23262\" data-id=\"65c23262\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-a27b432 elementor-widget elementor-widget-text-editor\" data-id=\"a27b432\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p style=\"text-align: justify;\" align=\"JUSTIFY\"><strong><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\">Scrapage Web<\/span><\/span><\/em><\/strong><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\"> (r\u00e9colte de donn\u00e9es Web ou extraction de donn\u00e9es Web) est une technique logicielle permettant d&#039;extraire des informations \u00e0 partir de sites Web. La biblioth\u00e8que Mechanize est utilis\u00e9e pour automatiser l&#039;interaction avec les sites Web. Mechanize gem stocke et envoie automatiquement des cookies, suit les redirections et peut suivre des liens et soumettre des formulaires. <\/span><\/span><\/span><\/em><\/p>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Les champs du formulaire peuvent \u00eatre remplis et soumis. Il conserve \u00e9galement une trace des sites que vous avez visit\u00e9s sous forme d&#039;historique. Il exploite Nokogiri pour analyser une page pour les formulaires et boutons pertinents et fournit une interface simplifi\u00e9e pour manipuler un formulaire Web.<\/span><\/span><\/span><\/em><\/p>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><a name=\"label-Dependencies\"><\/a><strong><span style=\"font-size: medium;\"><span style=\"font-style: normal;\">D\u00e9pendances<\/span><\/span><\/strong><\/p>\n<ul style=\"text-align: justify;\">\n<li>\n<p style=\"font-style: normal; font-weight: normal;\"><span style=\"font-size: medium;\">rubis 1.8.7, 1.9.2 ou 1.9.3<\/span><\/p>\n<\/li>\n<li>\n<p style=\"font-style: normal; font-weight: normal;\"><span style=\"font-size: medium;\">Nokogiri<\/span><\/p>\n<\/li>\n<\/ul>\n<p style=\"text-align: justify;\"><strong><span style=\"font-size: medium;\"><span style=\"font-style: normal;\">Premiers pas avec Mechanize\u00a0:<\/span><\/span><\/strong><\/p>\n<p style=\"text-align: justify;\"><strong><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Allons chercher une page\u00a0!<\/span><\/span><\/span><\/strong><\/p>\n<p style=\"font-style: normal; font-weight: normal; text-align: justify;\"><span style=\"font-size: medium;\">La premi\u00e8re chose est la premi\u00e8re. Assurez-vous que vous avez requis Mechanize et que vous instanciez un nouvel objet Mechanize\u00a0:<\/span><\/p>\n<pre class=\"western\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">n\u00e9cessite des \u00ab rubygems \u00bb<\/span><\/span><\/span><\/em>\n<span style=\"font-size: medium;\">n\u00e9cessiter \u00ab\u00a0m\u00e9caniser\u00a0\u00bb<\/span>\n\n<span style=\"font-size: medium;\">agent = M\u00e9caniser.nouveau<\/span>\n<em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Nous allons maintenant utiliser l&#039;agent que nous avons cr\u00e9\u00e9 pour r\u00e9cup\u00e9rer une page. Allons chercher Google avec notre agent de m\u00e9canisation\u00a0:<\/span><\/span><\/span><\/em>\n<em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">page = agent.get (&#039;<\/span><\/span><\/span><\/em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">http:\/\/google.com\/<\/span><\/span><\/span><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">')<\/span><\/span><\/span><\/em>\n\n<strong><a name=\"label-Finding+Links\"><\/a><span style=\"font-size: medium;\"><span style=\"font-style: normal;\">Trouver des liens<\/span><\/span><\/strong><\/pre>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Mechanize renvoie un objet page chaque fois que vous obtenez une page, publiez ou soumettez un formulaire. Lorsqu&#039;une page est r\u00e9cup\u00e9r\u00e9e, l&#039;agent analysera la page et mettra une liste de liens sur l&#039;objet page.<\/span><\/span><\/span><\/em><\/p>\n<p style=\"font-style: normal; font-weight: normal; text-align: justify;\"><span style=\"font-size: medium;\">Maintenant que nous avons r\u00e9cup\u00e9r\u00e9 la page d&#039;accueil de Google, essayons de lister tous les liens\u00a0:<\/span><\/p>\n<pre class=\"western\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">page.links.each fait |link|<\/span><\/span><\/span><\/em>\n  <span style=\"font-size: medium;\">met link.text<\/span>\n<span style=\"font-size: medium;\">fin<\/span><\/pre>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Nous pouvons lister les liens, mais Mechanize propose quelques raccourcis pour nous aider \u00e0 trouver un lien sur lequel cliquer. Disons que nous voulions cliquer sur le lien dont le texte est \u00ab Actualit\u00e9s \u00bb. Normalement, nous devrions faire ceci :<\/span><\/span><\/span><\/em><\/p>\n<pre class=\"western\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">page = agent.page.links.find { |l| l.text == &#039;Actualit\u00e9s&#039; }.cliquez (ou)<\/span><\/span><\/span><\/em>\n<em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">page = agent.page.link_with(:text =&gt; &#039;News&#039;).cliquez (ou)<\/span><\/span><\/span><\/em>\n<em><span style=\"font-family: Liberation Serif,serif;\"><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">agent.page.links_with(:text =&gt; &#039;Actualit\u00e9s&#039;)[1].cliquez (ou)<\/span><\/span><\/span><\/span><\/em>\n<em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">page.link_with(:href =&gt; &#039;\/quelque chose&#039;).<\/span><\/span><\/span><\/em>\n\n<a name=\"label-Filling+Out+Forms\"><\/a><strong><span style=\"font-size: medium;\"><span style=\"font-style: normal;\">Remplir des formulaires<\/span><\/span><\/strong><\/pre>\n<p style=\"font-style: normal; font-weight: normal; text-align: justify;\"><span style=\"font-size: medium;\">Continuons avec notre exemple Google. Voici le code que nous avons jusqu&#039;\u00e0 pr\u00e9sent\u00a0:<\/span><\/p>\n<pre class=\"western\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">n\u00e9cessite des \u00ab rubygems \u00bb<\/span><\/span><\/span><\/em>\n<span style=\"font-size: medium;\">n\u00e9cessiter \u00ab\u00a0m\u00e9caniser\u00a0\u00bb<\/span>\n\n<span style=\"font-size: medium;\">agent = M\u00e9caniser.nouveau<\/span>\n<span style=\"font-size: medium;\">page = agent.get(&#039;http:\/\/google.com\/&#039;)<\/span><\/pre>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Si nous imprimons joliment la page, nous pouvons voir qu&#039;il existe un formulaire nomm\u00e9 \u00ab f \u00bb, qui comporte quelques boutons et quelques champs\u00a0:<\/span><\/span><\/span><\/em><\/p>\n<pre class=\"western\" style=\"margin-bottom: 0.5cm;\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">page pp<\/span><\/span><\/span><\/em><\/pre>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Maintenant que nous connaissons le nom du formulaire, r\u00e9cup\u00e9rons-le sur la page\u00a0:<\/span><\/span><\/span><\/em><\/p>\n<pre class=\"western\" style=\"margin-bottom: 0.5cm;\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">google_form = page.form(&#039;f&#039;)<\/span><\/span><\/span><\/em><\/pre>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Jetons un coup d&#039;oeil au code tous ensemble\u00a0:<\/span><\/span><\/span><\/em><\/p>\n<pre class=\"western\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">n\u00e9cessite des \u00ab rubygems \u00bb<\/span><\/span><\/span><\/em>\n<span style=\"font-size: medium;\">n\u00e9cessiter \u00ab\u00a0m\u00e9caniser\u00a0\u00bb<\/span>\n\n<span style=\"font-size: medium;\">agent = M\u00e9caniser.nouveau<\/span>\n<span style=\"font-family: Liberation Serif,serif;\"><span style=\"font-size: medium;\">page = agent.get(&#039;http:\/\/google.com\/&#039;)<\/span><\/span>\n<span style=\"font-size: medium;\">google_form = page.form(&#039;f&#039;)<\/span>\n<span style=\"font-size: medium;\">google_form.q = &#039;ruby m\u00e9caniser&#039;<\/span>\n<span style=\"font-size: medium;\">page = agent.submit(google_form)<\/span>\n<span style=\"font-size: medium;\">page pp<\/span>\n\n<strong><a name=\"label-Scraping+Data\"><\/a><span style=\"font-size: medium;\"><span style=\"font-style: normal;\">Grattage de donn\u00e9es <\/span><\/span><\/strong>\n<em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">Mechanize utilise Nokogiri pour analyser le HTML. Qu&#039;est-ce que cela signifie pour toi? Vous pouvez traiter une page m\u00e9canis\u00e9e comme un objet nokogiri. Apr\u00e8s avoir utilis\u00e9 Mechanize pour acc\u00e9der \u00e0 la page que vous devez gratter, grattez-la en utilisant les m\u00e9thodes nokogiri\u00a0:<\/span><\/span><\/span><\/em>\n<em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">agent.get(&#039;http:\/\/someurl.com\/&#039;).search(&quot;p.posted&quot;)<\/span><\/span><\/span><\/em><\/pre>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">L&#039;expression donn\u00e9e \u00e0 Mechanize::Page#search peut \u00eatre une expression CSS ou une expression XPath\u00a0:<\/span><\/span><\/span><\/em><\/p>\n<pre class=\"western\" style=\"margin-bottom: 0.5cm;\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">agent.get(&#039;http:\/\/someurl.com\/&#039;).search(&quot;.\/\/p[@class=&#039;posted&#039;]&quot;)<\/span><\/span><\/span><\/em><\/pre>\n<p style=\"text-align: justify;\" align=\"JUSTIFY\"><em><span style=\"font-size: medium;\"><span style=\"font-style: normal;\"><span style=\"font-weight: normal;\">La biblioth\u00e8que Mechanize est utilis\u00e9e pour automatiser l&#039;interaction avec les sites Web. Mechanize gem stocke et envoie automatiquement des cookies, suit les redirections et peut suivre des liens et soumettre des formulaires. Les champs du formulaire peuvent \u00eatre remplis et soumis. Il garde \u00e9galement une trace des sites que vous avez visit\u00e9s sous forme d&#039;historique. exploite Nokogiri pour analyser une page pour les formulaires et boutons pertinents et fournit une interface simplifi\u00e9e pour manipuler un formulaire Web.<\/span><\/span><\/span><\/em><\/p>\n<p><strong><a href=\"https:\/\/www.railscarma.com\/fr\/\" target=\"_blank\" rel=\"noopener noreferrer\">RailsCarma<\/a><\/strong> a travaill\u00e9 sur le framework Ruby on Rails depuis ses d\u00e9buts et a g\u00e9r\u00e9 <strong>250 RoR<\/strong> <strong>projets<\/strong>. Avec une \u00e9quipe de plus de 100 d\u00e9veloppeurs RoR connaissant les derni\u00e8res techniques et outils, RailsCarma est bien plac\u00e9 pour vous aider avec tous vos besoins de d\u00e9veloppement.<\/p>\n<p>Nous serons heureux de vous aider avec vos questions. Utilisez notre <a href=\"\/fr\/contactez-nous\/\" target=\"_blank\" rel=\"noopener noreferrer\">Contactez-nous<\/a> page pour nous connecter.<\/p>\n<p>Lire les articles connexes :<\/p>\n<ul>\n<li><a href=\"\/fr\/blog\/rails-3\/api-rails-dintroduction\/\" target=\"_blank\" rel=\"noopener noreferrer\">Une introduction \u00e0 l&#039;API Rails<\/a><\/li>\n<li><a href=\"\/fr\/blog\/articles-techniques\/poodle-ssl-security-threat-explored\/\" target=\"_blank\" rel=\"noopener noreferrer\">Menace de s\u00e9curit\u00e9 Poodle SSL explor\u00e9e<\/a><\/li>\n<li><a href=\"\/fr\/blog\/articles-techniques\/un-moyen-simple-daugmenter-les-performances-de-votre-application-rails-2\/\" target=\"_blank\" rel=\"noopener noreferrer\">Un moyen simple d&#039;augmenter les performances de votre application Rails<\/a><\/li>\n<li><a href=\"\/fr\/blog\/articles-techniques\/mise-a-lechelle-des-applications-connexion-a-plusieurs-bases-de-donnees\/\" target=\"_blank\" rel=\"noopener noreferrer\">Mise \u00e0 l&#039;\u00e9chelle d&#039;une application avec plusieurs connexions \u00e0 des bases de donn\u00e9es<\/a><\/li>\n<\/ul>\n<p><a href=\"\/fr\/contactez-nous\/\">Prenez contact avec nous.<\/a><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-4f453ac elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"4f453ac\" data-element_type=\"section\" data-settings=\"{&quot;background_background&quot;:&quot;classic&quot;}\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-28473f67\" data-id=\"28473f67\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-7171339f elementor-widget elementor-widget-heading\" data-id=\"7171339f\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Abonnez-vous pour les derni\u00e8res mises \u00e0 jour<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e2e9b63 elementor-widget elementor-widget-shortcode\" data-id=\"e2e9b63\" data-element_type=\"widget\" data-widget_type=\"shortcode.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-shortcode\">\t\t\t\t\t<script type=\"text\/javascript\">\n\t\t\t\t\t\tvar gCaptchaSibWidget;\n                        var onloadSibCallbackInvisible = function () {\n\n                            var element = document.getElementsByClassName('sib-default-btn');\n                            var countInvisible = 0;\n                            var indexArray = [];\n                            jQuery('.sib-default-btn').each(function (index, el) {\n                                if ((jQuery(el).attr('id') == \"invisible\")) {\n                                    indexArray[countInvisible] = index;\n                                    countInvisible++\n                                }\n                            });\n\n                            jQuery('.invi-recaptcha').each(function (index, el) {\n                                grecaptcha.render(element[indexArray[index]], {\n                                    'sitekey': jQuery(el).attr('data-sitekey'),\n                                    'callback': sibVerifyCallback,\n                                });\n                            });\n                        };\n\t\t\t\t\t<\/script>\n\t\t\t\t\t                <script src=\"https:\/\/www.google.com\/recaptcha\/api.js?onload=onloadSibCallbackInvisible&render=explicit\" async defer><\/script>\n\t\t\t\t\n\t\t\t<form id=\"sib_signup_form_1\" method=\"post\" class=\"sib_signup_form\" action=\"\">\n\t\t\t\t<div class=\"sib_loader\" style=\"display:none;\"><img\n\t\t\t\t\t\t\tsrc=\"https:\/\/www.railscarma.com\/wp-includes\/images\/spinner.gif\" alt=\"chargeur\"><\/div>\n\t\t\t\t<input type=\"hidden\" name=\"sib_form_action\" value=\"subscribe_form_submit\">\n\t\t\t\t<input type=\"hidden\" name=\"sib_form_id\" value=\"1\">\n                <input type=\"hidden\" name=\"sib_form_alert_notice\" value=\"Please fill out this field\">\n                <input type=\"hidden\" name=\"sib_form_invalid_email_notice\" value=\"Your email address is invalid\">\n                <input type=\"hidden\" name=\"sib_security\" value=\"d7f7626ab9\">\n\t\t\t\t<div class=\"sib_signup_box_inside_1\">\n\t\t\t\t\t<div style=\"\/*display:none*\/\" class=\"sib_msg_disp\">\n\t\t\t\t\t<\/div>\n                                            <div id=\"sib_captcha_invisible\" class=\"invi-recaptcha\" data-sitekey=\"6LdikOAaAAAAAJ6SWrrKVQrtw7TQpQAEnv0HS0G3\"><\/div>\n                    \t\t\t\t\t<p class=\"sib-email-area\">\r\n    <label class=\"sib-email-area\"><\/label>\r\n    <input type=\"email\" class=\"sib-email-area\" name=\"email\" required=\"required\" placeholder=\"Adresse e-mail\">\r\n<\/p>\r\n<p class=\"sib-NAME-area\">\r\n    <label class=\"sib-NAME-area\"><\/label>\r\n    <input type=\"text\" class=\"sib-NAME-area\" name=\"NAME\" placeholder=\"Nom\">\r\n<\/p>\r\n<p>\r\n    <input type=\"submit\" id=\"invisible\" class=\"sib-default-btn\" value=\"S&#039;abonner\">\r\n<\/p>\t\t\t\t<\/div>\n\t\t\t<input type=\"hidden\" name=\"trp-form-language\" value=\"fr\"\/><\/form>\n\t\t\t<style>\n\t\t\t\tform#sib_signup_form_1 p.sib-alert-message {\n    padding: 6px 12px;\n    margin-bottom: 20px;\n    border: 1px solid transparent;\n    border-radius: 4px;\n    -webkit-box-sizing: border-box;\n    -moz-box-sizing: border-box;\n    box-sizing: border-box;\n}\nform#sib_signup_form_1 p.sib-alert-message-error {\n    background-color: #f2dede;\n    border-color: #ebccd1;\n    color: #a94442;\n}\nform#sib_signup_form_1 p.sib-alert-message-success {\n    background-color: #dff0d8;\n    border-color: #d6e9c6;\n    color: #3c763d;\n}\nform#sib_signup_form_1 p.sib-alert-message-warning {\n    background-color: #fcf8e3;\n    border-color: #faebcc;\n    color: #8a6d3b;\n}\n\t\t\t<\/style>\n\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t  <div class=\"related-post slider\">\r\n        <div class=\"headline\">Articles Similaires<\/div>\r\n    <div class=\"post-list owl-carousel\">\r\n\r\n            <div class=\"item\">\r\n            <div class=\"thumb post_thumb\">\r\n    <a  title=\"Gemme de Kaminari\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/articles-techniques\/joyau-kaminari\/?related_post_from=37277\">\r\n\r\n      <img decoding=\"async\" width=\"800\" height=\"300\" src=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2023\/04\/kaminari-gem.jpg\" class=\"attachment-full size-full wp-post-image\" alt=\"joyau kaminari\" srcset=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2023\/04\/kaminari-gem.jpg 800w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2023\/04\/kaminari-gem-300x113.jpg 300w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2023\/04\/kaminari-gem-768x288.jpg 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/>\r\n\r\n    <\/a>\r\n  <\/div>\r\n\r\n  <a class=\"title post_title\"  title=\"Gemme de Kaminari\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/articles-techniques\/joyau-kaminari\/?related_post_from=37277\">\r\n        Gemme de Kaminari  <\/a>\r\n\r\n        <\/div>\r\n              <div class=\"item\">\r\n            <div class=\"thumb post_thumb\">\r\n    <a  title=\"Pourquoi engager des d\u00e9veloppeurs Ruby on Rails en 2026 ?\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/ror\/pourquoi-embaucher-des-developpeurs-ruby-on-rails\/?related_post_from=30627\">\r\n\r\n      <img decoding=\"async\" width=\"800\" height=\"300\" src=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2019\/01\/why-to-hire-ruby-on-rails-developers-in-2022.jpg\" class=\"attachment-full size-full wp-post-image\" alt=\"pourquoi embaucher des d\u00e9veloppeurs Ruby on Rails en 2022\" srcset=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2019\/01\/why-to-hire-ruby-on-rails-developers-in-2022.jpg 800w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2019\/01\/why-to-hire-ruby-on-rails-developers-in-2022-300x113.jpg 300w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2019\/01\/why-to-hire-ruby-on-rails-developers-in-2022-768x288.jpg 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/>\r\n\r\n    <\/a>\r\n  <\/div>\r\n\r\n  <a class=\"title post_title\"  title=\"Pourquoi engager des d\u00e9veloppeurs Ruby on Rails en 2026 ?\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/ror\/pourquoi-embaucher-des-developpeurs-ruby-on-rails\/?related_post_from=30627\">\r\n        Pourquoi engager des d\u00e9veloppeurs Ruby on Rails en 2026 ?  <\/a>\r\n\r\n        <\/div>\r\n              <div class=\"item\">\r\n            <div class=\"thumb post_thumb\">\r\n    <a  title=\"Importance de l&#039;architecture logicielle dans le d\u00e9veloppement de logiciels d&#039;entreprise\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/ror\/importance-de-larchitecture-logicielle-dans-le-developpement-de-logiciels-dentreprise\/?related_post_from=36250\">\r\n\r\n      <img decoding=\"async\" width=\"800\" height=\"300\" src=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/06\/Importance-of-Software-Architecture-in-enterprise-software-development.jpg\" class=\"attachment-full size-full wp-post-image\" alt=\"Importance de l&#039;architecture logicielle dans le d\u00e9veloppement de logiciels d&#039;entreprise\" srcset=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/06\/Importance-of-Software-Architecture-in-enterprise-software-development.jpg 800w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/06\/Importance-of-Software-Architecture-in-enterprise-software-development-300x113.jpg 300w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/06\/Importance-of-Software-Architecture-in-enterprise-software-development-768x288.jpg 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/>\r\n\r\n    <\/a>\r\n  <\/div>\r\n\r\n  <a class=\"title post_title\"  title=\"Importance de l&#039;architecture logicielle dans le d\u00e9veloppement de logiciels d&#039;entreprise\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/ror\/importance-de-larchitecture-logicielle-dans-le-developpement-de-logiciels-dentreprise\/?related_post_from=36250\">\r\n        Importance de l&#039;architecture logicielle dans le d\u00e9veloppement de logiciels d&#039;entreprise  <\/a>\r\n\r\n        <\/div>\r\n              <div class=\"item\">\r\n            <div class=\"thumb post_thumb\">\r\n    <a  title=\"Ruby IDE\u00a0: les meilleurs IDE pour le d\u00e9veloppement Ruby on Rails\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/ror\/ruby-ide-les-meilleures-idees-pour-le-developpement-de-ruby-on-rails\/?related_post_from=36125\">\r\n\r\n      <img decoding=\"async\" width=\"800\" height=\"300\" src=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/01\/BEST-IDES-FOR-RUBY-ON-RAILS-DEVELOPMENT.jpg\" class=\"attachment-full size-full wp-post-image\" alt=\"MEILLEURES ID\u00c9ES POUR LE D\u00c9VELOPPEMENT DE RUBY ON RAILS\" srcset=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/01\/BEST-IDES-FOR-RUBY-ON-RAILS-DEVELOPMENT.jpg 800w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/01\/BEST-IDES-FOR-RUBY-ON-RAILS-DEVELOPMENT-300x113.jpg 300w, https:\/\/www.railscarma.com\/wp-content\/uploads\/2022\/01\/BEST-IDES-FOR-RUBY-ON-RAILS-DEVELOPMENT-768x288.jpg 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/>\r\n\r\n    <\/a>\r\n  <\/div>\r\n\r\n  <a class=\"title post_title\"  title=\"Ruby IDE\u00a0: les meilleurs IDE pour le d\u00e9veloppement Ruby on Rails\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/ror\/ruby-ide-les-meilleures-idees-pour-le-developpement-de-ruby-on-rails\/?related_post_from=36125\">\r\n        Ruby IDE\u00a0: les meilleurs IDE pour le d\u00e9veloppement Ruby on Rails  <\/a>\r\n\r\n        <\/div>\r\n      \r\n  <\/div>\r\n\r\n  <script>\r\n      <\/script>\r\n  <style>\r\n    .related-post {}\r\n\r\n    .related-post .post-list {\r\n      text-align: left;\r\n          }\r\n\r\n    .related-post .post-list .item {\r\n      margin: 10px;\r\n      padding: 10px;\r\n          }\r\n\r\n    .related-post .headline {\r\n      font-size: 14px !important;\r\n      color: #999999 !important;\r\n          }\r\n\r\n    .related-post .post-list .item .post_thumb {\r\n      max-height: 220px;\r\n      margin: 10px 0px;\r\n      padding: 0px;\r\n      display: block;\r\n          }\r\n\r\n    .related-post .post-list .item .post_title {\r\n      font-size: 14px;\r\n      color: #000000;\r\n      margin: 10px 0px;\r\n      padding: 0px;\r\n      display: block;\r\n      text-decoration: none;\r\n          }\r\n\r\n    .related-post .post-list .item .post_excerpt {\r\n      font-size: 12px;\r\n      color: #3f3f3f;\r\n      margin: 10px 0px;\r\n      padding: 0px;\r\n      display: block;\r\n      text-decoration: none;\r\n          }\r\n\r\n    .related-post .owl-dots .owl-dot {\r\n          }\r\n\r\n      <\/style>\r\n      <script>\r\n      jQuery(document).ready(function($) {\r\n        $(\".related-post .post-list\").owlCarousel({\r\n          items: 2,\r\n          responsiveClass: true,\r\n          responsive: {\r\n            0: {\r\n              items: 1,\r\n            },\r\n            768: {\r\n              items: 2,\r\n            },\r\n            1200: {\r\n              items: 2,\r\n            }\r\n          },\r\n                      rewind: true,\r\n                                loop: true,\r\n                                center: false,\r\n                                autoplay: true,\r\n            autoplayHoverPause: true,\r\n                                nav: true,\r\n            navSpeed: 1000,\r\n            navText: ['<i class=\"fas fa-chevron-left\"><\/i>', '<i class=\"fas fa-chevron-right\"><\/i>'],\r\n                                dots: false,\r\n            dotsSpeed: 1200,\r\n                                                    rtl: false,\r\n          \r\n        });\r\n      });\r\n    <\/script>\r\n  <\/div>","protected":false},"excerpt":{"rendered":"<p>Web scraping (web harvesting or web data extraction) is a computer software technique of extracting information from websites. The Mechanize library is used for automating interaction with websites. Mechanize gem automatically stores and sends cookies, follows redirects, and can follow links and submit forms. Form fields can be populated and submitted. It also keeps track &hellip;<\/p>\n<p class=\"read-more\"> <a class=\"\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/ruby-regex-match-guide-with-examples\/\"> <span class=\"screen-reader-text\">Guide de correspondance des expressions rationnelles en Ruby (2026) avec exemples<\/span> Lire la suite \u00bb<\/a><\/p>","protected":false},"author":1,"featured_media":32090,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[384],"tags":[572,573,574,575,576],"class_list":["post-6481","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technical-articles","tag-gem","tag-mechanize","tag-mechanize-gem","tag-web-harvesting","tag-web-scraping"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Scraping of Websites using Mechanize Gem - RailsCarma Blog<\/title>\n<meta name=\"description\" content=\"Web scraping is a computer software technique of extracting information from websites. Mechanize gem leverages Nokogiri to parse a page.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.railscarma.com\/fr\/blog\/articles-techniques\/scraping-de-sites-web-a-laide-de-mechanize-gem\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Scraping of Websites using Mechanize Gem - RailsCarma Blog\" \/>\n<meta property=\"og:description\" content=\"Web scraping is a computer software technique of extracting information from websites. Mechanize gem leverages Nokogiri to parse a page.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.railscarma.com\/fr\/blog\/articles-techniques\/scraping-de-sites-web-a-laide-de-mechanize-gem\/\" \/>\n<meta property=\"og:site_name\" content=\"RailsCarma - Ruby on Rails Development Company specializing in Offshore Development\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/RailsCarma\/\" \/>\n<meta property=\"article:published_time\" content=\"2014-09-06T03:21:29+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2022-08-30T09:18:09+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"800\" \/>\n\t<meta property=\"og:image:height\" content=\"300\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@railscarma\" \/>\n<meta name=\"twitter:site\" content=\"@railscarma\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/\"},\"author\":{\"name\":\"admin\",\"@id\":\"https:\/\/www.railscarma.com\/#\/schema\/person\/5f2228a2dec7549056e709de6eb85d21\"},\"headline\":\"Scraping of Websites using Mechanize Gem\",\"datePublished\":\"2014-09-06T03:21:29+00:00\",\"dateModified\":\"2022-08-30T09:18:09+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/\"},\"wordCount\":469,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.railscarma.com\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg\",\"keywords\":[\"gem\",\"Mechanize\",\"Mechanize gem\",\"web harvesting\",\"Web scraping\"],\"articleSection\":[\"Technical Articles\"],\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/\",\"url\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/\",\"name\":\"Scraping of Websites using Mechanize Gem - RailsCarma Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.railscarma.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg\",\"datePublished\":\"2014-09-06T03:21:29+00:00\",\"dateModified\":\"2022-08-30T09:18:09+00:00\",\"description\":\"Web scraping is a computer software technique of extracting information from websites. Mechanize gem leverages Nokogiri to parse a page.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage\",\"url\":\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg\",\"contentUrl\":\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg\",\"width\":800,\"height\":300},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.railscarma.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Scraping of Websites using Mechanize Gem\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.railscarma.com\/#website\",\"url\":\"https:\/\/www.railscarma.com\/\",\"name\":\"RailsCarma - Ruby on Rails Development Company specializing in Offshore Development\",\"description\":\"RailsCarma is a Ruby on Rails Development Company in Bangalore. We specialize in Offshore Ruby on Rails Development based out in USA and India. Hire experienced Ruby on Rails developers for the ultimate Web Experience.\",\"publisher\":{\"@id\":\"https:\/\/www.railscarma.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.railscarma.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.railscarma.com\/#organization\",\"name\":\"RailsCarma\",\"url\":\"https:\/\/www.railscarma.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/www.railscarma.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2020\/08\/railscarma_logo.png\",\"contentUrl\":\"https:\/\/www.railscarma.com\/wp-content\/uploads\/2020\/08\/railscarma_logo.png\",\"width\":200,\"height\":46,\"caption\":\"RailsCarma\"},\"image\":{\"@id\":\"https:\/\/www.railscarma.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/RailsCarma\/\",\"https:\/\/x.com\/railscarma\",\"https:\/\/www.linkedin.com\/company\/railscarma\/\",\"https:\/\/myspace.com\/railscarma\",\"https:\/\/in.pinterest.com\/railscarma\/\",\"https:\/\/www.youtube.com\/channel\/UCx3Wil-aAnDARuatTEyMdpg\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.railscarma.com\/#\/schema\/person\/5f2228a2dec7549056e709de6eb85d21\",\"name\":\"admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/www.railscarma.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/308867ca6c81f3aba146080c601000087180326f752c4116849ea9f514c6a4fa?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/308867ca6c81f3aba146080c601000087180326f752c4116849ea9f514c6a4fa?s=96&d=mm&r=g\",\"caption\":\"admin\"},\"sameAs\":[\"https:\/\/www.railscarma.com\/hire-ruby-on-rails-developer\/\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Scraping de sites Web \u00e0 l&#039;aide de Mechanize Gem - RailsCarma Blog","description":"Le Web scraping est une technique logicielle permettant d&#039;extraire des informations \u00e0 partir de sites Web. Mechanize Gem exploite Nokogiri pour analyser une page.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.railscarma.com\/fr\/blog\/articles-techniques\/scraping-de-sites-web-a-laide-de-mechanize-gem\/","og_locale":"fr_FR","og_type":"article","og_title":"Scraping of Websites using Mechanize Gem - RailsCarma Blog","og_description":"Web scraping is a computer software technique of extracting information from websites. Mechanize gem leverages Nokogiri to parse a page.","og_url":"https:\/\/www.railscarma.com\/fr\/blog\/articles-techniques\/scraping-de-sites-web-a-laide-de-mechanize-gem\/","og_site_name":"RailsCarma - Ruby on Rails Development Company specializing in Offshore Development","article_publisher":"https:\/\/www.facebook.com\/RailsCarma\/","article_published_time":"2014-09-06T03:21:29+00:00","article_modified_time":"2022-08-30T09:18:09+00:00","og_image":[{"width":800,"height":300,"url":"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg","type":"image\/jpeg"}],"author":"admin","twitter_card":"summary_large_image","twitter_creator":"@railscarma","twitter_site":"@railscarma","twitter_misc":{"\u00c9crit par":"admin","Dur\u00e9e de lecture estim\u00e9e":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#article","isPartOf":{"@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/"},"author":{"name":"admin","@id":"https:\/\/www.railscarma.com\/#\/schema\/person\/5f2228a2dec7549056e709de6eb85d21"},"headline":"Scraping of Websites using Mechanize Gem","datePublished":"2014-09-06T03:21:29+00:00","dateModified":"2022-08-30T09:18:09+00:00","mainEntityOfPage":{"@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/"},"wordCount":469,"commentCount":0,"publisher":{"@id":"https:\/\/www.railscarma.com\/#organization"},"image":{"@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage"},"thumbnailUrl":"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg","keywords":["gem","Mechanize","Mechanize gem","web harvesting","Web scraping"],"articleSection":["Technical Articles"],"inLanguage":"fr-FR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/","url":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/","name":"Scraping de sites Web \u00e0 l&#039;aide de Mechanize Gem - RailsCarma Blog","isPartOf":{"@id":"https:\/\/www.railscarma.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage"},"image":{"@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage"},"thumbnailUrl":"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg","datePublished":"2014-09-06T03:21:29+00:00","dateModified":"2022-08-30T09:18:09+00:00","description":"Le Web scraping est une technique logicielle permettant d&#039;extraire des informations \u00e0 partir de sites Web. Mechanize Gem exploite Nokogiri pour analyser une page.","breadcrumb":{"@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#primaryimage","url":"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg","contentUrl":"https:\/\/www.railscarma.com\/wp-content\/uploads\/2014\/09\/web-scraping-1.jpg","width":800,"height":300},{"@type":"BreadcrumbList","@id":"https:\/\/www.railscarma.com\/blog\/technical-articles\/scraping-of-web-sites-using-mechanize-gem\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.railscarma.com\/"},{"@type":"ListItem","position":2,"name":"Scraping of Websites using Mechanize Gem"}]},{"@type":"WebSite","@id":"https:\/\/www.railscarma.com\/#website","url":"https:\/\/www.railscarma.com\/","name":"RailsCarma - Soci\u00e9t\u00e9 de d\u00e9veloppement Ruby on Rails sp\u00e9cialis\u00e9e dans le d\u00e9veloppement offshore","description":"RailsCarma est une soci\u00e9t\u00e9 de d\u00e9veloppement Ruby on Rails \u00e0 Bangalore. Nous sommes sp\u00e9cialis\u00e9s dans le d\u00e9veloppement offshore Ruby on Rails, bas\u00e9s aux \u00c9tats-Unis et en Inde. Embauchez des d\u00e9veloppeurs Ruby on Rails exp\u00e9riment\u00e9s pour une exp\u00e9rience Web ultime.","publisher":{"@id":"https:\/\/www.railscarma.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.railscarma.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/www.railscarma.com\/#organization","name":"RailsCarma","url":"https:\/\/www.railscarma.com\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/www.railscarma.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.railscarma.com\/wp-content\/uploads\/2020\/08\/railscarma_logo.png","contentUrl":"https:\/\/www.railscarma.com\/wp-content\/uploads\/2020\/08\/railscarma_logo.png","width":200,"height":46,"caption":"RailsCarma"},"image":{"@id":"https:\/\/www.railscarma.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/RailsCarma\/","https:\/\/x.com\/railscarma","https:\/\/www.linkedin.com\/company\/railscarma\/","https:\/\/myspace.com\/railscarma","https:\/\/in.pinterest.com\/railscarma\/","https:\/\/www.youtube.com\/channel\/UCx3Wil-aAnDARuatTEyMdpg"]},{"@type":"Person","@id":"https:\/\/www.railscarma.com\/#\/schema\/person\/5f2228a2dec7549056e709de6eb85d21","name":"administrateur","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/www.railscarma.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/308867ca6c81f3aba146080c601000087180326f752c4116849ea9f514c6a4fa?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/308867ca6c81f3aba146080c601000087180326f752c4116849ea9f514c6a4fa?s=96&d=mm&r=g","caption":"admin"},"sameAs":["https:\/\/www.railscarma.com\/hire-ruby-on-rails-developer\/"]}]}},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/posts\/6481","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/comments?post=6481"}],"version-history":[{"count":0,"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/posts\/6481\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/media\/32090"}],"wp:attachment":[{"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/media?parent=6481"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/categories?post=6481"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.railscarma.com\/fr\/wp-json\/wp\/v2\/tags?post=6481"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}