Extract links using LibXML

Here’s a small example of using Perl’s XML::LibXML to extract links from an HTML page.

#!/usr/bin/perl
use strict;
use warnings;
use URI::Escape qw(uri_escape_utf8);
use XML::LibXML;

# Page listing the films, and the search URL the film title is appended to.
my $url = "http://fantasyfilmfest.com/pages/filme.html";
my $fw = "http://www.freshwap.com/index.php?do=search&subaction=search&full_search=1&catlist[]=5&titleonly=3&story=";

my $p = XML::LibXML->new();
# Real-world HTML is rarely well-formed: recover from errors and keep quiet about them.
my %opts = (
    suppress_errors => 1,
    recover => 1,
);
my $dom = $p->parse_html_file($url, \%opts);
my $root = $dom->getDocumentElement;

# Each film title is the link text inside a <div class="FilmREITER">.
foreach my $node ($root->findnodes("//div[\@class='FilmREITER']")) {
    my $title = $node->findvalue('a');
    # Move a trailing article to the front: "Foo, The" becomes "the Foo".
    $title =~ s/(.*), the/the $1/i;
    print $title . "\n";
    # Escape the title for the query string; eval catches a failed fetch or parse.
    my $result = eval { $p->parse_html_file($fw . uri_escape_utf8($title), \%opts) };
    next if !defined($result);
    # Print the target of every search hit.
    foreach my $link ($result->getDocumentElement->findnodes("//div[\@class='title']/a")) {
        print $link->getAttribute('href')."\n";
    }
    print "--------------\n";
}
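
To try the XPath part without touching the network, the same idea can be run against an HTML string. This is only a minimal sketch, assuming XML::LibXML's load_html; the extract_links helper and the sample markup are made up for illustration and are not part of the script above.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical helper (not from the original script): return the href of
# every <a> that is a direct child of a <div> with the given class attribute.
sub extract_links {
    my ($html, $class) = @_;
    my $dom = XML::LibXML->load_html(
        string          => $html,
        recover         => 1,    # tolerate broken markup
        suppress_errors => 1,    # do not print parser complaints
    );
    # The class name is interpolated into the XPath, so it must not contain quotes.
    return map { $_->getAttribute('href') }
           $dom->findnodes("//div[\@class='${class}']/a[\@href]");
}

# Made-up sample markup for illustration.
my $sample = <<'HTML';
<html><body>
<div class="title"><a href="http://example.com/one">One</a></div>
<div class="other"><a href="http://example.com/skip">Skip</a></div>
<div class="title"><a href="http://example.com/two">Two</a></div>
</body></html>
HTML

print "$_\n" for extract_links($sample, 'title');
```

Only the links inside the div elements whose class is exactly "title" are printed; the one under class="other" is skipped, just as the script above ignores everything outside the matching div elements.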
