NAME

URI::Title - get the titles of things on the web in a sensible way

VERSION

version 1.904

SYNOPSIS

  use URI::Title qw( title );
  my $title = title('http://microsoft.com');
  print "Title is $title\n";

DESCRIPTION

I keep having to find the title of things on the web. This seems like a really simple request, just get() the object, parse for a title tag, you're done. Ha, I wish. There are several problems with this approach:

What if the resource is on a very slow server? Do we wait for ever or what?
What if the resource is a 900 gig file? You don't want to download that.
What if the page title isn't in a title tag, but is buried in the HTML somewhere?
What if the resource is an MP3 file, or a word document or something?
...

So, let's solve these issues once.

METHODS

only one, the title(url) method. Call it with an url, get the title if possible, undef if it wasn't. Very simple.

SEE ALSO

WWW::GetPageTitle - similar this module, but just handles web pages. The author of that module suggests you should use URI::Title.

NOTES

Embedded title metadata of png files can be extracted if you have installed either Image::ExifTool or Image::PNG::Libpng.

TODO

Many, many, many things. Still unimplemented:

Get titles of MP3 files, Word Docs, PDFs, etc.
Configurable.. well, anything, in fact. Timeout would be a good start.
Better error reporting.

AUTHORS

Tom Insam <tom@jerakeen.org>, original author, 2004-2012.

Philippe Bruhat (BooK) <book@cpan.org>, maintainer, 2014.

COPYRIGHT AND LICENSE

This software is copyright (c) 2004 Tom Insam.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

CREDITS

Invented because of a conversation with rjp, who contributed some eyeball-melting and as-yet-unused code to get titles from MP3s and PDFs, and hex, who has also solved the problem, and got bits done in a nicer way than I did.