NAME

StreamFinder::Blogger - Fetch actual raw streamable URLs from Blogger / Blogspot videos.

AUTHOR

This module is Copyright (C) 2017-2021 by

Jim Turner, <turnerjw784 at yahoo.com>

Email: turnerjw784@yahoo.com

All rights reserved.

You may distribute this module under the terms of either the GNU General Public License or the Artistic License, as specified in the Perl README file.

SYNOPSIS

        #!/usr/bin/perl

        use strict;

        use StreamFinder::Blogger;

        die "..usage:  $0 URL\n"  unless ($ARGV[0]);

        my $video = StreamFinder::Blogger->new($ARGV[0]);

        die "Invalid URL or no streams found!\n"  unless ($video);

        my $firstStream = $video->get();

        print "First Stream URL=$firstStream\n";

        my $url = $video->getURL();

        print "Stream URL=$url\n";

        my $videoTitle = $video->getTitle();
        
        print "Title=$videoTitle\n";
        
        my $videoDescription = $video->getTitle('desc');
        
        print "Description=$videoDescription\n";
        
        my $videoID = $video->getID();

        print "Video ID=$videoID\n";
        
        my $artist = $video->{'artist'};

        print "Artist=$artist\n"  if ($artist);
        
        my $albumartist = $video->{'albumartist'};

        print "Album Artist=$albumartist\n"  if ($albumartist);
        
        my $icon_url = $video->getIconURL();

        if ($icon_url) {   #SAVE THE ICON TO A TEMP. FILE:

                my ($image_ext, $icon_image) = $video->getIconData();

                if ($icon_image && open(my $imgout, '>', "/tmp/${videoID}.$image_ext")) {

                        binmode $imgout;

                        print $imgout $icon_image;

                        close $imgout;

                }

        }

        my $stream_count = $video->count();

        print "--Stream count=$stream_count=\n";

        my @streams = $video->get();

        foreach my $s (@streams) {

                print "------ stream URL=$s=\n";

        }

DESCRIPTION

StreamFinder::Blogger accepts a valid full Blogger video URL on blogger.com and returns the actual stream URL, ID, and cover art icon for that video. With this URL, one can stream the video in the media player of one's choice rather than in a web browser, avoiding any / all flash, ads, javascript, cookies, trackers, web-bugs, and other crapware that can come with that method of play. The author uses his own custom all-purpose media player called "Fauxdacious" (his custom hacked version of the open-source "Audacious" audio player), which incorporates this module to decode and play blogger.com videos. This is a submodule of the general StreamFinder module.

Depends:

URI::Escape, HTML::Entities, LWP::UserAgent, and the separate application program: youtube-dl.

SUBROUTINES/METHODS

new(url [, -youtube => (yes)|no|only ] [, -keep => "type1,type2,..." | [type1, type2, ...] ] [, "debug" [ => 0|(1)|2 ]])

Accepts a blogger.com video URL and creates and returns a new video object, or undef if the URL is not a valid Blogger video or no streams are found. The URL can be the full URL, i.e. https://www.blogger.com/video-id, or just video-id.

The optional keep argument can be either a comma-separated string or an array reference ([...]) of stream types to keep (include); matching streams are returned in the order specified (type1, type2...). Each "type" can be one of: an extension (e.g. m4a, mp4, etc.), "playlist", "stream", or ("any" or "all").

The DEFAULT keep list is: 'm4a,mpd,stream,all', meaning that all m4a streams are returned first, followed by all "mpd" streams, followed by non-playlists, followed by all remaining (playlists: m3u8,pls) streams. More than one value can be specified to control the order of search.

NOTE: keep is ignored if youtube is set to "only".

The optional youtube argument can be set to "yes" - also include streams youtube-dl finds, "no" - only include streams embedded in the video's blogger.com page, or "only" - only include streams youtube-dl finds. Default is "yes". This is needed because the streams currently embedded in the page have drawbacks: the mpd stream plays best but is unseekable, and the m3u8 (HLS) stream doesn't seem to work well. youtube-dl also returns a "chunky" m3u8 (HLS) stream that is seekable and seems to work OK.
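The ordering that a keep list describes can be illustrated with a small standalone sketch. This is not the module's own code: order_by_keep() is a hypothetical helper, the example.com URLs are made up, and treating "playlist" as any URL ending in .pls, .m3u, or .m3u8 is an assumption drawn from the description above.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of StreamFinder::Blogger): return the URLs that
# match the keep list, ordered type by type; URLs matching no type are dropped.
sub order_by_keep {
    my ($keep, @urls) = @_;

    # Accept either a comma-separated string or an array reference.
    my @types = grep { length } map { lc }
        (ref $keep eq 'ARRAY' ? @$keep : split /\s*,\s*/, $keep);

    my (@ordered, %taken);
    for my $type (@types) {
        for my $url (@urls) {
            next if $taken{$url};

            # Extension: the word after the last dot, ignoring any query string.
            my ($ext) = $url =~ m{\.(\w+)(?:[?#].*)?$};
            $ext = lc($ext // '');

            # Assumption: playlists are recognized by extension only.
            my $is_playlist = $ext =~ /^(?:pls|m3u8?)$/ ? 1 : 0;

            my $match = $type eq 'any' || $type eq 'all' ? 1
                      : $type eq 'playlist'              ? $is_playlist
                      : $type eq 'stream'                ? !$is_playlist
                      :                                    $ext eq $type;
            next unless $match;

            push @ordered, $url;
            $taken{$url} = 1;
        }
    }
    return @ordered;
}

# Made-up URLs, ordered with the documented default keep list.
my @found = (
    'https://example.com/v/list.m3u8',
    'https://example.com/v/clip.mp4',
    'https://example.com/v/audio.m4a',
    'https://example.com/v/manifest.mpd',
);
print "$_\n" for order_by_keep('m4a,mpd,stream,all', @found);
```

With the default list, the m4a URL comes first, then the mpd URL, then the remaining non-playlist (mp4), and finally the m3u8 playlist.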

$video->get()

Returns an array of strings representing all stream URLs found.

$video->getURL([options])

Similar to get(), except that it returns only a single stream: the first valid stream found.

Current options are: "random", "nopls", and "noplaylists". By default, the first ("best"?) stream is returned. If "random" is specified, then a random one is selected from the list of streams found. If "nopls" is specified, and the stream to be returned is a ".pls" playlist, it is first fetched and the first entry (or a random entry if "random" is specified) is returned. This is needed by Fauxdacious Mediaplayer. If "noplaylists" is specified, and the stream to be returned is a "playlist" (either .pls or .m3u? extension), it is first fetched and the first entry (or a random entry if "random" is specified) in the playlist is returned.
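As a rough illustration of what resolving a ".pls" playlist to a single entry involves, here is a minimal sketch that works on playlist text already in memory. The function names pls_entries() and first_or_random_entry() are hypothetical, the sample playlist is made up, and the module's actual fetching and parsing may well differ.

```perl
use strict;
use warnings;

# Hypothetical helper: collect the FileN= entries from .pls playlist text.
sub pls_entries {
    my ($pls_text) = @_;
    my @entries;
    for my $line (split /\r?\n/, $pls_text) {
        push @entries, $1 if $line =~ /^\s*File\d+\s*=\s*(\S.*?)\s*$/i;
    }
    return @entries;
}

# Hypothetical helper: return the first entry, or a random one if $random is
# true; returns undef when the playlist contains no entries.
sub first_or_random_entry {
    my ($pls_text, $random) = @_;
    my @entries = pls_entries($pls_text);
    return unless @entries;
    return $random ? $entries[ int rand scalar @entries ] : $entries[0];
}

# Made-up playlist content for illustration.
my $sample = join "\n",
    '[playlist]',
    'NumberOfEntries=2',
    'File1=http://example.com/one.mp3',
    'Title1=First entry',
    'File2=http://example.com/two.mp3',
    'Title2=Second entry',
    '';

print "First entry:  ", first_or_random_entry($sample), "\n";
print "Random entry: ", first_or_random_entry($sample, 1), "\n";
```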

$video->count()

Returns the number of streams found for the video.

$video->getID()

Returns the video's Blogger ID (alphanumeric).

$video->getTitle(['desc'])

Returns the video's title, or its long description if 'desc' is specified.

$video->getIconURL()

Returns the URL for the video's "cover art" icon image, if any.

$video->getIconData()

Returns a two-element array consisting of the extension (e.g. "png", "gif", "jpeg", etc.) and the actual icon image (binary data), if any.

$video->getImageURL()

Returns the URL for the video's "cover art" banner image, which for Blogger videos is always the icon image, as Blogger does not support a separate banner image at this time.

$video->getImageData()

Returns a two-element array consisting of the extension (e.g. "png", "gif", "jpeg", etc.) and the video's actual banner image (binary data).

$video->getType()

Returns the video's type ("Blogger").

CONFIGURATION FILES

~/.config/StreamFinder/Blogger/config

Optional text file for specifying various configuration options for a specific site (submodule). Each option is specified on a separate line in the format below:

'option' => 'value' [,]

and the options are loaded into a hash used only by the specific (submodule) specified. Valid options include -debug => [0|1|2], and most of the LWP::UserAgent options. Blank lines and lines starting with a "#" sign are ignored.

Options specified here override any specified in ~/.config/StreamFinder/config.

Among the options valid for Blogger streams are the -keep and -youtube options described under new().

~/.config/StreamFinder/config

Optional text file for specifying various configuration options. Each option is specified on a separate line in the format below:

'option' => 'value' [,]

and the options are loaded into a hash used by all sites (submodules) that support them. Valid options include -debug => [0|1|2], and most of the LWP::UserAgent options.

NOTE: Options passed in the parameter list of new() override the corresponding options specified in these files.
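The precedence described above (general config file, then the Blogger-specific file, then arguments passed to new()) can be sketched as a simple hash merge. parse_config_text() is a hypothetical parser written only for this illustration, not the module's own loader, and the option values shown (including the spelling of the LWP::UserAgent "timeout" option as a key) are assumptions.

```perl
use strict;
use warnings;

# Hypothetical parser: read lines of the form  'option' => 'value' [,]
# into a hash, skipping blank lines and lines starting with "#".
sub parse_config_text {
    my ($text) = @_;
    my %opts;
    for my $line (split /\r?\n/, $text) {
        next if $line =~ /^\s*(?:#|$)/;
        # Quotes around the option and the value are optional in this sketch.
        if ($line =~ /^\s*(['"]?)([-\w]+)\1\s*=>\s*(['"]?)(.*?)\3\s*,?\s*$/) {
            $opts{$2} = $4;
        }
    }
    return %opts;
}

# Stand-in for the contents of ~/.config/StreamFinder/config
my %global = parse_config_text(<<'END');
# options for all submodules
'-debug' => '1',
'timeout' => '20',
END

# Stand-in for the contents of ~/.config/StreamFinder/Blogger/config
my %site = parse_config_text(<<'END');
# options for Blogger only
'-keep' => 'mp4,stream,all',
'-youtube' => 'no',
'timeout' => '10',
END

# Stand-in for options passed directly to new()
my %args = ('-youtube' => 'only');

# Later hashes win: general file, then site file, then constructor arguments.
my %effective = (%global, %site, %args);
print "$_ => $effective{$_}\n" for sort keys %effective;
```

In this sketch, -youtube ends up as "only" (from the arguments), timeout as "10" (from the Blogger file), and -debug as "1" (from the general file).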

KEYWORDS

blogger

DEPENDENCIES

URI::Escape, HTML::Entities, LWP::UserAgent

RECOMMENDS

youtube-dl

wget

BUGS

Please report any bugs or feature requests to bug-streamFinder-blogger at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=StreamFinder-Blogger. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.

SUPPORT

You can find documentation for this module with the perldoc command.

    perldoc StreamFinder::Blogger

LICENSE AND COPYRIGHT

Copyright 2017-2021 Jim Turner.

This program is free software; you can redistribute it and/or modify it under the terms of the Artistic License (2.0). You may obtain a copy of the full license at:

http://www.perlfoundation.org/artistic_license_2_0

Any use, modification, and distribution of the Standard or Modified Versions is governed by this Artistic License. By using, modifying or distributing the Package, you accept this license. Do not use, modify, or distribute the Package, if you do not accept this license.

If your Modified Version has been derived from a Modified Version made by someone other than you, you are nevertheless required to ensure that your Modified Version complies with the requirements of this license.

This license does not grant you the right to use any trademark, service mark, tradename, or logo of the Copyright Holder.

This license includes the non-exclusive, worldwide, free-of-charge patent license to make, have made, use, offer to sell, sell, import and otherwise transfer the Package with respect to any patent claims licensable by the Copyright Holder that are necessarily infringed by the Package. If you institute patent litigation (including a cross-claim or counterclaim) against any party alleging that the Package constitutes direct or contributory patent infringement, then this Artistic License to you shall terminate on the date that such litigation is filed.

Disclaimer of Warranty: THE PACKAGE IS PROVIDED BY THE COPYRIGHT HOLDER AND CONTRIBUTORS "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES. THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, OR NON-INFRINGEMENT ARE DISCLAIMED TO THE EXTENT PERMITTED BY YOUR LOCAL LAW. UNLESS REQUIRED BY LAW, NO COPYRIGHT HOLDER OR CONTRIBUTOR WILL BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, OR CONSEQUENTIAL DAMAGES ARISING IN ANY WAY OUT OF THE USE OF THE PACKAGE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

testonly

$html = <<'ENDHTML'; <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"><html dir="ltr"><head><style type="text/css"> body, .main, #videocontainer, .thumbnail-holder, .play-button { background: black; height: 100vh; margin: 0; overflow: hidden; position: absolute; width: 100%; }

        #videocontainer.type-BLOGGER_UPLOADED .thumbnail-holder {
          background-size: contain;
        }

        .thumbnail-holder {
          background-repeat: no-repeat;
          background-position: center;
          z-index: 10;
        }

        .play-button {
          background: url('https://www.gstatic.com/images/icons/material/system/1x/play_arrow_white_48dp.png') rgba(0,0,0,0.1) no-repeat center;
          cursor: pointer;
          display: block;
          z-index: 20;
        }
      </style>
<script type="text/javascript">
        var VIDEO_CONFIG = {"thumbnail":"https://video.google.com/ThumbnailServer2?app\u003dblogger\u0026contentid\u003d4b85c0c7c4fa62c4\u0026offsetms\u003d5000\u0026itag\u003dw320\u0026expire\u003d1595683627\u0026sigh\u003d8KiGsQ9ENNk9ar92EdGBb7yw1fU","iframe_id":"BLOGGER-video-4b85c0c7c4fa62c4-3612","allow_resize":false,"streams":[{"play_url":"https://r4---sn-q4flrn7y.googlevideo.com/videoplayback?expire\u003d1595687227\u0026ei\u003du9AbX_j9I9qQrvIP8JCYuAc\u0026ip\u003d74.113.246.161\u0026id\u003d4b85c0c7c4fa62c4\u0026itag\u003d18\u0026source\u003dblogger\u0026mh\u003daH\u0026mm\u003d31\u0026mn\u003dsn-q4flrn7y\u0026ms\u003dau\u0026mv\u003dm\u0026mvi\u003d4\u0026pl\u003d24\u0026susc\u003dbl\u0026mime\u003dvideo/mp4\u0026dur\u003d54.172\u0026lmt\u003d1348112508420630\u0026mt\u003d1595658219\u0026sparams\u003dexpire,ei,ip,id,itag,source,susc,mime,dur,lmt\u0026sig\u003dAOq0QJ8wRQIhAPDDa3kNcLOfoVXp2Pn0rtl1g6Th9WNZpaKedGNyY_yxAiBRuA4ArLK4PozoFOl7E__WmFK0k5FJCysw8rFI7tCmfQ%3D%3D\u0026lsparams\u003dmh,mm,mn,ms,mv,mvi,pl\u0026lsig\u003dAG3C_xAwRgIhAO3j3Xm7z2fjkCG8JBeOH8B2-xT7aUvvJfAOohzWlaraAiEAy7k8NZZ1PIitjM8rM5DHqoK2l0hNsC6Pug91Yn7ZAbM%3D","format_id":18}]}
      </script></head>
<body><div class="main"><div id="videocontainer" class="type-BLOGGER_UPLOADED"><div class="thumbnail-holder"></div>
<div class="play-button"></div></div></div>
<script type="text/javascript" src="https://www.blogger.com/static/v1/jsbin/764818548-video_compiled.js"></script>
ENDHTML