The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Image::Leptonica::Func::conncomp

VERSION

version 0.04

conncomp.c

  conncomp.c

    Connected component counting and extraction, using Heckbert's
    stack-based filling algorithm.

      4- and 8-connected components: counts, bounding boxes and images

      Top-level calls:
           BOXA     *pixConnComp()
           BOXA     *pixConnCompPixa()
           BOXA     *pixConnCompBB()
           l_int32   pixCountConnComp()

      Identify the next c.c. to be erased:
           l_int32   nextOnPixelInRaster()
           l_int32   nextOnPixelInRasterLow()

      Erase the c.c., saving the b.b.:
           BOX      *pixSeedfillBB()
           BOX      *pixSeedfill4BB()
           BOX      *pixSeedfill8BB()

      Just erase the c.c.:
           l_int32   pixSeedfill()
           l_int32   pixSeedfill4()
           l_int32   pixSeedfill8()

      Static stack helper functions for single raster line seedfill:
           static void    pushFillsegBB()
           static void    pushFillseg()
           static void    popFillseg()

  The basic method in pixConnCompBB() is very simple.  We scan the
  image in raster order, looking for the next ON pixel.  When it
  is found, we erase it and every pixel of the 4- or 8-connected
  component to which it belongs, using Heckbert's seedfill
  algorithm.  As pixels are erased, we keep track of the
  minimum rectangle that encloses all erased pixels; after
  the connected component has been erased, we save its
  bounding box in an array of boxes.  When all pixels in the
  image have been erased, we have an array that describes every
  4- or 8-connected component in terms of its bounding box.

  pixConnCompPixa() is a slight variation on pixConnCompBB(),
  where we additionally save an array of images (in a Pixa)
  of each of the 4- or 8-connected components.  This is done trivially
  by maintaining two temporary images.  We erase a component from one,
  and use the bounding box to extract the pixels within the b.b.
  from each of the two images.  An XOR between these subimages
  gives the erased component.  Then we erase the component from the
  second image using the XOR again, with the extracted component
  placed on the second image at the location of the bounding box.
  Rasterop does all the work.  At the end, we have an array
  of the 4- or 8-connected components, as well as an array of the
  bounding boxes that describe where they came from in the original image.

  If you just want the number of connected components, pixCountConnComp()
  is a bit faster than pixConnCompBB(), because it doesn't have to
  keep track of the bounding rectangles for each c.c.

FUNCTIONS

nextOnPixelInRaster

l_int32 nextOnPixelInRaster ( PIX *pixs, l_int32 xstart, l_int32 ystart, l_int32 *px, l_int32 *py )

  nextOnPixelInRaster()

      Input:  pixs (1 bpp)
              xstart, ystart  (starting point for search)
              &x, &y  (<return> coord value of next ON pixel)
      Return: 1 if a pixel is found; 0 otherwise or on error

pixConnComp

BOXA * pixConnComp ( PIX *pixs, PIXA **ppixa, l_int32 connectivity )

  pixConnComp()

      Input:  pixs (1 bpp)
              &pixa   (<optional return> pixa of each c.c.)
              connectivity (4 or 8)
      Return: boxa, or null on error

  Notes:
      (1) This is the top-level call for getting bounding boxes or
          a pixa of the components, and it can be used instead
          of either pixConnCompBB() or pixConnCompPixa(), rsp.

pixConnCompBB

BOXA * pixConnCompBB ( PIX *pixs, l_int32 connectivity )

  pixConnCompBB()

      Input:  pixs (1 bpp)
              connectivity (4 or 8)
      Return: boxa, or null on error

 Notes:
     (1) Finds bounding boxes of 4- or 8-connected components
         in a binary image.
     (2) This works on a copy of the input pix.  The c.c. are located
         in raster order and erased one at a time.  In the process,
         the b.b. is computed and saved.

pixConnCompPixa

BOXA * pixConnCompPixa ( PIX *pixs, PIXA **ppixa, l_int32 connectivity )

  pixConnCompPixa()

      Input:  pixs (1 bpp)
              &pixa (<return> pixa of each c.c.)
              connectivity (4 or 8)
      Return: boxa, or null on error

  Notes:
      (1) This finds bounding boxes of 4- or 8-connected components
          in a binary image, and saves images of each c.c
          in a pixa array.
      (2) It sets up 2 temporary pix, and for each c.c. that is
          located in raster order, it erases the c.c. from one pix,
          then uses the b.b. to extract the c.c. from the two pix using
          an XOR, and finally erases the c.c. from the second pix.
      (3) A clone of the returned boxa (where all boxes in the array
          are clones) is inserted into the pixa.
      (4) If the input is valid, this always returns a boxa and a pixa.
          If pixs is empty, the boxa and pixa will be empty.

pixCountConnComp

l_int32 pixCountConnComp ( PIX *pixs, l_int32 connectivity, l_int32 *pcount )

  pixCountConnComp()

      Input:  pixs (1 bpp)
              connectivity (4 or 8)
              &count (<return>
      Return: 0 if OK, 1 on error

 Notes:
     (1) This is the top-level call for getting the number of
         4- or 8-connected components in a 1 bpp image.
     (2) It works on a copy of the input pix.  The c.c. are located
         in raster order and erased one at a time.

pixSeedfill

l_int32 pixSeedfill ( PIX *pixs, L_STACK *stack, l_int32 x, l_int32 y, l_int32 connectivity )

  pixSeedfill()

      Input:  pixs (1 bpp)
              stack (for holding fillsegs)
              x,y   (location of seed pixel)
              connectivity  (4 or 8)
      Return: 0 if OK, 1 on error

  Notes:
      (1) This removes the component from pixs with a fg pixel at (x,y).
      (2) See pixSeedfill4() and pixSeedfill8() for details.

pixSeedfill4

l_int32 pixSeedfill4 ( PIX *pixs, L_STACK *stack, l_int32 x, l_int32 y )

  pixSeedfill4()

      Input:  pixs (1 bpp)
              stack (for holding fillsegs)
              x,y   (location of seed pixel)
      Return: 0 if OK, 1 on error

  Notes:
      (1) This is Paul Heckbert's stack-based 4-cc seedfill algorithm.
      (2) This operates on the input 1 bpp pix to remove the fg seed
          pixel, at (x,y), and all pixels that are 4-connected to it.
          The seed pixel at (x,y) must initially be ON.
      (3) Reference: see pixSeedFill4BB()

pixSeedfill4BB

BOX * pixSeedfill4BB ( PIX *pixs, L_STACK *stack, l_int32 x, l_int32 y )

  pixSeedfill4BB()

      Input:  pixs (1 bpp)
              stack (for holding fillsegs)
              x,y   (location of seed pixel)
      Return: box or null on error.

  Notes:
      (1) This is Paul Heckbert's stack-based 4-cc seedfill algorithm.
      (2) This operates on the input 1 bpp pix to remove the fg seed
          pixel, at (x,y), and all pixels that are 4-connected to it.
          The seed pixel at (x,y) must initially be ON.
      (3) Returns the bounding box of the erased 4-cc component.
      (4) Reference: see Paul Heckbert's stack-based seed fill algorithm
          in "Graphic Gems", ed. Andrew Glassner, Academic
          Press, 1990.  The algorithm description is given
          on pp. 275-277; working C code is on pp. 721-722.)
          The code here follows Heckbert's exactly, except
          we use function calls instead of macros for
          pushing data on and popping data off the stack.
          This makes sense to do because Heckbert's fixed-size
          stack with macros is dangerous: images exist that
          will overrun the stack and crash.   The stack utility
          here grows dynamically as needed, and the fillseg
          structures that are not in use are stored in another
          stack for reuse.  It should be noted that the
          overhead in the function calls (vs. macros) is negligible.

pixSeedfill8

l_int32 pixSeedfill8 ( PIX *pixs, L_STACK *stack, l_int32 x, l_int32 y )

  pixSeedfill8()

      Input:  pixs (1 bpp)
              stack (for holding fillsegs)
              x,y   (location of seed pixel)
      Return: 0 if OK, 1 on error

  Notes:
      (1) This is Paul Heckbert's stack-based 8-cc seedfill algorithm.
      (2) This operates on the input 1 bpp pix to remove the fg seed
          pixel, at (x,y), and all pixels that are 8-connected to it.
          The seed pixel at (x,y) must initially be ON.
      (3) Reference: see pixSeedFill8BB()

pixSeedfill8BB

BOX * pixSeedfill8BB ( PIX *pixs, L_STACK *stack, l_int32 x, l_int32 y )

  pixSeedfill8BB()

      Input:  pixs (1 bpp)
              stack (for holding fillsegs)
              x,y   (location of seed pixel)
      Return: box or null on error.

  Notes:
      (1) This is Paul Heckbert's stack-based 8-cc seedfill algorithm.
      (2) This operates on the input 1 bpp pix to remove the fg seed
          pixel, at (x,y), and all pixels that are 8-connected to it.
          The seed pixel at (x,y) must initially be ON.
      (3) Returns the bounding box of the erased 8-cc component.
      (4) Reference: see Paul Heckbert's stack-based seed fill algorithm
          in "Graphic Gems", ed. Andrew Glassner, Academic
          Press, 1990.  The algorithm description is given
          on pp. 275-277; working C code is on pp. 721-722.)
          The code here follows Heckbert's closely, except
          the leak checks are changed for 8 connectivity.
          See comments on pixSeedfill4BB() for more details.

pixSeedfillBB

BOX * pixSeedfillBB ( PIX *pixs, L_STACK *stack, l_int32 x, l_int32 y, l_int32 connectivity )

  pixSeedfillBB()

      Input:  pixs (1 bpp)
              stack (for holding fillsegs)
              x,y   (location of seed pixel)
              connectivity  (4 or 8)
      Return: box or null on error

  Notes:
      (1) This is the high-level interface to Paul Heckbert's
          stack-based seedfill algorithm.

AUTHOR

Zakariyya Mughal <zmughal@cpan.org>

COPYRIGHT AND LICENSE

This software is copyright (c) 2014 by Zakariyya Mughal.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.