Class FindHTMLHrefHandler

    • Constructor Summary

      Constructors 
      Constructor Description
      FindHTMLHrefHandler​(java.lang.String parentURI, java.util.regex.Pattern preferredLinkPattern)  
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      void applyOverrides​(LoginParameters lp)
      Apply overrides
      void finishUp()
      Done with the document.
      void noteAHREF​(java.lang.String rawURL)
      Note discovered href
      void noteBASEHREF​(java.lang.String rawURL)
      Note discovered base
      void noteDiscoveredBase​(java.lang.String rawURL)
      Inform the world of a new base HREF.
      void noteDiscoveredLink​(java.lang.String rawURL)
      Override noteDiscoveredLink
      void noteFormEnd()
      Note the end of a form
      void noteFormInput​(java.util.Map inputAttributes)
      Note an input tag
      void noteFormStart​(java.util.Map formAttributes)
      Note the start of a form
      void noteFRAMESRC​(java.lang.String rawURL)
      Note discovered FRAME SRC
      void noteIMGSRC​(java.lang.String rawURL)
      Note discovered IMG SRC
      void noteLINKHREF​(java.lang.String rawURL)
      Note discovered href
      void noteMetaTag​(java.util.Map metaAttributes)
      Note a meta tag
      void noteTextCharacter​(char textCharacter)
      Note a character of text.
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Field Detail

      • preferredLinkPattern

        protected final java.util.regex.Pattern preferredLinkPattern
    • Constructor Detail

      • FindHTMLHrefHandler

        public FindHTMLHrefHandler​(java.lang.String parentURI,
                                   java.util.regex.Pattern preferredLinkPattern)
    • Method Detail

      • applyOverrides

        public void applyOverrides​(LoginParameters lp)
                            throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Apply overrides
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteTextCharacter

        public void noteTextCharacter​(char textCharacter)
                               throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note a character of text. Structured this way to keep overhead low for handlers that don't use text.
        Specified by:
        noteTextCharacter in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteMetaTag

        public void noteMetaTag​(java.util.Map metaAttributes)
                         throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note a meta tag
        Specified by:
        noteMetaTag in interface IMetaTagHandler
        Parameters:
        metaAttributes - are the attributes that belong to the tag.
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteFormStart

        public void noteFormStart​(java.util.Map formAttributes)
                           throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note the start of a form
        Specified by:
        noteFormStart in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteFormInput

        public void noteFormInput​(java.util.Map inputAttributes)
                           throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note an input tag
        Specified by:
        noteFormInput in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteFormEnd

        public void noteFormEnd()
                         throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note the end of a form
        Specified by:
        noteFormEnd in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteDiscoveredBase

        public void noteDiscoveredBase​(java.lang.String rawURL)
                                throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Description copied from interface: IDiscoveredLinkHandler
        Inform the world of a new base HREF.
        Specified by:
        noteDiscoveredBase in interface IDiscoveredLinkHandler
        Overrides:
        noteDiscoveredBase in class FindHandler
        Parameters:
        rawURL - is the new base HREF, in raw form. This may be relative, malformed, etc.
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteDiscoveredLink

        public void noteDiscoveredLink​(java.lang.String rawURL)
                                throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Override noteDiscoveredLink
        Specified by:
        noteDiscoveredLink in interface IDiscoveredLinkHandler
        Overrides:
        noteDiscoveredLink in class FindHandler
        Parameters:
        rawURL - is the raw discovered url. This may be relative, malformed, or otherwise unsuitable for use until final form is acheived.
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteBASEHREF

        public void noteBASEHREF​(java.lang.String rawURL)
                          throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note discovered base
        Specified by:
        noteBASEHREF in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteAHREF

        public void noteAHREF​(java.lang.String rawURL)
                       throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note discovered href
        Specified by:
        noteAHREF in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteLINKHREF

        public void noteLINKHREF​(java.lang.String rawURL)
                          throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note discovered href
        Specified by:
        noteLINKHREF in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteIMGSRC

        public void noteIMGSRC​(java.lang.String rawURL)
                        throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note discovered IMG SRC
        Specified by:
        noteIMGSRC in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • noteFRAMESRC

        public void noteFRAMESRC​(java.lang.String rawURL)
                          throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Note discovered FRAME SRC
        Specified by:
        noteFRAMESRC in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException
      • finishUp

        public void finishUp()
                      throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
        Description copied from interface: IHTMLHandler
        Done with the document.
        Specified by:
        finishUp in interface IHTMLHandler
        Throws:
        org.apache.manifoldcf.core.interfaces.ManifoldCFException