ó ˆ(phÐEãó<•SrSSKJr SrS/rSSKJr SSKJrJ r J r JrJrJ r JrJrJrJrJr SSKJrJrJrJrJrJr SSKJrJr SS KJrJrJ r J!r! SS K"J#r# \(aSSK$J%r% SSKJ&r& SS K'J(r(J)r)J*r* Sr+\ \\,\,4\,\,/S4r-"SS\\5r."SS\ 5r/g)zCUse the HTMLParser library to parse HTML files that aren't too bad.é)ÚannotationsÚMITÚHTMLParserTreeBuilder)Ú HTMLParser)ÚAnyÚCallableÚcastÚDictÚIterableÚListÚOptionalÚ TYPE_CHECKINGÚTupleÚTypeÚUnion)Ú AttributeDictÚCDataÚCommentÚDeclarationÚDoctypeÚProcessingInstruction)ÚEntitySubstitutionÚ UnicodeDammit)ÚDetectsXMLParsedAsHTMLÚHTMLÚHTMLTreeBuilderÚSTRICT©ÚParserRejectedMarkup)Ú BeautifulSoup)ÚNavigableString)Ú _EncodingÚ _EncodingsÚ _RawMarkupzhtml.parserNcó•\rSrSr%SrS\S'SrS\S'\S.SSjjrS\S 'S \S'S \S'SSjrSSjr SS Sjjr SS!SjjrS"SjrS#Sjr S#SjrS"SjrS"SjrS"SjrS"SjrSrg)$ÚBeautifulSoupHTMLParseré=ÚreplaceÚstrÚREPLACEÚignoreÚIGNORE©Úon_duplicate_attributeÚsoupr r.ú&Union[str, _DuplicateAttributeHandler]có´•XlX lURRUl[R "U/UQ70UD6 /UlUR5 g©N)r/r.ÚbuilderÚattribute_dict_classrÚ__init__Úalready_closed_empty_elementÚ_initialize_xml_detector)Úselfr/r.ÚargsÚkwargss ÚJ/var/www/html/venv/lib/python3.13/site-packages/bs4/builder/_htmlparser.pyr5Ú BeautifulSoupHTMLParser.__init__TsO€ðŒ Ø&<Ô#Ø$(§L¡L×$EÑ$EˆÔ!Ü×Ò˜DÐ2 4Ò2¨6Ò2ð-/ˆÔ)à×%Ñ%Õ'óz List[str]r6có•[U5er2r)r8Úmessages r;ÚerrorÚBeautifulSoupHTMLParser.erroros€ô# 7Ó+Ð+r=cóF•URXSS9 URU5 g)zmHandle an incoming empty-element tag. html.parser only calls this method when the markup looks like . F)Úhandle_empty_elementN)Úhandle_starttagÚ handle_endtag)r8ÚnameÚattrss r;Úhandle_startendtagÚ*BeautifulSoupHTMLParser.handle_startendtags%€ð ×Ñ˜T¸uÐÑEØ×Ñ˜4Õ r=c óx•UR5nUHcupVUcSnXT;aPURnXpR:XaM,USUR4;aXdU'MD[ [ U5nU"XEU5 M_XdU'Me URRR(aUR5up‰OS=p‰URRUSSXHU S9n U (aCU R(a2U(a+URUSS9 URRU5 URcUR!U5 gg)z³Handle an opening tag, e.g. '' :param handle_empty_element: True if this tag is known to be an empty-element tag (i.e. there is not expected to be any closing tag). NÚ)Ú sourcelineÚ sourceposF)Úcheck_already_closed)r4r.r,r*r Ú_DuplicateAttributeHandlerr/r3Ústore_line_numbersÚgetposrDÚis_empty_elementrEr6ÚappendÚ_root_tag_nameÚ_root_tag_encountered)r8rFrGrCÚ attr_dictÚkeyÚvalueÚon_duperLrMÚtags r;rDÚ'BeautifulSoupHTMLParser.handle_starttagŽs+€ð$(×#<Ñ#<Ó#>ˆ Û‰JˆCð‰}ØØÓð×5Ñ5ØŸk™kÓ)ÙØ t§|¡|Ð 4Ó4Ø%*˜c“Nä"Ô#=¸wÓGGÙ˜I¨EÖ2à!&˜#“ñ% ð,9‰9×Ñ×/×/Ø$(§K¡K£MÑ!ˆJ˜ à%)Ð)ˆJØi‰i×'Ñ'Ø$˜˜iÈ)ð(ð ˆö3×'×'Ö,@ð ×Ñ˜t¸%ÐÑ@ð ×-Ñ-×4Ñ4°TÔ:à×ÑÑ&Ø×&Ñ& tÕ,ð'r=cóž•U(a+XR;aURRU5 gURRU5 g)zÅHandle a closing tag, e.g. '' :param name: A tag name. :param check_already_closed: True if this tag is expected to be the closing portion of an empty-element tag, e.g. ''. N)r6Úremover/rE)r8rFrNs r;rEÚ%BeautifulSoupHTMLParser.handle_endtagÌs:€ö D×,MÑ,MÓ$Mð ×-Ñ-×4Ñ4°TÕ:àI‰I×#Ñ# DÕ)r=có:•URRU5 g)z4Handle some textual data that shows up between tags.N)r/Úhandle_data©r8Údatas r;r`Ú#BeautifulSoupHTMLParser.handle_dataÞs€à ‰ ×Ñ˜dÕ#r=có&•URS5(a[URS5S5nO=URS5(a[URS5S5nO[U5nSnUS:aDURRS4H(nU(dM[U/5R U5nM* U(d[U5nU=(d SnURU5 g![a Mif=f![[4a N@f=f)z·Handle a numeric character reference by converting it to the corresponding Unicode character and treating it as textual data. :param name: Character number, possibly in hexadecimal. ÚxéÚXNézwindows-1252uï¿½)Ú startswithÚintÚlstripr/Úoriginal_encodingÚ bytearrayÚdecodeÚUnicodeDecodeErrorÚchrÚ ValueErrorÚ OverflowErrorr`)r8rFÚ real_namerbÚencodings r;Úhandle_charrefÚ&BeautifulSoupHTMLParser.handle_charrefâsù€ð?‰?˜3×ÑÜ˜DŸK™K¨Ó,¨bÓ1‰IØ _‰_˜S× !Ñ !Ü˜DŸK™K¨Ó,¨bÓ1‰Iä˜D› ˆIàˆØs‹?ð"ŸY™Y×8Ñ8¸.ÓIÞÙðÜ$ i [Ó1×8Ñ8¸ÓB’Dñ Jöð Ü˜9“~ð×2Ð2ˆØ×Ñ˜Õøô*óÚðûô ¤ Ð.ó Ùð ús$ÂC,ÃC=Ã, C:Ã9C:Ã=DÄDcóz•[RRU5nUbUnOSU-nURU5 g)z¨Handle a named entity reference by converting it to the corresponding Unicode character(s) and treating it as textual data. :param name: Name of the entity reference. Nz&%s)rÚHTML_ENTITY_TO_CHARACTERÚgetr`)r8rFÚ characterrbs r;Úhandle_entityrefÚ(BeautifulSoupHTMLParser.handle_entityref s>€ô'×?Ñ?×CÑCÀDÓIˆ ØÑ Ø‰Dð˜4‘<ˆDØ×Ñ˜Õr=có¬•URR5 URRU5 URR[5 g)z?Handle an HTML comment. :param data: The text of the comment. N)r/ÚendDatar`rras r;Úhandle_commentÚ&BeautifulSoupHTMLParser.handle_comments8€ð ‰ ×ÑÔØ ‰ ×Ñ˜dÔ#Ø ‰ ×Ñœ'Õ"r=cóÈ•URR5 U[S5SnURRU5 URR[5 g)zIHandle a DOCTYPE declaration. :param data: The text of the declaration. zDOCTYPE N)r/r~Úlenr`rras r;Úhandle_declÚ#BeautifulSoupHTMLParser.handle_decl&sI€ð ‰ ×ÑÔØ”C˜ “OÐ%Ð&ˆØ ‰ ×Ñ˜dÔ#Ø ‰ ×Ñœ'Õ"r=có"•UR5RS5(a[nU[S5SnO[nUR R 5 UR RU5 UR R U5 g)zkHandle a declaration of unknown type -- probably a CDATA block. :param data: The text of the declaration. zCDATA[N)Úupperrirr‚rr/r~r`)r8rbÚclss r;Úunknown_declÚ$BeautifulSoupHTMLParser.unknown_decl0si€ð:‰:‹<×"Ñ" 8×,Ñ,ÜˆCØœ˜H› ˜Ð(‰DäˆCØ ‰ ×ÑÔØ ‰ ×Ñ˜dÔ#Ø ‰ ×Ñ˜#Õr=cóÎ•URR5 URRU5 URU5 URR[5 g)zLHandle a processing instruction. :param data: The text of the instruction. N)r/r~r`Ú_document_might_be_xmlrras r;Ú handle_piÚ!BeautifulSoupHTMLParser.handle_pi?sG€ð ‰ ×ÑÔØ ‰ ×Ñ˜dÔ#Ø×#Ñ# DÔ)Ø ‰ ×ÑÔ/Õ0r=)r6r4r.r/N)r/r r9rr.r0r:r)r?r)ÚreturnÚNone)rFr)rGúList[Tuple[str, Optional[str]]]rŽr)T)rFr)rGrrCÚboolrŽr)rFr)rNr‘rŽr)rbr)rŽr)rFr)rŽr)Ú__name__Ú __module__Ú__qualname__Ú__firstlineno__r*Ú__annotations__r,r5r@rHrDrEr`rur{rrƒrˆrŒÚ__static_attributes__©r=r;r&r&=sã‡ð€GˆSÓð€FˆCÓðð$JQñ (àð(ðð(ð!Gð (ð õ(ð.CÓBØ"+Ó+Ø Óô,ð !Øð !Ø ?ð !à ô !ð&&*ð <-àð<-ð/ð<-ð#ð <-ð õ<-ö|*ô$$ô&ôPô&#ô#ô ÷1r=r&cóÊ^•\rSrSr%SrSrS\S'SrS\S'\r S\S '\ \ \/rS \S'S\S 'Sr S\S'SSU4SjjjrSSSjjrSSjrSrU=r$)riJz‹A Beautiful soup `bs4.builder.TreeBuilder` that uses the :py:class:`html.parser.HTMLParser` parser, found in the Python standard library. Fr‘Úis_xmlTÚ picklabler)ÚNAMEz Iterable[str]Úfeaturesz$Tuple[Iterable[Any], Dict[str, Any]]Úparser_argsÚTRACKS_LINE_NUMBERScóô>•[5nSHnXS;dM URU5nXdU'M! [[U]"S0UD6 U=(d /nU=(d 0nURU5 SUS'X4Ulg)aBConstructor. :param parser_args: Positional arguments to pass into the BeautifulSoupHTMLParser constructor, once it's invoked. :param parser_kwargs: Keyword arguments to pass into the BeautifulSoupHTMLParser constructor, once it's invoked. :param kwargs: Keyword arguments for the superclass constructor. r-FÚconvert_charrefsNr˜)ÚdictÚpopÚsuperrr5Úupdaterž)r8ržÚ parser_kwargsr:Úextra_parser_kwargsÚargrXÚ __class__s €r;r5ÚHTMLParserTreeBuilder.__init__[sø€ô$#›fÐÛ.ˆCØ}ØŸ ™ 3›Ø+0 CÓ(ñ/ô Ô# TÒ3Ñ=°fÒ=Ø!×' RˆØ%×+¨ˆ Ø×ÑÐ0Ô1Ø,1ˆ Ð(Ñ)Ø'Ð7ˆÕr=c#óZ# •[U[5(a USSS4v• g/nU(aURU5 /nU(aURU5 [UUUSUS9nURc[S5eURURURUR4v• g7f)aÂRun any preliminary steps necessary to make incoming markup acceptable to the parser. :param markup: Some markup -- probably a bytestring. :param user_specified_encoding: The user asked to try this encoding. :param document_declared_encoding: The markup itself claims to be in this encoding. :param exclude_encodings: The user asked _not_ to try any of these encodings. :yield: A series of 4-tuples: (markup, encoding, declared encoding, has undergone character replacement) Each 4-tuple represents a strategy for parsing the document. This TreeBuilder uses Unicode, Dammit to convert the markup into Unicode, so the ``markup`` element of the tuple will always be a string. NFT)Úknown_definite_encodingsÚuser_encodingsÚis_htmlÚexclude_encodingszPCould not convert input to Unicode, and html.parser will not accept bytestrings.) Ú isinstancer)rSrÚunicode_markuprrlÚdeclared_html_encodingÚcontains_replacement_characters)r8ÚmarkupÚuser_specified_encodingÚdocument_declared_encodingr¯r¬rÚdammits r;Úprepare_markupÚ$HTMLParserTreeBuilder.prepare_markupysÄé€ô2fœc×"Ñ"à˜4 uÐ-Ò-Øð57Ð Þ"ð %×+Ñ+Ð,CÔDà*,ˆÞ%ð ×!Ñ!Ð"<Ô=äØØ%=Ø)ØØ/ñ ˆð× Ñ Ñ(ô'Øbóð ð ×%Ñ%Ø×(Ñ(Ø×-Ñ-Ø×6Ñ6ð ó ùs‚B)B+có*•URup#[U[5(deURce[ UR/UQ70UD6nURU5 UR 5 /Ul g![an[U5eSnAff=fr2) ržr°r)r/r&ÚfeedÚcloseÚAssertionErrorrr6)r8r´r9r:ÚparserÚes r;r»ÚHTMLParserTreeBuilder.feedÁs‘€Ø×'Ñ'‰ˆô˜&¤#×&Ñ&Ð&Ð&ð y‰yÑ$Ð$Ð$Ü(¨¯©ÐD°TÒD¸VÑDˆð *ØK‰K˜ÔØL‰LŒNð/1ˆÕ+øôó *ô' qÓ)Ð)ûð *úsÁ!A8Á8 BÂB Â B)rž)NN)ržzOptional[Iterable[Any]]r¦zOptional[Dict[str, Any]]r:r)NNN) r´r$rµúOptional[_Encoding]r¶rÁr¯zOptional[_Encodings]rŽzDIterable[Tuple[str, Optional[_Encoding], Optional[_Encoding], bool]])r´r$rŽr)r’r“r”r•Ú__doc__ršr–r›Ú HTMLPARSERrœrrrrŸr5r¸r»r—Ú __classcell__)r©s@r;rrJsÍø‡ñð€FˆDÓØ€IˆtÓØ€Dˆ#ÓØ# T¨6Ð2€HˆmÓ2Ø5Ó5ð!%Ð˜Ó$ð04Ø26ð8à,ð8ð0ð8ð÷ 8ð8ðB8<Ø:>Ø26ðFàðFð"5ðFð%8ð Fð 0ðFð Nõ F÷P1ò1r=)0rÂÚ __future__rÚ__license__Ú__all__Úhtml.parserrÚtypingrrr r rrr rrrrÚbs4.elementrrrrrrÚ bs4.dammitrrÚbs4.builderrrrrÚbs4.exceptionsrÚbs4r r!Úbs4._typingr"r#r$rÃr)rOr&rr˜r=r;ÚrÐs®ðáIÝ"ð€ðð€õ#÷÷÷ñ÷÷÷9÷óõ0æÝ!Ý+÷ñð€ à% t¨C°¨H¡~°s¸CÐ&@À$Ð&FÑGÐôJ1˜jÐ*@ôJ1ôZP1˜OõP1r=