<feed xmlns="http://www.w3.org/2005/Atom"> <id>/</id><title>Jan Lipovský</title><subtitle>Jan Lipovský - open-source python developer, linux geek, photographer, woodworker, DIY enthusiast.</subtitle> <updated>2023-03-12T09:23:08+01:00</updated> <author> <name>Jan Lipovský</name> <uri>/</uri> </author><link rel="self" type="application/atom+xml" href="/feed.xml"/><link rel="alternate" type="text/html" hreflang="en" href="/"/> <generator uri="https://jekyllrb.com/" version="4.3.2">Jekyll</generator> <rights> © 2023 Jan Lipovský </rights> <icon>/assets/img/favicons/favicon.ico</icon> <logo>/assets/img/favicons/favicon-96x96.png</logo> <entry><title>URLExtract - introduction</title><link href="/posts/urlextract_introduction/" rel="alternate" type="text/html" title="URLExtract - introduction" /><published>2023-01-01T08:00:00+01:00</published> <updated>2023-01-01T08:00:00+01:00</updated> <id>/posts/urlextract_introduction/</id> <content src="/posts/urlextract_introduction/" /> <author> <name>Jan Lipovský</name> </author> <category term="urlextract" /> <summary> Let me introduce you urlextract. Small python class/library and command line interface which can be used for collecting (extracting) URLs from given text based on locating TLD. The latest version can be easily installed using: pip install urlextract regex vs urlextract To write one regular expression that will cover all formats of URL is impossible. For example this regex covers only few ... </summary> </entry> </feed>
