1. urlparse
Purpose: parse a URL into its components.
    from urllib import parse

    url = "https://book.qidian.com/info/1004608738"
    result = parse.urlparse(url=url)
    print(result)
Result:
ParseResult(scheme='https', netloc='book.qidian.com', path='/info/1004608738', params='', query='', fragment='')
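scheme: the protocol
netloc: the network location (domain name)
path: the path
params: parameters attached to the last path segment (the part after ";")
query: the query string, usually seen in the URL of a GET request
fragment: the anchor, used to jump directly to a specific position within the page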
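The URL in the example above leaves params, query and fragment empty, so a short sketch with every field filled in may help. The URL below is hypothetical and exists only for illustration; the fields are read as attributes of the returned ParseResult.

```python
from urllib import parse

# Hypothetical URL chosen so that every component is non-empty.
url = "https://example.com/path/page;type=a?key=value&page=2#section-1"
result = parse.urlparse(url)

# Each component is available as an attribute of the named tuple.
print(result.scheme)    # https
print(result.netloc)    # example.com
print(result.path)      # /path/page
print(result.params)    # type=a
print(result.query)     # key=value&page=2
print(result.fragment)  # section-1

# parse_qs turns the query string into a dict of lists.
query_dict = parse.parse_qs(result.query)
print(query_dict)       # {'key': ['value'], 'page': ['2']}
```

Because ParseResult is a named tuple, the same values can also be read by index, for example result[0] for the scheme.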
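2. urlunparse
Purpose: build a URL from its six components (the reverse of urlparse).
    from urllib import parse

    # Order: scheme, netloc, path, params, query, fragment
    url_params = ('https', 'book.qidian.com', '/info/1004608738', '', '', '')
    _url = parse.urlunparse(url_params)
    print(_url)
    # https://book.qidian.com/info/1004608738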
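Since urlunparse accepts the result of urlparse directly, the two can be combined to change a single component of an existing URL. The following is a minimal sketch; the query value "page=2" is an assumption made up for the example.

```python
from urllib import parse

original = "https://book.qidian.com/info/1004608738"
parts = parse.urlparse(original)

# Round trip: parsing and then unparsing gives back the same URL.
rebuilt = parse.urlunparse(parts)
print(rebuilt == original)  # True

# ParseResult is a named tuple, so _replace returns a modified copy.
# "page=2" is a hypothetical query string used only for illustration.
modified = parse.urlunparse(parts._replace(query="page=2"))
print(modified)
# https://book.qidian.com/info/1004608738?page=2
```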
3. urljoin
Purpose: join a base URL with a relative URL.
    from urllib import parse

    url_1 = "https://book.qidian.com/"
    url_2 = "info/1004608738"
    new_url = parse.urljoin(url_1, url_2)
    print(new_url)
    # https://book.qidian.com/info/1004608738
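The result of urljoin depends on whether the base ends with a slash and on the form of the second argument. The sketch below shows a few common cases; the paths "/info/list" and "/rank" are assumptions chosen for illustration and are not taken from the real site.

```python
from urllib import parse

# Hypothetical base URLs, with and without a trailing slash.
base_file = "https://book.qidian.com/info/list"
base_dir = "https://book.qidian.com/info/"

# Without a trailing slash, the last path segment of the base is replaced.
joined_file = parse.urljoin(base_file, "1004608738")
print(joined_file)  # https://book.qidian.com/info/1004608738

# With a trailing slash, the relative part is appended to the directory.
joined_dir = parse.urljoin(base_dir, "1004608738")
print(joined_dir)   # https://book.qidian.com/info/1004608738

# A leading slash replaces the whole path of the base.
joined_root = parse.urljoin(base_file, "/rank")
print(joined_root)  # https://book.qidian.com/rank

# An absolute URL as the second argument overrides the base entirely.
joined_abs = parse.urljoin(base_file, "https://www.sogou.com/web")
print(joined_abs)   # https://www.sogou.com/web
```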
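4. urlencode
Purpose: convert URL parameters from dictionary format into query-string format.
    from urllib import parse

    params = {
        'page': 10,
        'job': 'python'
    }
    url = "https://test.job.com/"
    url_params = parse.urlencode(params)
    # urljoin would treat the encoded string as a relative path,
    # so the query string is appended after an explicit "?".
    new_url = url + "?" + url_params
    print(new_url)
    # https://test.job.com/?page=10&job=python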
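urlencode also percent-encodes non-ASCII values, and with doseq=True it expands list values into repeated keys. The sketch below illustrates both and decodes the result again with parse_qs; the parameter names are assumptions made up for the example.

```python
from urllib import parse

# Hypothetical parameters: one non-ASCII value and one integer.
params = {"query": "海贼王", "page": 1}
encoded = parse.urlencode(params)
print(encoded)
# query=%E6%B5%B7%E8%B4%BC%E7%8E%8B&page=1

# doseq=True emits one key=value pair per list element.
multi = parse.urlencode({"tag": ["a", "b"]}, doseq=True)
print(multi)
# tag=a&tag=b

# parse_qs reverses the encoding; every value comes back as a list of strings.
decoded = parse.parse_qs(encoded)
print(decoded)
# {'query': ['海贼王'], 'page': ['1']}
```

Note that parse_qs returns strings only, so the integer 1 comes back as '1'.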
5. quote
Purpose: convert Chinese (or other non-ASCII) text into URL percent-encoding.
    from urllib import parse

    key = "海贼王"
    _key = parse.quote(key)
    print(_key)
    # %E6%B5%B7%E8%B4%BC%E7%8E%8B
    url = "https://www.sogou.com/web?query={}".format(_key)
    print(url)
    # https://www.sogou.com/web?query=%E6%B5%B7%E8%B4%BC%E7%8E%8B
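By default quote leaves "/" unencoded and writes a space as %20, which is not always what a query value needs. The sketch below compares quote, quote with safe="" and quote_plus on a made-up string.

```python
from urllib import parse

# Hypothetical input containing a space and a slash.
text = "a b/c"

# Default: "/" is treated as safe, the space becomes %20.
default_quoted = parse.quote(text)
print(default_quoted)  # a%20b/c

# safe="" encodes the slash as well.
strict_quoted = parse.quote(text, safe="")
print(strict_quoted)   # a%20b%2Fc

# quote_plus targets query strings: space becomes "+", "/" is encoded.
plus_quoted = parse.quote_plus(text)
print(plus_quoted)     # a+b%2Fc
```

For a value that goes into a query string, quote_plus or quote with safe="" is usually the safer choice, since a raw "/" or "&" inside the value could otherwise be misread.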
6. unquote
Purpose: convert URL percent-encoding back into Chinese (or other non-ASCII) text.
    from urllib import parse

    url = "https://www.sogou.com/web?query=%E6%B5%B7%E8%B4%BC%E7%8E%8B"
    unquote_url = parse.unquote(url)
    print(unquote_url)
    # https://www.sogou.com/web?query=海贼王
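unquote does not treat "+" as a space, whereas unquote_plus does, which matters when decoding form-style query strings. The sketch below shows the difference on a made-up string and then pulls the decoded query value out of the URL used above with urlparse and parse_qs.

```python
from urllib import parse

# Hypothetical encoded string mixing "+" and %20.
encoded = "a+b%20c"

plain = parse.unquote(encoded)
print(plain)        # a+b c

plus = parse.unquote_plus(encoded)
print(plus)         # a b c

# Extracting a single decoded parameter from a full URL.
url = "https://www.sogou.com/web?query=%E6%B5%B7%E8%B4%BC%E7%8E%8B"
query = parse.parse_qs(parse.urlparse(url).query)
print(query["query"][0])  # 海贼王
```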