业余爱好『Favourite』 Archives

全国统计用区划代码和城乡划分代码 2022年数据[爬虫]【Json+CSV格式】

2023年4月21日 5 条评论

<noscript>
<h1><strong>Please enable JavaScript and refresh the page.</strong></h1>
</noscript>

业余爱好『Favourite』

精品美女吧爬虫【Windows】【23.04.16】

2023年4月16日 33 条评论

精品美女吧 爬虫
Verson: 23.04.16
Blog: http://www.h4ck.org.cn
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
         -a <download all site images>
         -h <display help text, just this>
Option Arguments:
         -p <image download path>
         -r <random index category list>
         -c <single category url>
         -e <early stop, work in site crawl mode only>
         -s <site url eg: https://www.jpxgmn.net (no last backslash "/")>
****************************************************************************************************

业余爱好『Favourite』

requests SSLCertVerificationError

2023年4月16日一条评论

Traceback (most recent call last):
  File "requests\adapters.py", line 439, in send
  File "urllib3\connectionpool.py", line 785, in urlopen
  File "urllib3\util\retry.py", line 592, in increment
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='www.jpmn8.cc', port=443): Max retries exceeded with url: / (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1124)')))

业余爱好『Favourite』

爱看美女网爬虫【群辉Docker】【23.03.02】

2023年3月16日 8 条评论

在群辉下通过pyinstaller编译py文件会出现各种问题。首先是没有binutils，如果要安装这个工具包，需要安装包管理器ipkg。在确定系统处理器架构之后即可安装对应的包管理下，命令如下：

wget http://ipkg.nslu2-linux.org/feeds/optware/syno-i686/cross/stable/syno-i686-bootstrap_1.2-7_i686.xsh
chmod +x syno-i686-bootstrap_1.2-7_i686.xsh
sh syno-i686-bootstrap_1.2-7_i686.xsh

安装完成之后即可通过ipkg进行包管理了，

ipkg install binutils

业余爱好『Favourite』

美女图片整理【异常图片】

2023年3月14日 27 条评论

由于爬虫比较多，有的爬虫在下载的时候没有处理网络问题或者图片本山链接错误导致的图片异常。有的是处理了的，不要问为什么没加异常检测，问就是懒。

下载的图片会出现下面的问题，其实预览的时候就会发下问题了，另外打开这个图片其实会显示404或者502之类的错误页面。所以写了一段处理代码，主要两个功能：

1.删除小文件，至于多小自己去调整代码
2.如果目录下所有的文件都有问题，删除文件后同时删除目录

业余爱好『Favourite』

m3u8 downloader [23.03.04][Windows]

2023年3月4日 18 条评论

更新记录：
1.修复txt文件url列表格式下载导致的windows下的文件名命名错误

m3u8_downloader.exe
****************************************************************************************************
Verson: 23.03.04
m3u8_downloader -i <input m3u8 link> -o <output file> -p <out put path> -f <input file> -m <ffmpeg path>
Need Arguments:
         -i <input m3u8 link>
Option Arguments:
         -o <output file> -p <out put path> -f <input file>
         -m <ffmpeg path>
ffmpeg:F:\Pycharm_Projects\m3u8_downloader\dist\m3u8_downloader\bin/ffmpeg.exe
Blog: http://www.h4ck.org.cn
Source Code: http://h4ck.org.cn/2020/01/基于ffmpeg的m3u8下载/
****************************************************************************************************