June 2018 – 大步Blog

Unicode(0xb) error-An invalid XML character (Unicode: 0xb) was found in the element content of the document.

ksharpdabu 2018-06-30 0 Comments

问题描述：今天对接接口的时候，在解析对方的xml时，报了这样一个错误“Unicode(0xb) error-An invalid XML character (Unicode: 0xb) was found in the element content of the document.” 原因分析：报错提示是说，xml中有一个非法的xml字符（Unicode: 0xb），unicode编码的0xb表示垂直tab（ vertical tab，VT），在notepad++之类的文本编辑器中，显示为VT。详情见：https://en.wikipedia.org/wiki/Tab_key。如有以下unicode编码后的文本： %u63a5%u53d7%0b%u3002 unicdoe解码后，notepad++中的显示，可以看到VT字符：对于xml 1.0来说，它的合法的字符范围应该是（见：https://en.wikipedia.org/wiki/Valid_characters_in_XML#XML_1.0）： XML 1.0 Unicode…

python

批量删除新浪博客文章的请求分析

ksharpdabu 2018-06-24 2 Comments

新浪博客、网易博客这种博客平台，功能不多，连批量删除文章的功能都没有，本来想网上下个工具直接删，后来想想怕不安全，索性自己分析请求，写个脚本删除。工具： chorme的f12 python3.6 分析过程： 1.先f12，分别录制删除操作的请求地址和参数 Request URL: http://control.blog.sina.com.cn/admin/article/article_del_recycle.php?domain=1 Request Method: POST Status Code: 200 OK Remote Address: 219.142.118.113:80 request param： blog_id: 70b92d280102ux4j uid: 1312312312 成功后的response： {“code”:”A00006″,”data”:””} 从上面看到，blog_id就是要删除的文章的id，uid是博客账号的id。其他的cookies之类的参数，f12都能全都获取到，写py脚本的时候照着拷贝一份即可。 2.获取所有删除的文章的id。在文章列表管理页面，查看源码，就能看到每页的文章列表的blog_id了，直接在console里循环请求文章列表页，然后提取每页的blog_id，拼接成一个list，就获得所有文章的blog_id了。 3.编写py脚本删除。请求都分析完了，剩下的就是py拼写http请求参数了，这里就不贴代码了。…

Unicode(0xb) error-An invalid XML character (Unicode: 0xb) was found in the element content of the document.

批量删除新浪博客文章的请求分析

You Missed

Golang实现webVTT字幕转SRT字幕

git迁移远程仓库同时保留所有提交记录和所有分支

Spring MVC接口总是默认返回XML的问题排查

git如何删除已经push到远程的提交