[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"$fuEe8IjdbzOmgWXEbjXAoEIDd3QzSSAFg2ZK3_zN8e1o":3},{"answer":4,"createTime":5,"id":6,"options":7,"origin":12,"question":19,"related":20,"source":24,"type":25},[],"2024-12-29 14:38:08",174917969,[8,9,10,11],"lxml","Beautiful Soup","JSONPath","Requests",{"count":13,"courseId":14,"courseImg":15,"courseName":16,"workId":17,"workName":18},30,"58c966a7fc5da6d84866e5685d0419d2","https:\u002F\u002Ftihai-oss-cloud.itihey.com\u002Fimg\u002F9c1e48361b00f3ee2086f4e259ed792b.jpg","网络爬虫","work_39754128","复习题-选择题1","下列选项中,不能用于解析网页数据的是()",[21,26,35,44,54,63,70,79,88,97],{"answer":22,"createTime":5,"id":6,"options":23,"question":19,"source":24,"type":25},[],[8,9,10,11],"v1",0,{"answer":27,"createTime":5,"id":28,"options":29,"question":34,"source":24,"type":25},[],174917970,[30,31,32,33],"如果要抓取静态网页的数据,只需要获得网页的源代码即可","通过urllib、urllib3和Requests等库抓取静态网页数据","Requests库只能发送网络请求不能获取网页源码","抓取静态网页数据的整个过程是模仿用户通过浏览器访问网页的过程","关于抓取静态网页实现技术的说法,下列描述错误的是( )",{"answer":36,"createTime":5,"id":37,"options":38,"question":43,"source":24,"type":25},[],174917971,[39,40,41,42],"如果网页返回的是结构化数据,那么无法使用Python进行提取","对于非结构化数据的提取可以使用正则表达式、XPath、CSS选择器进行提取","结构化数据是先有结构,再有数据","非结构化数据是先有数据,再有结构","下列选项中,关于网页数据格式的描述说法错误的是()",{"answer":45,"createTime":46,"id":47,"options":48,"question":53,"source":24,"type":25},[],"2024-12-29 14:38:09",174917972,[49,50,51,52],"http:\u002F\u002F127.0.0:8000\u002Fstatic\u002Fgoods\u002Fglass.jpg","http:\u002F\u002F127.0.0:8000\u002Fstatic\u002Fgoods\u002Fglass.png","http:\u002F\u002F127.0.0:8000\u002Fimages\u002Fglass.png","http:\u002F\u002F127.0.0:8000\u002Fglass.png","下列选项中,表示访问服务器images目录下的glass.png的是()",{"answer":55,"createTime":46,"id":56,"options":57,"question":62,"source":24,"type":25},[],174917973,[58,59,60,61],"JSON","HTML","CSV","XML","下列选项中,不属于结构化数据的是()",{"answer":64,"createTime":46,"id":65,"options":66,"question":69,"source":24,"type":25},[],174917974,[9,67,11,68],"Scrapy","urllib","下列选项中属于内置库的是()",{"answer":71,"createTime":46,"id":72,"options":73,"question":78,"source":24,"type":25},[],174917975,[74,75,76,77],"JSONPath只适用于JSON文档","JSONPath提供了描述JSON文档层次结构的表达式","JSONPath提供的语法与XPath提供的语法相同","JSONPath可以看作定位目标对象位置的语言","关于JSONPath的描述,说法错误的是()",{"answer":80,"createTime":46,"id":81,"options":82,"question":87,"source":24,"type":25},[],174917976,[83,84,85,86],"Selenium是一个开源的、便携式的自动化测试工具","Selenium可以模拟用户使用浏览器完成一些动作","Selenium最初的目的就是为了便于网络爬虫抓取动态网页数据","Selenium需要通过浏览器驱动程序WebDriver才能与所选浏览器进行交互","下列选项中,关于Selenium的描述说法错误的是()",{"answer":89,"createTime":46,"id":90,"options":91,"question":96,"source":24,"type":25},[],174917977,[92,93,94,95],"Selenium可以模拟用户输入文本、选择下拉框、单击按钮、单击超链接等操作","Selenium不支持IE浏览器","PyAutoGUI可以控制鼠标和键盘自动与其他应用程序交互","Splash用于JavaScript渲染服务,是一个带有HTTP API的轻量级Web浏览器","下列选项中,关于抓取动态网页的实现技术的描述错误的是()",{"answer":98,"createTime":46,"id":99,"options":100,"question":105,"source":24,"type":25},[],174917978,[101,102,103,104],"XPath基于XML或HTML的节点树定位目标节点所在的位置","XPath是一种用于确定XML文档中部分节点位置的语言","XPath匹配节点的方式与正则表达式匹配字符串的方式类似","XPath通过路径表达式可以快速地定位与选取XML或HTML文档中的一个节点或者一组节点集","关于XPath的描述,说法错误的是()"]