Ausgabe der neuen DB Einträge
This commit is contained in:
parent
bad48e1627
commit
cfbbb9ee3d
2399 changed files with 843193 additions and 43 deletions
|
|
@ -0,0 +1 @@
|
|||
pip
|
||||
|
|
@ -0,0 +1,27 @@
|
|||
Copyright (c) Scrapy developers.
|
||||
All rights reserved.
|
||||
|
||||
Redistribution and use in source and binary forms, with or without modification,
|
||||
are permitted provided that the following conditions are met:
|
||||
|
||||
1. Redistributions of source code must retain the above copyright notice,
|
||||
this list of conditions, and the following disclaimer.
|
||||
|
||||
2. Redistributions in binary form must reproduce the above copyright
|
||||
notice, this list of conditions, and the following disclaimer in the
|
||||
documentation and/or other materials provided with the distribution.
|
||||
|
||||
3. Neither the name of Scrapy nor the names of its contributors may be used
|
||||
to endorse or promote products derived from this software without
|
||||
specific prior written permission.
|
||||
|
||||
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND
|
||||
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
|
||||
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
|
||||
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR
|
||||
ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
|
||||
(INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
|
||||
LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
|
||||
ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
|
||||
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
|
||||
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
|
||||
|
|
@ -0,0 +1,107 @@
|
|||
Metadata-Version: 2.1
|
||||
Name: itemloaders
|
||||
Version: 1.0.4
|
||||
Summary: Base library for scrapy's ItemLoader
|
||||
Home-page: https://github.com/scrapy/itemloaders
|
||||
Author: Scrapinghub
|
||||
Author-email: info@scrapinghub.com
|
||||
License: BSD
|
||||
Project-URL: Documentation, https://itemloaders.readthedocs.io/
|
||||
Project-URL: Source, https://github.com/scrapy/itemloaders
|
||||
Platform: UNKNOWN
|
||||
Classifier: Development Status :: 5 - Production/Stable
|
||||
Classifier: Intended Audience :: Developers
|
||||
Classifier: License :: OSI Approved :: BSD License
|
||||
Classifier: Operating System :: OS Independent
|
||||
Classifier: Programming Language :: Python
|
||||
Classifier: Programming Language :: Python :: 3
|
||||
Classifier: Programming Language :: Python :: 3.6
|
||||
Classifier: Programming Language :: Python :: 3.7
|
||||
Classifier: Programming Language :: Python :: 3.8
|
||||
Classifier: Programming Language :: Python :: Implementation :: CPython
|
||||
Classifier: Programming Language :: Python :: Implementation :: PyPy
|
||||
Requires-Python: >=3.6
|
||||
Description-Content-Type: text/x-rst
|
||||
Requires-Dist: w3lib (>=1.17.0)
|
||||
Requires-Dist: parsel (>=1.5.0)
|
||||
Requires-Dist: jmespath (>=0.9.5)
|
||||
Requires-Dist: itemadapter (>=0.1.0)
|
||||
|
||||
===========
|
||||
itemloaders
|
||||
===========
|
||||
|
||||
.. image:: https://img.shields.io/pypi/v/itemloaders.svg
|
||||
:target: https://pypi.python.org/pypi/itemloaders
|
||||
:alt: PyPI Version
|
||||
|
||||
.. image:: https://img.shields.io/pypi/pyversions/itemloaders.svg
|
||||
:target: https://pypi.python.org/pypi/itemloaders
|
||||
:alt: Supported Python Versions
|
||||
|
||||
.. image:: https://travis-ci.com/scrapy/itemloaders.svg?branch=master
|
||||
:target: https://travis-ci.com/scrapy/itemloaders
|
||||
:alt: Build Status
|
||||
|
||||
.. image:: https://codecov.io/github/scrapy/itemloaders/coverage.svg?branch=master
|
||||
:target: https://codecov.io/gh/scrapy/itemloaders
|
||||
:alt: Coverage report
|
||||
|
||||
.. image:: https://readthedocs.org/projects/itemloaders/badge/?version=latest
|
||||
:target: https://itemloaders.readthedocs.io/en/latest/?badge=latest
|
||||
:alt: Documentation Status
|
||||
|
||||
|
||||
``itemloaders`` is a library that helps you collect data from HTML and XML sources.
|
||||
|
||||
It comes in handy to extract data from web pages, as it supports
|
||||
data extraction using CSS and XPath Selectors.
|
||||
|
||||
It's specially useful when you need to standardize the data from many sources.
|
||||
For example, it allows you to have all your casting and parsing rules in a
|
||||
single place.
|
||||
|
||||
Here is an example to get you started::
|
||||
|
||||
from itemloaders import ItemLoader
|
||||
from parsel import Selector
|
||||
|
||||
html_data = '''
|
||||
<!DOCTYPE html>
|
||||
<html>
|
||||
<head>
|
||||
<title>Some random product page</title>
|
||||
</head>
|
||||
<body>
|
||||
<div class="product_name">Some random product page</div>
|
||||
<p id="price">$ 100.12</p>
|
||||
</body>
|
||||
</html>
|
||||
'''
|
||||
loader = ItemLoader(selector=Selector(html_data))
|
||||
loader.add_xpath('name', '//div[@class="product_name"]/text()')
|
||||
loader.add_xpath('name', '//div[@class="product_title"]/text()')
|
||||
loader.add_css('price', '#price::text')
|
||||
loader.add_value('last_updated', 'today') # you can also use literal values
|
||||
item = loader.load_item()
|
||||
item
|
||||
# {'name': ['Some random product page'], 'price': ['$ 100.12'], 'last_updated': ['today']}
|
||||
|
||||
For more information, check out the `documentation <https://itemloaders.readthedocs.io/en/latest/>`_.
|
||||
|
||||
Contributing
|
||||
============
|
||||
|
||||
All contributions are welcome!
|
||||
|
||||
* If you want to review some code, check open
|
||||
`Pull Requests here <https://github.com/scrapy/itemloaders/pulls>`_
|
||||
|
||||
* If you want to submit a code change
|
||||
|
||||
* File an `issue here <https://github.com/scrapy/itemloaders/issues>`_, if there isn't one yet
|
||||
* Fork this repository
|
||||
* Create a branch to work on your changes
|
||||
* Push your local branch and submit a Pull Request
|
||||
|
||||
|
||||
|
|
@ -0,0 +1,14 @@
|
|||
itemloaders-1.0.4.dist-info/INSTALLER,sha256=zuuue4knoyJ-UwPPXg8fezS7VCrXJQrAP7zeNuwvFQg,4
|
||||
itemloaders-1.0.4.dist-info/LICENSE,sha256=_TMa88y4d2u-fzF4IeXguRwqt8c0ZOd0JdUiBPe12YA,1517
|
||||
itemloaders-1.0.4.dist-info/METADATA,sha256=UtfloJ5xlowMIeYjuZ0856mDpps7-IRv0wT8JHF4ABk,3715
|
||||
itemloaders-1.0.4.dist-info/RECORD,,
|
||||
itemloaders-1.0.4.dist-info/WHEEL,sha256=EVRjI69F5qVjm_YgqcTXPnTAv3BfSUr0WVAHuSP3Xoo,92
|
||||
itemloaders-1.0.4.dist-info/top_level.txt,sha256=8r2IbIMBVz7V7Ljj61V2IbRlFdDKqHljKTthRr_lqck,12
|
||||
itemloaders/__init__.py,sha256=7Oe8zALHvZLyOl68yUrhwSY7HSxgdjx2vc58IB-bnYI,16496
|
||||
itemloaders/__pycache__/__init__.cpython-39.pyc,,
|
||||
itemloaders/__pycache__/common.cpython-39.pyc,,
|
||||
itemloaders/__pycache__/processors.cpython-39.pyc,,
|
||||
itemloaders/__pycache__/utils.cpython-39.pyc,,
|
||||
itemloaders/common.py,sha256=khpuozzw70n1Xr_yGi0x7eLMPaj_0b6WddN5cfTI8Bg,466
|
||||
itemloaders/processors.py,sha256=EEKrlWd9fEuECD9AO1_28EB9jGv0yw2D1987Dg_4WgI,8550
|
||||
itemloaders/utils.py,sha256=JIe-yzt6lng6kqxO1Oc8iDjnPIoewqnetPNRl4t1y4k,2156
|
||||
|
|
@ -0,0 +1,5 @@
|
|||
Wheel-Version: 1.0
|
||||
Generator: bdist_wheel (0.35.1)
|
||||
Root-Is-Purelib: true
|
||||
Tag: py3-none-any
|
||||
|
||||
|
|
@ -0,0 +1 @@
|
|||
itemloaders
|
||||
Loading…
Add table
Add a link
Reference in a new issue