Output of the new DB entries

hubobel 2022-01-02 21:50:48 +01:00
parent bad48e1627
commit cfbbb9ee3d
2399 changed files with 843193 additions and 43 deletions


@@ -0,0 +1,27 @@
Copyright (c) Scrapy developers.
All rights reserved.
Redistribution and use in source and binary forms, with or without modification,
are permitted provided that the following conditions are met:
1. Redistributions of source code must retain the above copyright notice,
this list of conditions, and the following disclaimer.
2. Redistributions in binary form must reproduce the above copyright
notice, this list of conditions, and the following disclaimer in the
documentation and/or other materials provided with the distribution.
3. Neither the name of Scrapy nor the names of its contributors may be used
to endorse or promote products derived from this software without
specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR
ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
(INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.


@@ -0,0 +1,107 @@
Metadata-Version: 2.1
Name: itemloaders
Version: 1.0.4
Summary: Base library for scrapy's ItemLoader
Home-page: https://github.com/scrapy/itemloaders
Author: Scrapinghub
Author-email: info@scrapinghub.com
License: BSD
Project-URL: Documentation, https://itemloaders.readthedocs.io/
Project-URL: Source, https://github.com/scrapy/itemloaders
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Programming Language :: Python :: Implementation :: PyPy
Requires-Python: >=3.6
Description-Content-Type: text/x-rst
Requires-Dist: w3lib (>=1.17.0)
Requires-Dist: parsel (>=1.5.0)
Requires-Dist: jmespath (>=0.9.5)
Requires-Dist: itemadapter (>=0.1.0)
===========
itemloaders
===========
.. image:: https://img.shields.io/pypi/v/itemloaders.svg
:target: https://pypi.python.org/pypi/itemloaders
:alt: PyPI Version
.. image:: https://img.shields.io/pypi/pyversions/itemloaders.svg
:target: https://pypi.python.org/pypi/itemloaders
:alt: Supported Python Versions
.. image:: https://travis-ci.com/scrapy/itemloaders.svg?branch=master
:target: https://travis-ci.com/scrapy/itemloaders
:alt: Build Status
.. image:: https://codecov.io/github/scrapy/itemloaders/coverage.svg?branch=master
:target: https://codecov.io/gh/scrapy/itemloaders
:alt: Coverage report
.. image:: https://readthedocs.org/projects/itemloaders/badge/?version=latest
:target: https://itemloaders.readthedocs.io/en/latest/?badge=latest
:alt: Documentation Status
``itemloaders`` is a library that helps you collect data from HTML and XML sources.
It comes in handy to extract data from web pages, as it supports
data extraction using CSS and XPath Selectors.
It's especially useful when you need to standardize the data from many sources.
For example, it allows you to have all your casting and parsing rules in a
single place.
Here is an example to get you started::

    from itemloaders import ItemLoader
    from parsel import Selector

    html_data = '''
    <!DOCTYPE html>
    <html>
        <head>
            <title>Some random product page</title>
        </head>
        <body>
            <div class="product_name">Some random product page</div>
            <p id="price">$ 100.12</p>
        </body>
    </html>
    '''
    loader = ItemLoader(selector=Selector(html_data))
    loader.add_xpath('name', '//div[@class="product_name"]/text()')
    loader.add_xpath('name', '//div[@class="product_title"]/text()')
    loader.add_css('price', '#price::text')
    loader.add_value('last_updated', 'today') # you can also use literal values
    item = loader.load_item()
    item
    # {'name': ['Some random product page'], 'price': ['$ 100.12'], 'last_updated': ['today']}

For more information, check out the `documentation <https://itemloaders.readthedocs.io/en/latest/>`_.
Contributing
============
All contributions are welcome!
* If you want to review some code, check open
  `Pull Requests here <https://github.com/scrapy/itemloaders/pulls>`_
* If you want to submit a code change

  * File an `issue here <https://github.com/scrapy/itemloaders/issues>`_, if there isn't one yet
  * Fork this repository
  * Create a branch to work on your changes
  * Push your local branch and submit a Pull Request


@@ -0,0 +1,14 @@
itemloaders-1.0.4.dist-info/INSTALLER,sha256=zuuue4knoyJ-UwPPXg8fezS7VCrXJQrAP7zeNuwvFQg,4
itemloaders-1.0.4.dist-info/LICENSE,sha256=_TMa88y4d2u-fzF4IeXguRwqt8c0ZOd0JdUiBPe12YA,1517
itemloaders-1.0.4.dist-info/METADATA,sha256=UtfloJ5xlowMIeYjuZ0856mDpps7-IRv0wT8JHF4ABk,3715
itemloaders-1.0.4.dist-info/RECORD,,
itemloaders-1.0.4.dist-info/WHEEL,sha256=EVRjI69F5qVjm_YgqcTXPnTAv3BfSUr0WVAHuSP3Xoo,92
itemloaders-1.0.4.dist-info/top_level.txt,sha256=8r2IbIMBVz7V7Ljj61V2IbRlFdDKqHljKTthRr_lqck,12
itemloaders/__init__.py,sha256=7Oe8zALHvZLyOl68yUrhwSY7HSxgdjx2vc58IB-bnYI,16496
itemloaders/__pycache__/__init__.cpython-39.pyc,,
itemloaders/__pycache__/common.cpython-39.pyc,,
itemloaders/__pycache__/processors.cpython-39.pyc,,
itemloaders/__pycache__/utils.cpython-39.pyc,,
itemloaders/common.py,sha256=khpuozzw70n1Xr_yGi0x7eLMPaj_0b6WddN5cfTI8Bg,466
itemloaders/processors.py,sha256=EEKrlWd9fEuECD9AO1_28EB9jGv0yw2D1987Dg_4WgI,8550
itemloaders/utils.py,sha256=JIe-yzt6lng6kqxO1Oc8iDjnPIoewqnetPNRl4t1y4k,2156
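
Each entry above has the shape `path,hash,size`. The `sha256=` values appear to follow the usual wheel RECORD convention: the SHA-256 digest encoded as URL-safe base64 with the trailing `=` padding stripped, followed by the file size in bytes. Entries such as `RECORD` itself and the `.pyc` files leave both fields empty. The following is a minimal sketch of how one entry could be checked under that assumption. The helper names `record_hash`, `parse_record_line`, and `matches` are hypothetical and are not part of pip or itemloaders.

```python
import base64
import csv
import hashlib


def record_hash(data: bytes) -> str:
    """Return a RECORD-style hash field for the given file contents."""
    digest = hashlib.sha256(data).digest()
    # URL-safe base64 with the '=' padding removed (assumed convention).
    encoded = base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")
    return f"sha256={encoded}"


def parse_record_line(line: str):
    """Split one RECORD row into (path, hash or None, size or None)."""
    path, hash_field, size_field = next(csv.reader([line]))
    size = int(size_field) if size_field else None
    return path, hash_field or None, size


def matches(data: bytes, line: str) -> bool:
    """Check file contents against one RECORD row."""
    _, expected_hash, expected_size = parse_record_line(line)
    if expected_hash is None:
        # Rows without a hash (RECORD itself, .pyc files) cannot be verified.
        return True
    return record_hash(data) == expected_hash and len(data) == expected_size
```

To use it, read a listed file in binary mode and pass its bytes together with the corresponding row, for example `matches(open("itemloaders/utils.py", "rb").read(), row)`. A `False` result would suggest the file was modified after installation.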


@@ -0,0 +1,5 @@
Wheel-Version: 1.0
Generator: bdist_wheel (0.35.1)
Root-Is-Purelib: true
Tag: py3-none-any


@@ -0,0 +1 @@
itemloaders