
ckanext-harvest's Introduction

ckanext-harvest - Remote harvesting extension


This extension provides a common harvesting framework for CKAN extensions and adds a command line interface (CLI) and a web user interface (WUI) to CKAN for managing harvest sources and jobs.

Installation

This extension requires CKAN v2.0 or later, both on the CKAN it is installed into and on the CKAN instances it harvests. In practice, you are unlikely to encounter a CKAN running a version lower than 2.0.

  1. The harvest extension can use two different backends. Choose whichever suits your needs, but note that Redis has been found to be more stable and reliable, so it is the recommended one:
    • Redis (recommended): To install it, run:

      sudo apt-get update
      sudo apt-get install redis-server

      In your CKAN configuration file, add to the [app:main] section:

      ckan.harvest.mq.type = redis
    • RabbitMQ: To install it, run:

      sudo apt-get update
      sudo apt-get install rabbitmq-server

      In your CKAN configuration file, add to the [app:main] section:

      ckan.harvest.mq.type = amqp
  2. Activate your CKAN virtual environment, for example:

    $ . /usr/lib/ckan/default/bin/activate
  3. Install the ckanext-harvest Python package into your virtual environment:

    (pyenv) $ pip install -e git+https://github.com/ckan/ckanext-harvest.git#egg=ckanext-harvest
  4. Install the Python modules required by the extension (adjusting the path according to where ckanext-harvest was installed in the previous step):

    (pyenv) $ cd /usr/lib/ckan/default/src/ckanext-harvest/
    (pyenv) $ pip install -r requirements.txt
  5. Make sure the CKAN configuration ini file contains the harvest main plugin, as well as the harvester for CKAN instances if you need it (included with the extension):

    ckan.plugins = harvest ckan_harvester
  6. If you haven't already done so in step 1, define the backend that you are using with the ckan.harvest.mq.type option in the [app:main] section (it defaults to amqp):

    ckan.harvest.mq.type = redis

There are a number of configuration options available for the backends. These don't need to be modified at all if you are using the default Redis or RabbitMQ install (step 1). However, you may wish to add them, with custom values, to the [app:main] section of the CKAN config file (see the example after the list below). The list below shows the available options and their default values:

  • Redis:
    • ckan.harvest.mq.hostname (localhost)
    • ckan.harvest.mq.port (6379)
    • ckan.harvest.mq.redis_db (0)
    • ckan.harvest.mq.password (None)
  • RabbitMQ:
    • ckan.harvest.mq.user_id (guest)
    • ckan.harvest.mq.password (guest)
    • ckan.harvest.mq.hostname (localhost)
    • ckan.harvest.mq.port (5672)
    • ckan.harvest.mq.virtual_host (/)
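
For example, to point the harvester at a Redis server running on a different host, you could add something like this to the [app:main] section (the hostname and password below are illustrative):

ckan.harvest.mq.type = redis
ckan.harvest.mq.hostname = redis.internal.example.com
ckan.harvest.mq.port = 6379
ckan.harvest.mq.password = changeme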

Note: it is safe to use the same backend server (either Redis or RabbitMQ) for different CKAN instances, as long as they have different site IDs. The ckan.site_id config option (or its default) will be used to namespace the relevant things:

  • On RabbitMQ it will be used to name the queues used, eg ckan.harvest.site1.gather and ckan.harvest.site1.fetch.
  • On Redis, it will namespace the keys used, so only the relevant instance gets them, eg site1:harvest_job_id, site1:harvest_object_id:804f114a-8f68-4e7c-b124-3eb00f66202f
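
For example, two CKAN instances sharing one Redis or RabbitMQ server just need distinct site IDs in their respective [app:main] sections:

# instance 1 ini file
ckan.site_id = site1

# instance 2 ini file
ckan.site_id = site2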

Configuration

Run the following command to create the necessary tables in the database (ensuring the pyenv is activated):

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini db upgrade -p harvest

Finally, restart CKAN to have the changes take effect:

sudo service apache2 restart

After installation, the harvest source listing should be available under /harvest, eg:

http://localhost/harvest

Database logger configuration (optional)

  1. Logging to the database is disabled by default. If you want your CKAN harvest logs to be exposed to the CKAN API, you need to configure the logger with the following configuration parameter:

    ckan.harvest.log_scope = 0
    • -1 - Do not log in the database - DEFAULT
    • 0 - Log everything
    • 1 - model, logic.action, logic.validators, harvesters
    • 2 - model, logic.action, logic.validators
    • 3 - model, logic.action
    • 4 - logic.action
    • 5 - model
    • 6 - plugin
    • 7 - harvesters
  2. Set the time frame (in days) for the clean-up mechanism with the following config parameter (in the [app:main] section):

    ckan.harvest.log_timeframe = 10

    If no value is present the default is 30 days.

  3. Set the log level for the database logger:

    ckan.harvest.log_level = info

    If no log level is set the default is debug.
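
Putting the three logger options together, a typical [app:main] block using the example values above would look like:

ckan.harvest.log_scope = 0
ckan.harvest.log_timeframe = 10
ckan.harvest.log_level = info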

API Usage

You can access CKAN harvest logs via the API:

$ curl {ckan_url}/api/3/action/harvest_log_list

Replace {ckan_url} with the URL of your CKAN instance.

Allowed parameters are:

  • level (filter log records by level)
  • limit (used for pagination)
  • offset (used for pagination)

e.g. Fetch all logs with log level INFO:

$ curl {ckan_url}/api/3/action/harvest_log_list?level=info

{
  "help": "http://127.0.0.1:5000/api/3/action/help_show?name=harvest_log_list",
  "success": true,
  "result": [
    {"content": "Sent job aa987717-2316-4e47-b0f2-cbddfb4c4dfc to the gather queue", "level": "INFO", "created": "2016-06-03 10:59:40.961657"},
    {"content": "Sent job aa987717-2316-4e47-b0f2-cbddfb4c4dfc to the gather queue", "level": "INFO", "created": "2016-06-03 10:59:40.951548"}
  ]
}

Dataset name generation configuration (optional)

If the dataset name is created based on the title, duplicate names may occur. To avoid this, a suffix is appended to the name if it already exists.

You can configure the default behaviour in your production.ini:

ckanext.harvest.default_dataset_name_append = number-sequence

or

ckanext.harvest.default_dataset_name_append = random-hex

If you don't specify this setting, the default will be number-sequence.
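
For example, if three harvested datasets would otherwise all be named my-dataset, the resulting names would look something like this (the exact suffixes, in particular the hex values, are illustrative):

my-dataset, my-dataset-1, my-dataset-2          (number-sequence)
my-dataset, my-dataset-3f2a, my-dataset-b81c    (random-hex)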

Send error mails when harvesting fails (optional)

If you want to send an email when a Harvest Job fails, you can set the following configuration option in the ini file:

ckan.harvest.status_mail.errored = True

If you want to send an email whenever a harvest job finishes (whether or not it failed), you can set the following configuration option in the ini file:

ckan.harvest.status_mail.all = True

With either option enabled, all CKAN users who are declared as sysadmins will receive the error emails at their configured email address. If the harvest source of the failing harvest job belongs to an organization, the error mail will also be sent to the organization members with the admin role, provided their email address is configured.

If you don't specify these settings, the default will be False.

Set a timeout for a harvest job (optional)

If you want to set a timeout for harvest jobs, you can add this configuration option to the ini file:

ckan.harvest.timeout = 1440

The timeout value is in minutes, so 1440 represents 24 hours. Any jobs which are timed out will create an error message for the user to see.

If you don't specify this setting, the default will be False and there will be no timeout on harvest jobs. This timeout value is compared to the completion time of the last object in the job.

Avoid overwriting certain fields (optional)

If you want to prevent some fields from being overwritten by harvesting, you can add a list of fields that should not be overwritten to not_overwrite_fields in the ini file. This is useful if you want to add additional fields to the harvested datasets, or if you want to alter them after they have been harvested. For example, to retain changes made by users to the fields description and tags:

ckan.harvest.not_overwrite_fields = description tags

Command line interface

The following operations can be run from the command line as described underneath:

harvester source {name} {url} {type} [{title}] [{active}] [{owner_org}] [{frequency}] [{config}]
  - create new harvest source

harvester source {source-id/name}
  - shows a harvest source

harvester rmsource {source-id/name}
  - remove (deactivate) a harvester source, whilst leaving any related
    datasets, jobs and objects

harvester clearsource {source-id/name}
  - clears all datasets, jobs and objects related to a harvest source,
    but keeps the source itself

harvester clearsource-history [{source-id}] [-k]
  - If no source id is given the history for all harvest sources (maximum is 1000)
    will be cleared.
    Clears all jobs and objects related to a harvest source, but keeps the source
    itself. The datasets imported from the harvest source will **NOT** be deleted!!!
    If a source id is given, it only clears the history of the harvest source with
    the given source id.

    To keep the currently active jobs use the -k option.

harvester sources [all]
  - lists harvest sources
    If 'all' is defined, it also shows the Inactive sources

harvester job {source-id/name}
  - create new harvest job

harvester jobs
  - lists harvest jobs

harvester job-abort {source-id/name}
  - marks a job as "Aborted" so that the source can be restarted afresh.
    It ensures that the job's harvest objects status are also marked
    finished. You should ensure that neither the job nor its objects are
    currently in the gather/fetch queues.

harvester run
  - starts any harvest jobs that have been created by putting them onto
    the gather queue. Also checks running jobs - if finished it
    changes their status to Finished.

harvester run-test {source-id/name}
  - runs a harvest - for testing only.
    This does all the stages of the harvest (creates job, gather, fetch,
    import) without involving the web UI or the queue backends. This is
    useful for testing a harvester without having to fire up
    gather/fetch_consumer processes, as is done in production.

harvester run-test {source-id/name} force-import=guid1,guid2...
  - In order to force an import of particular datasets, useful to
    target a dataset for dev purposes or when forcing imports on other environments.

harvester gather-consumer
  - starts the consumer for the gathering queue

harvester fetch-consumer
  - starts the consumer for the fetching queue

harvester purge-queues
  - removes all jobs from fetch and gather queue
    WARNING: if using Redis, this command purges all data in the current
    Redis database

harvester clean-harvest-log
  - Clean-up mechanism for the harvest log table.
    You can configure the time frame through the configuration
    parameter 'ckan.harvest.log_timeframe'. The default time frame is 30 days

harvester [-j] [-o] [--segments={segments}] import [{source-id}]
  - perform the import stage with the last fetched objects, for a certain
    source or a single harvest object. Please note that no objects will
    be fetched from the remote server. It will only affect the objects
    already present in the database.

    To import a particular harvest source, specify its id as an argument.
    To import a particular harvest object use the -o option.
    To import a particular package use the -p option.

    You will need to specify the -j flag in cases where the datasets are
    not yet created (e.g. first harvest, or all previous harvests have
    failed)

    The --segments flag allows you to define a string containing hex digits that represent
    which of the 16 harvest object segments to import, e.g. 15af will run segments 1, 5, a and f

harvester job-all
  - create new harvest jobs for all active sources.

harvester reindex
  - reindexes the harvest source datasets

The commands should be run with the pyenv activated and refer to your CKAN configuration file:

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini harvester --help

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini harvester sources

Note that on CKAN >= 2.9 all commands with an underscore in their name changed. They now use a hyphen instead of an underscore (e.g. gather_consumer changed to gather-consumer).

Authorization

Harvest sources behave exactly the same as datasets (they are actually internally implemented as a dataset type). That means they can be searched and faceted, and that the same authorization rules can be applied to them. The default authorization settings are based on organizations.

Have a look at the Authorization documentation on CKAN core to see how to configure your instance depending on your needs.

The CKAN harvester

The plugin includes a harvester for remote CKAN instances. To use it, you need to add the ckan_harvester plugin to your options file:

ckan.plugins = harvest ckan_harvester

After adding it, a 'CKAN' option should appear in the 'New harvest source' form.

The CKAN harvester supports a number of configuration options to control its behaviour. Those need to be defined as a JSON object in the configuration form field. The currently supported configuration options are:

  • api_version: You can force the harvester to use either version 1 or 2 of the CKAN API. Default is 2.
  • default_tags: A list of tags that will be added to all harvested datasets. Tags don't need to previously exist. This field takes a list of tag dicts (see example), which allows you to optionally specify a vocabulary.
  • default_groups: A list of group IDs or names to which the harvested datasets will be added. The groups must exist.
  • default_extras: A dictionary of key/value pairs that will be added to the extras of the harvested datasets. You can use the following replacement strings, which will be replaced before creating or updating the datasets:
    • {dataset_id}
    • {harvest_source_id}
    • {harvest_source_url} # Will be stripped of trailing forward slashes (/)
    • {harvest_source_title}
    • {harvest_job_id}
    • {harvest_object_id}
  • override_extras: Assign default extras even if they already exist in the remote dataset. Default is False (only non-existing extras are added).
  • user: User who will run the harvesting process. Please note that this user needs permission to create packages and, if default groups were defined, permission to assign packages to those groups.
  • api_key: If the remote CKAN instance has restricted access to the API, you can provide a CKAN API key, which will be sent in any request.
  • read_only: Create harvested packages in read-only mode. Only the user who performed the harvest (the one defined in the previous setting or the 'harvest' sysadmin) will be able to edit and administer the packages created from this harvesting source. Logged-in users and visitors will only be able to read them.
  • force_all: By default, after the first harvest, the harvester will gather only the packages modified on the remote site since the last harvest. Setting this property to true forces the harvester to gather all remote packages regardless of their modification date. Default is False.
  • remote_groups: By default, remote groups are ignored. Setting this property enables the harvester to import the remote groups. There are two alternatives: setting it to 'only_local' will import only those groups whose name/id is already present in the local CKAN, while setting it to 'create' will attempt to create the groups by copying the details from the remote CKAN.
  • remote_orgs: By default, remote organizations are ignored. Setting this property enables the harvester to import remote organizations. There are two alternatives: setting it to 'only_local' will import only those organizations whose id is already present in the local CKAN, while setting it to 'create' will attempt to create the organizations by copying the details from the remote CKAN.
  • clean_tags: By default, tags are not stripped of accent characters, spaces and capital letters for display. If this option is set to True, accent characters will be replaced by their ASCII equivalents, capital letters replaced by lower-case ones, and spaces replaced with dashes. Setting this option to False has the same effect as leaving it unset.
  • organizations_filter_include: This configuration option allows you to specify a list of remote organization names (e.g. "arkansas-gov" is the name of the organization http://catalog.data.gov/organization/arkansas-gov). If this property has a value, then only datasets that are in one of these organizations will be harvested; all other datasets will be skipped. Only one of organizations_filter_include or organizations_filter_exclude should be configured.
  • organizations_filter_exclude: This configuration option allows you to specify a list of remote organization names (e.g. "arkansas-gov" is the name of the organization http://catalog.data.gov/organization/arkansas-gov). If this property is set, then all datasets from the remote source will be harvested unless they belong to one of the organizations in this option. Only one of organizations_filter_exclude or organizations_filter_include should be configured.
  • groups_filter_include: Exactly the same as organizations_filter_include but for groups.
  • groups_filter_exclude: Exactly the same as organizations_filter_exclude but for groups.

Here is an example of a configuration object (the one that must be entered in the configuration field):

{
 "api_version": 1,
 "default_tags": [{"name": "geo"}, {"name": "namibia"}],
 "default_groups": ["science", "spend-data"],
 "default_extras": {"encoding":"utf8", "harvest_url": "{harvest_source_url}/dataset/{dataset_id}"},
 "override_extras": true,
 "organizations_filter_include": [],
 "organizations_filter_exclude": ["remote-organization"],
 "user":"harverster-user",
 "api_key":"<REMOTE_API_KEY>",
 "read_only": true,
 "remote_groups": "only_local",
 "remote_orgs": "create"
}

Plugins can extend the default CKAN harvester and implement modify_package_dict in order to modify the dataset dict generated by the harvester just before it is created or updated. For instance, they might want to add or delete certain fields, or fire additional tasks based on the metadata fields.

Plugins will get the dataset dict including any processing described above (eg with the correct groups assigned, replacement strings applied, etc). They will also be passed the harvest object, which contains the original, unmodified dataset dict in its content property.

This is a simple example:

from ckanext.harvest.harvesters.ckanharvester import CKANHarvester

class MySiteCKANHarvester(CKANHarvester):

    def modify_package_dict(self, package_dict, harvest_object):

        # Set a default custom field

        package_dict['remote_harvest'] = True

        # Add tags
        package_dict['tags'].append({'name': 'sdi'})
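        # The original, unmodified remote dataset dict is also available as a
        # JSON string in harvest_object.content, should you need to inspect it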

        return package_dict

Remember to register your custom harvester plugin in your extension's setup.py file, and load the plugin in the configuration file afterwards:

# setup.py

entry_points='''
    [ckan.plugins]
    my_site=ckanext.my_site.plugin:MySitePlugin
    my_site_ckan_harvester=ckanext.my_site.harvesters:MySiteCKANHarvester
'''


# ini file
ckan.plugins = ... my_site my_site_ckan_harvester

The harvesting interface

Extensions can implement the harvester interface to perform harvesting operations. The harvesting process takes place in three stages:

  1. The gather stage compiles all the resource identifiers that need to be fetched in the next stage (e.g. in a CSW server, it will perform a GetRecords operation).
  2. The fetch stage gets the contents of the remote objects and stores them in the database (e.g. in a CSW server, it will perform n GetRecordById operations).
  3. The import stage performs any necessary actions on the fetched resource (generally creating a CKAN package, but it can be anything the extension needs).

Plugins willing to implement the harvesting interface must provide the following methods:

from ckan.plugins.core import SingletonPlugin, implements
from ckanext.harvest.interfaces import IHarvester

class MyHarvester(SingletonPlugin):
    '''
    A Test Harvester
    '''
    implements(IHarvester)

    def info(self):
        '''
        Harvesting implementations must provide this method, which will return
        a dictionary containing different descriptors of the harvester. The
        returned dictionary should contain:

        * name: machine-readable name. This will be the value stored in the
          database, and the one used by ckanext-harvest to call the appropriate
          harvester.
        * title: human-readable name. This will appear in the form's select box
          in the WUI.
        * description: a small description of what the harvester does. This
          will appear on the form as a guidance to the user.

        A complete example may be::

            {
                'name': 'csw',
                'title': 'CSW Server',
                'description': 'A server that implements the OGC Catalog '
                               'Service for the Web (CSW) standard'
            }

        :returns: A dictionary with the harvester descriptors
        '''

    def validate_config(self, config):
        '''
        [optional]

        Harvesters can provide this method to validate the configuration
        entered in the form. It should return a single string, which will be
        stored in the database. Exceptions raised will be shown in the form's
        error messages.

        :param config: Config string coming from the form
        :returns: A string with the validated configuration options
        '''

    def get_original_url(self, harvest_object_id):
        '''
        [optional]

        This optional but highly recommended method allows harvesters to
        return the URL of the original remote document, given a HarvestObject
        id. Note that from the harvest object you have access to its guid as
        well as the object source, which has the URL.
        This URL will be used on error reports to help publishers link to the
        original document that has the errors. If this method is not provided
        or no URL is returned, only a link to the local copy of the remote
        document will be shown.

        Examples:
            * For a CKAN record: http://{ckan-instance}/api/rest/{guid}
            * For a WAF record: http://{waf-root}/{file-name}
            * For a CSW record: http://{csw-server}/?Request=GetElementById&Id={guid}&...

        :param harvest_object_id: HarvestObject id
        :returns: A string with the URL to the original document
        '''

    def gather_stage(self, harvest_job):
        '''
        The gather stage will receive a HarvestJob object and will be
        responsible for:
            - gathering all the necessary objects to fetch in a later
              stage (e.g. for a CSW server, perform a GetRecords request)
            - creating the necessary HarvestObjects in the database, specifying
              the guid and a reference to their job. The HarvestObjects need a
              reference date with the last modified date for the resource; this
              may need to be set in a different stage depending on the type of
              source.
            - creating and storing any suitable HarvestGatherErrors that may
              occur.
            - returning a list with all the ids of the created HarvestObjects.
            - to abort the harvest, create a HarvestGatherError and raise an
              exception. Any created HarvestObjects will be deleted.

        :param harvest_job: HarvestJob object
        :returns: A list of HarvestObject ids
        '''

    def fetch_stage(self, harvest_object):
        '''
        The fetch stage will receive a HarvestObject object and will be
        responsible for:
            - getting the contents of the remote object (e.g. for a CSW
              server, perform a GetRecordById request).
            - saving the content in the provided HarvestObject.
            - creating and storing any suitable HarvestObjectErrors that may
              occur.
            - returning True if everything is ok (ie the object should now be
              imported), "unchanged" if the object didn't need harvesting after
              all (ie no error, but don't continue to the import stage) or
              False if there were errors.

        :param harvest_object: HarvestObject object
        :returns: True if successful, 'unchanged' if nothing to import after
                  all, False if not successful
        '''

    def import_stage(self, harvest_object):
        '''
        The import stage will receive a HarvestObject object and will be
        responsible for:
            - performing any necessary action with the fetched object (e.g.
              create, update or delete a CKAN package).
              Note: if this stage creates or updates a package, a reference
              to the package should be added to the HarvestObject.
            - setting the HarvestObject.package (if there is one)
            - setting the HarvestObject.current for this harvest:
               - True if successfully created/updated
               - False if successfully deleted
            - setting HarvestObject.current to False for previous harvest
              objects of this harvest source if the action was successful.
            - creating and storing any suitable HarvestObjectErrors that may
              occur.
            - creating the HarvestObject - Package relation (if necessary)
            - returning True if the action was done, "unchanged" if the object
              didn't need harvesting after all or False if there were errors.

        NB You can run this stage repeatedly using the 'harvester import'
        command.

        :param harvest_object: HarvestObject object
        :returns: True if the action was done, "unchanged" if the object didn't
                  need harvesting after all or False if there were errors.
        '''

See the CKAN harvester for an example of how to implement the harvesting interface:

  • ckanext-harvest/ckanext/harvest/harvesters/ckanharvester.py

You can also find other examples of custom harvesters in extensions such as ckanext-spatial and ckanext-dcat.

Running the harvest jobs

There are two ways to run a harvest:

  1. harvester run-test for the command-line, suitable for testing
  2. harvester run used by the Web UI and scheduled runs

harvester run-test

You can run a harvester simply by using the run-test command. This is handy for running a harvest with one command in the console and seeing all the output in-line. It runs the gather, fetch and import stages all in the same process. Before using the run-test command, make sure you have pip-installed dev-requirements.txt in the ckanext-harvest source directory.
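
For example:

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini harvester run-test {source-id/name}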

This is useful for developing a harvester because you can insert break-points in your harvester, and rerun a harvest without having to restart the gather_consumer and fetch_consumer processes each time. In addition, because it doesn't use the queue backends it doesn't interfere with harvests of other sources that may be going on in the background.

However, when running this way, exceptions raised by gather_stage, fetch_stage or import_stage are not caught, whereas with harvester run they are handled slightly differently, as the stages are called by queue.py. So when testing this aspect it's best to use harvester run.

harvester run

When a harvest job is started by a user in the Web UI, or by a scheduled harvest, the harvest is started by the harvester run command. This is the normal method in production systems and scales well.

In this case, the harvesting extension uses two different queues: one that handles the gathering and another one that handles the fetching and importing. To start the consumers run the following command (make sure you have your python environment activated):

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini harvester gather-consumer

On another terminal, run the following command:

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini harvester fetch-consumer

Finally, on a third console, run the following command to start any pending harvesting jobs:

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini harvester run

The run command not only starts any pending harvest jobs, but also flags those that are finished, allowing new jobs to be created on that particular source and refreshing the source statistics. This means that you will need to run this command before being able to create a new job on a source that was already being harvested. (On a production site you will typically have a cron job that runs the command regularly; see the next section.)

Occasionally you may find a harvest job in a "limbo state", where the job has run with errors but the harvester run command will not mark it as finished, and therefore you cannot run another job. This is due to a particular harvester not handling errors correctly, e.g. during development. In this circumstance, ensure that the gather and fetch consumers are running and have nothing more to consume, and then run the abort command with the name or id of the harvest source:

(pyenv) $ ckan --config=/etc/ckan/default/ckan.ini harvester job-abort {source-id/name}

Setting up the harvesters on a production server

The previous approach works fine during development or debugging, but it is not recommended for production servers. There are several possible ways of setting up the harvesters, which will depend on your particular infrastructure and needs. The bottom line is that the gather and fetch processes should be kept running somehow, and the run command should then be run periodically to start any pending jobs.

The following approach is the one generally used on CKAN deployments, and it will probably suit most of the users. It uses Supervisor, a tool to monitor processes, and a cron job to run the harvest jobs, and it assumes that you have already installed and configured the harvesting extension (See Installation if not).

Note: It is recommended to run the harvest process from a non-root user (generally the one you are running CKAN with). Replace the user ckan in the following steps with the one you are using.

  1. Install Supervisor:

    sudo apt-get update
    sudo apt-get install supervisor

    You can check if it is running with this command:

    ps aux | grep supervisord

    You should see a line similar to this one:

    root      9224  0.0  0.3  56420 12204 ?        Ss   15:52   0:00 /usr/bin/python /usr/bin/supervisord
  2. Supervisor needs to have programs added to its configuration, which describe the tasks that need to be monitored. These configuration files are stored in /etc/supervisor/conf.d.

    Create a file named /etc/supervisor/conf.d/ckan_harvesting.conf, and copy the following contents:

    On CKAN >= 2.9:

    ; ===============================
    ; ckan harvester
    ; ===============================
    
    [program:ckan_gather_consumer]
    
    command=/usr/lib/ckan/default/bin/ckan --config=/etc/ckan/default/ckan.ini harvester gather-consumer
    
    ; user that owns virtual environment.
    user=ckan
    
    numprocs=1
    stdout_logfile=/var/log/ckan/std/gather_consumer.log
    stderr_logfile=/var/log/ckan/std/gather_consumer.log
    autostart=true
    autorestart=true
    startsecs=10
    
    [program:ckan_fetch_consumer]
    
    command=/usr/lib/ckan/default/bin/ckan --config=/etc/ckan/default/ckan.ini harvester fetch-consumer
    
    ; user that owns virtual environment.
    user=ckan
    
    numprocs=1
    stdout_logfile=/var/log/ckan/std/fetch_consumer.log
    stderr_logfile=/var/log/ckan/std/fetch_consumer.log
    autostart=true
    autorestart=true
    startsecs=10

    There are a number of things that you will need to replace with your specific installation settings (the example above shows paths from a CKAN instance installed via Debian packages):

    • command: The absolute path to the ckan command located in the Python virtual environment and the absolute path to the config ini file.
    • user: The unix user you are running CKAN with.
    • stdout_logfile and stderr_logfile: All output coming from the harvest consumers will be written to these files. Ensure that the necessary permissions are set up.

    The rest of the configuration options are pretty self-explanatory. Refer to the Supervisor documentation to learn more about these and other available options.

  3. Start the supervisor tasks with the following commands:

    sudo supervisorctl reread
    sudo supervisorctl add ckan_gather_consumer
    sudo supervisorctl add ckan_fetch_consumer
    sudo supervisorctl start ckan_gather_consumer
    sudo supervisorctl start ckan_fetch_consumer

    To check that the processes are running, you can run:

    sudo supervisorctl status
    
    ckan_fetch_consumer              RUNNING    pid 6983, uptime 0:22:06
    ckan_gather_consumer             RUNNING    pid 6968, uptime 0:22:45

    Some problems you may encounter when starting the processes:

    • ckan_gather_consumer: ERROR (no such process)

      Double-check your supervisor configuration file and stop and restart the supervisor daemon:

      sudo service supervisor stop; sudo service supervisor start
    • ckan_gather_consumer: ERROR (abnormal termination)

      Something prevented the command from running properly. Have a look at the log file that you defined in the stdout_logfile section to see what happened. Common errors include:

      socket.error: [Errno 111] Connection refused

      RabbitMQ is not running. Start it with:

        sudo service rabbitmq-server start
  4. Once we have the two consumers running and monitored, we just need to create a cron job that runs the harvester run command periodically. To do so, edit the cron table with the following command (it may ask you to choose an editor):

    sudo crontab -e -u ckan

    Note that we are running this command as the same user we configured the processes to be run with (ckan in our example).

    Paste this line into your crontab, again replacing the paths to the ckan command and the ini file with yours:

    # m   h  dom mon dow  command
    */15 *   *   *   *    /usr/lib/ckan/default/bin/ckan -c /etc/ckan/default/ckan.ini harvester run

    This particular example will check for pending jobs every fifteen minutes. You can of course modify this periodicity; Wikipedia has a good overview of the crontab syntax.

  5. In order to set up the clean-up mechanism for the harvest log, one more cron job needs to be scheduled:

    sudo crontab -e -u ckan

    Paste this line into your crontab, again replacing the path to the ckan command and the ini file with yours:

    # m h dom mon dow command

    0 5 * * * /usr/lib/ckan/default/bin/ckan -c /etc/ckan/default/ckan.ini harvester clean-harvest-log

    This particular example will perform the clean-up each day at 5 AM. You can tweak the schedule according to your needs.

Extensible actions

Recipients on harvest jobs notifications

harvest_get_notifications_recipients: you can chain this action from another extension to change the recipients of harvest job notifications.
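
For example, here is a minimal sketch of chaining this action from another extension, assuming CKAN >= 2.7 (which provides toolkit.chained_action). The plugin name, the recipient dict keys and the extra email address are illustrative assumptions, not part of ckanext-harvest:

import ckan.plugins as plugins
import ckan.plugins.toolkit as toolkit


@toolkit.chained_action
def harvest_get_notifications_recipients(original_action, context, data_dict):
    # Start from the recipients computed by ckanext-harvest
    recipients = original_action(context, data_dict)
    # Add one extra, hard-coded recipient (illustrative)
    recipients.append({'name': 'Data team', 'email': 'data-team@example.com'})
    return recipients


class MyNotificationsPlugin(plugins.SingletonPlugin):
    plugins.implements(plugins.IActions)

    def get_actions(self):
        return {
            'harvest_get_notifications_recipients':
                harvest_get_notifications_recipients,
        }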

Tests

You can run the tests like this:

cd ckanext-harvest
pytest --ckan-ini=test.ini ckanext/harvest/tests

Here are some common errors and solutions:

  • (OperationalError) no such table: harvest_object_error u'delete from "harvest_object_error": the database has got into a bad state. Run the tests again, this time with the --reset-db parameter.
  • (ProgrammingError) relation "harvest_object_extra" does not exist: the database has got into a bad state. Run the tests again, this time without the --reset-db parameter. Alternatively, you may have forgotten the --ckan-ini parameter.
  • (OperationalError) near "SET": syntax error: you are testing with SQLite as the database, but the CKAN harvester needs PostgreSQL. Specify test-core.ini instead of test.ini.

Harvest API

ckanext-harvest exposes several API endpoints, in the format /api/action/<endpoint>.

  • /api/action/harvest_source_list

This endpoint will return all the harvest sources in CKAN, with a default limit of 100 items. The limit can be set to a bespoke value in the CKAN config under ckan.harvest.harvest_source_limit.

An optional query parameter organization_id can be used to narrow down the results to only the harvest sources created by a certain organization, by supplying its organization id: /api/action/harvest_source_list?organization_id=<some-org-id>
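
For example (the organization id below is hypothetical):

$ curl {ckan_url}/api/3/action/harvest_source_list?organization_id=38a85f0e-803d-4f85-8078-a5968addc173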

Releases

To create a new release, follow these steps:

  • Determine new release number based on the rules of semantic versioning
  • Update the CHANGELOG, especially the link for the "Unreleased" section
  • Update the version number in setup.py
  • Create a new release on GitHub and add the CHANGELOG of this release as release notes

Community

Contributing

For contributing to ckanext-harvest or its documentation, follow the guidelines described in CONTRIBUTING.

License

This extension is open source and licensed under the GNU Affero General Public License (AGPL) v3.0. Its full text may be found at:

http://www.fsf.org/licensing/licenses/agpl-3.0.html

ckanext-harvest's People

Contributors

amercader, avdata99, bellisk, bonnland, brucebolt, etj, frafra, fuhuxia, icmurray, jbrown-xentity, jin-sun-tts, joetsoi, johnmartin, kentsanggds, kindly, metaodi, nickumia-reisys, pdekraker-epa, pdelboca, polarp, pudo, raphaelstolt, seitenbau-govdata, smotornyuk, stefina, tino097, tobes, tomecirun, wwaites, zharktas


ckanext-harvest's Issues

New Harvest Source form cleanup

Since upgrading Bootstrap and the new IA changes that are now both in release-v2.0 of CKAN core, the harvest source form has become a little broken.


Final clean up before 2.0

Including:

  • Old auth profile stuff
  • Old routes
  • Templates (will keep them in the source for the time being)
  • Controller

Improve Job Errors reporting

For the last job of a particular source, show a summary of the most common errors, as well as a list of all documents with their errors.

Harvest Source template tweaks

There are a few minor things that are broken in them at the moment. These fixes should end up in release-v2.0:

  • From within an org's harvest source pages there should be a way to add a harvest source to that org
  • From within an org's harvest source pages we should link to the admin page, not the edit page, for a harvest source
  • Like dataset pages within orgs, harvest source pages should inherit the breadcrumbs

Improve gather stage error handling

The way queue.py handles the gather stage is very inconvenient, as it captures all exceptions that may happen in the harvester's gather_stage, preventing all debugging and leaving the job in a half-finished state.

Similarly to the refactoring of the fetch stage, there should be no exceptions caught, as the harvesters themselves should be robust enough (if exceptions are happening, we want to see them). If the gather stage returns anything that is not a list of ids, the process stops. If the harvest object ids list is empty, the process stops. gather_finished is always set, allowing the "run" command to flag the job as Finished (we want the "run" command to do it so the harvest source status is reindexed).

Also, the debug messages with thousands of harvest object ids are not very useful.

At a later stage, we could investigate implementing retry times as in the fetch stage.
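
As a sketch of that contract from a harvester's point of view, using the existing _save_gather_error helper from HarvesterBase (the remote-listing helper below is hypothetical, and a real harvester would also implement info() and the other stages):

from ckanext.harvest.harvesters.base import HarvesterBase
from ckanext.harvest.model import HarvestObject


class RobustHarvester(HarvesterBase):

    def gather_stage(self, harvest_job):
        try:
            guids = self._list_remote_guids(harvest_job.source.url)  # hypothetical
        except Exception as e:
            # Record the problem; returning something that is not a list of
            # ids stops the job as described above
            self._save_gather_error('Unable to list remote records: %s' % e,
                                    harvest_job)
            return None

        # One HarvestObject per remote record, referencing this job
        ids = []
        for guid in guids:
            obj = HarvestObject(guid=guid, job=harvest_job)
            obj.save()
            ids.append(obj.id)
        return ids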

Default package name to its current one, if the user hasn't passed any

When we're creating a new package, we create its name based on its title, so the user doesn't have to care about sending a valid name. But when updating, we require a name. As we require it to be in a certain format, it's hard for the user to guarantee that they're building a correct name and keeping it in sync with CKAN, if we ever change how we build names.

Harvesting extras from a CKAN 1.8 site into a CKAN 2.0 site fails (SQLAlchemy error: can't adapt type dict)

If you try to harvest a dataset from a CKAN 1 site and the dataset has an extra with a non-string value (e.g. the type of the extra's value is dict or list etc.), CKAN just returns a 500 Server Error to the API client. In the CKAN logs, you get this error from SQLAlchemy:

ProgrammingError: (ProgrammingError) can't adapt type 'dict' 'INSERT INTO package_extra...

In CKAN 1.8 it was possible to post non-string extras such as lists, dicts, etc. Undocumented, but possible.

In CKAN 2.0, this was changed so that extras must be strings.

(The docs for both 2.0 and 1.8 say that extras should be strings.)

The change is in ckan/model/package_extra.py. In CKAN 2.0 the value of a package extra in the database has type UnicodeText: https://github.com/okfn/ckan/blob/master/ckan/model/package_extra.py#L22

That's why SQLAlchemy crashes when you try to post something like a dict.

In CKAN 1.8, this database column had type JsonType, so posting dicts etc would work: https://github.com/okfn/ckan/blob/release-v1.8/ckan/model/package_extra.py#L22

This is the commit that made the change, in CKAN 2.0: ckan/ckan@fc3bd3d

So this means that the CKAN harvester is broken, because CKAN crashes when trying to harvest datasets from a CKAN 1 site into a CKAN 2 site if those datasets have non-string extras. I'm not sure what the fix should be; the harvester could simply remove any non-string extras from the dataset, or it could try to convert them to strings using JSON.
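
A minimal sketch of the JSON-conversion approach suggested above, assuming package_dict is the harvested dataset dict with its extras as a list of {'key': ..., 'value': ...} dicts:

import json

for extra in package_dict.get('extras', []):
    # CKAN 2.x requires extra values to be strings, so serialize anything else
    if not isinstance(extra['value'], str):
        extra['value'] = json.dumps(extra['value'])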

CKAN harvester bugged with relative links

I tried the harvester on the CKAN instance http://dati.toscana.it but it retrieves useless data for datasets like this one: http://dati.toscana.it/api/rest/dataset/arte-e-cultura/resource/8912d987-7f6b-48a0-b376-4613bb8e7905

As you can see the csv file is local on CKAN and has a relative path.
"url": "/it/storage/f/2012-07-26T160139/intoscana-arte-e-cultura.csv",
The harvester just copies the relative path but doesn't copy the actual file, so the relative link won't work.

save harvest object summaries to harvest job once a job is completed.

On large instances such as pdeu, viewing all the harvest jobs for a source is painful, so painful that I'm disabling the summaries apart from on the view_job pages. Since these summaries will never change (unless they are deleted), it might be worth doing a bit of denormalization and saving the job statistics to the harvest_job table, or providing a way of disabling the summaries without resorting to your own extension.

Missing dependency pyparsing

On branch release-v2.0 I had to add pyparsing==1.5.7 to pip-requirements.txt in order to get the initdb command to run.

Flag anonymous auth functions as such

Starting from 2.2 you need to explicitly flag auth functions that allow anonymous access with the p.toolkit.auth_allow_anonymous_access decorator. We'll need to keep backwards compatibility by only adding it on CKAN >= 2.2

Unable to add harvest source on command-line

Currently it's impossible to add a new harvest source using the CLI.

The README describes the usage as follows:

harvester source {url} {type} [{config}] [{active}] [{user-id}] [{publisher-id}] [{frequency}]

If I just specify the URL and the type like that:

paster --plugin=ckanext-harvest harvester source http://localhost ckan -c development.ini

I get a ckan.logic.ValidationError:

2013-08-05 20:37:04,366 INFO  [ckanext.harvest.logic.action.create] Creating harvest source: {'user_id': u'', 'url': u'http://localhost', 'type': u'ckan', 'frequency': 'MANUAL', 'publisher_id': u'', 'active': True, 'config': None}
An error occurred:
{'name': ['Missing value'], 'title': ['Missing value'], 'source_type': ['Missing value']}
Traceback (most recent call last):
  File "/home/vagrant/pyenv/bin/paster", line 9, in <module>
    load_entry_point('PasteScript==1.7.5', 'console_scripts', 'paster')()
  File "/home/vagrant/pyenv/local/lib/python2.7/site-packages/paste/script/command.py", line 104, in run
    invoke(command, command_name, options, args[1:])
  File "/home/vagrant/pyenv/local/lib/python2.7/site-packages/paste/script/command.py", line 143, in invoke
    exit_code = runner.run(args)
  File "/home/vagrant/pyenv/local/lib/python2.7/site-packages/paste/script/command.py", line 238, in run
    result = self.command()
  File "/vagrant/ckanext-harvest/ckanext/harvest/commands/harvester.py", line 104, in command
    self.create_harvest_source()
  File "/vagrant/ckanext-harvest/ckanext/harvest/commands/harvester.py", line 217, in create_harvest_source
    raise e
ckan.logic.ValidationError: {'Name': 'Missing value', 'Source type': 'Missing value', 'Title': 'Missing value'}

Delete harvest sources on 2.0

  • Update UI
  • Use the package delete action, syncing internally the source object

Bulk deletion of the source datasets will need to target 2.1

Memory leaks when viewing a harvester with 10000+ datasets

When clicking view on a harvester with 10000+ datasets in it, I get a memory leak, using CentOS 6.3 as the Apache WSGI runtime and the latest git versions of ckanext-harvest and CKAN. To reproduce it, I navigate to '/harvest' and click view on the harvester which has a lot of datasets. The system is a KVM virtual machine.

It returns no data, but in dmesg it shows up as:

hrtimer: interrupt took 31998821 ns
BUG: soft lockup - CPU#0 stuck for 67s! [java:1566]
Modules linked in: ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 dm_mod i2c_piix4 i2c_core virtio_balloon e1000 snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc ext4 mbcache jbd2 virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix [last unloaded: scsi_wait_scan]
CPU 0 
Modules linked in: ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 dm_mod i2c_piix4 i2c_core virtio_balloon e1000 snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc ext4 mbcache jbd2 virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix [last unloaded: scsi_wait_scan]

Pid: 1566, comm: java Not tainted 2.6.32-279.el6.x86_64 #1 Bochs Bochs
RIP: 0010:[<ffffffff81500126>]  [<ffffffff81500126>] _spin_lock+0x26/0x30
RSP: 0018:ffff880037763918  EFLAGS: 00000206
RAX: 0000000000000001 RBX: ffff880037763918 RCX: 0000000000000001
RDX: 0000000000000000 RSI: ffff8800bc9d55c0 RDI: ffff8800b8f10998
RBP: ffffffff8100bc0e R08: 0000000000000000 R09: 0000000000000001
R10: 00000000000134a0 R11: 0000000000000000 R12: ffffea0000e41ae0
R13: 80000000412c4067 R14: ffffffff811497a4 R15: ffff880037763908
FS:  00007fb9a492f700(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00000000ffd24ea8 CR3: 0000000037154000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process java (pid: 1566, threadinfo ffff880037762000, task ffff8800373faae0)
Stack:
 ffff880037763948 ffffffff811480e3 0000000000000000 ffff8800b8f10998
<d> ffffea0000e41aa8 0000000000000000 ffff8800377639e8 ffffffff811682a8
<d> ffff880000000001 ffff8800b8f10998 0000000000000000 0000880000000000
Call Trace:
 [<ffffffff811480e3>] ? page_lock_anon_vma+0x53/0x70
 [<ffffffff811682a8>] ? migrate_pages+0x3a8/0x4b0
 [<ffffffff8115d8f0>] ? compaction_alloc+0x0/0x3e0
 [<ffffffff8115e1e7>] ? compact_zone+0x517/0x820
 [<ffffffff8115e771>] ? compact_zone_order+0xa1/0xe0
 [<ffffffff810097cc>] ? __switch_to+0x1ac/0x320
 [<ffffffff8115e8cc>] ? try_to_compact_pages+0x11c/0x190
 [<ffffffff81127415>] ? __alloc_pages_nodemask+0x5f5/0x940
 [<ffffffff81039678>] ? pvclock_clocksource_read+0x58/0xd0
 [<ffffffff8115c2da>] ? alloc_pages_vma+0x9a/0x150
 [<ffffffff81176635>] ? do_huge_pmd_anonymous_page+0x145/0x380
 [<ffffffff8113fe7a>] ? handle_mm_fault+0x25a/0x2b0
 [<ffffffff810a68c2>] ? do_futex+0x682/0xb00
 [<ffffffff81044479>] ? __do_page_fault+0x139/0x480
 [<ffffffff81039678>] ? pvclock_clocksource_read+0x58/0xd0
 [<ffffffff81039678>] ? pvclock_clocksource_read+0x58/0xd0
 [<ffffffff8150326e>] ? do_page_fault+0x3e/0xa0
 [<ffffffff81500625>] ? page_fault+0x25/0x30
Code: e3 d7 ff c9 c3 55 48 89 e5 0f 1f 44 00 00 b8 00 00 01 00 f0 0f c1 07 0f b7 d0 c1 e8 10 39 c2 74 0e f3 90 0f 1f 44 00 00 83 3f 00 <75> f4 eb df c9 c3 0f 1f 40 00 55 48 89 e5 0f 1f 44 00 00 f0 81 
Call Trace:
 [<ffffffff811480e3>] ? page_lock_anon_vma+0x53/0x70
 [<ffffffff811682a8>] ? migrate_pages+0x3a8/0x4b0
 [<ffffffff8115d8f0>] ? compaction_alloc+0x0/0x3e0
 [<ffffffff8115e1e7>] ? compact_zone+0x517/0x820
 [<ffffffff8115e771>] ? compact_zone_order+0xa1/0xe0
 [<ffffffff810097cc>] ? __switch_to+0x1ac/0x320
 [<ffffffff8115e8cc>] ? try_to_compact_pages+0x11c/0x190
 [<ffffffff81127415>] ? __alloc_pages_nodemask+0x5f5/0x940
 [<ffffffff81039678>] ? pvclock_clocksource_read+0x58/0xd0
 [<ffffffff8115c2da>] ? alloc_pages_vma+0x9a/0x150
 [<ffffffff81176635>] ? do_huge_pmd_anonymous_page+0x145/0x380
 [<ffffffff8113fe7a>] ? handle_mm_fault+0x25a/0x2b0
 [<ffffffff810a68c2>] ? do_futex+0x682/0xb00
 [<ffffffff81044479>] ? __do_page_fault+0x139/0x480
 [<ffffffff81039678>] ? pvclock_clocksource_read+0x58/0xd0
 [<ffffffff81039678>] ? pvclock_clocksource_read+0x58/0xd0
 [<ffffffff8150326e>] ? do_page_fault+0x3e/0xa0
 [<ffffffff81500625>] ? page_fault+0x25/0x30
(Similar soft-lockup traces follow for CPU#5, CPU#1 and CPU#2; the rest of the dump is truncated.)
Process java (pid: 1565, threadinfo ffff880037058000, task ffff8800372eaaa0)
Stack:
 ffff8800370598c8 ffffffff811480e3 000000000000fb88 ffff8800b8f10998
<d> ffffea0000ee8618 ffffea0000ee8618 ffff880037059928 ffffffff81149871
<d> ffffc9000060b000 ffffea0000ee8618 ffffea0000ee85e0 ffff8800370599a8
Call Trace:
 [<ffffffff811480e3>] ? page_lock_anon_vma+0x53/0x70
 [<ffffffff81149871>] ? try_to_unmap_anon+0x21/0x140
 [<ffffffff8114a1e5>] ? try_to_unmap+0x55/0x70
 [<ffffffff81168139>] ? migrate_pages+0x239/0x4b0
 [<ffffffff8115d8f0>] ? compaction_alloc+0x0/0x3e0
 [<ffffffff8115e1e7>] ? compact_zone+0x517/0x820
 [<ffffffff8115e771>] ? compact_zone_order+0xa1/0xe0
 [<ffffffff810097cc>] ? __switch_to+0x1ac/0x320
 [<ffffffff8115e8cc>] ? try_to_compact_pages+0x11c/0x190
 [<ffffffff81127415>] ? __alloc_pages_nodemask+0x5f5/0x940
 [<ffffffff810a3b49>] ? futex_wait_queue_me+0xb9/0xf0
 [<ffffffff8115c2da>] ? alloc_pages_vma+0x9a/0x150
 [<ffffffff81176635>] ? do_huge_pmd_anonymous_page+0x145/0x380
 [<ffffffff8113fe7a>] ? handle_mm_fault+0x25a/0x2b0
 [<ffffffff810a6340>] ? do_futex+0x100/0xb00
 [<ffffffff81044479>] ? __do_page_fault+0x139/0x480
 [<ffffffff814fd830>] ? thread_return+0x4e/0x76e
 [<ffffffff8150326e>] ? do_page_fault+0x3e/0xa0
 [<ffffffff81500625>] ? page_fault+0x25/0x30
Code: 00 00 00 01 74 05 e8 e2 e3 d7 ff c9 c3 55 48 89 e5 0f 1f 44 00 00 b8 00 00 01 00 f0 0f c1 07 0f b7 d0 c1 e8 10 39 c2 74 0e f3 90 <0f> 1f 44 00 00 83 3f 00 75 f4 eb df c9 c3 0f 1f 40 00 55 48 89 
Call Trace:
 [<ffffffff811480e3>] ? page_lock_anon_vma+0x53/0x70
 [<ffffffff81149871>] ? try_to_unmap_anon+0x21/0x140
 [<ffffffff8114a1e5>] ? try_to_unmap+0x55/0x70
 [<ffffffff81168139>] ? migrate_pages+0x239/0x4b0
 [<ffffffff8115d8f0>] ? compaction_alloc+0x0/0x3e0
 [<ffffffff8115e1e7>] ? compact_zone+0x517/0x820
 [<ffffffff8115e771>] ? compact_zone_order+0xa1/0xe0
 [<ffffffff810097cc>] ? __switch_to+0x1ac/0x320
 [<ffffffff8115e8cc>] ? try_to_compact_pages+0x11c/0x190
 [<ffffffff81127415>] ? __alloc_pages_nodemask+0x5f5/0x940
 [<ffffffff810a3b49>] ? futex_wait_queue_me+0xb9/0xf0
 [<ffffffff8115c2da>] ? alloc_pages_vma+0x9a/0x150
 [<ffffffff81176635>] ? do_huge_pmd_anonymous_page+0x145/0x380
 [<ffffffff8113fe7a>] ? handle_mm_fault+0x25a/0x2b0
 [<ffffffff810a6340>] ? do_futex+0x100/0xb00
 [<ffffffff81044479>] ? __do_page_fault+0x139/0x480
 [<ffffffff814fd830>] ? thread_return+0x4e/0x76e
 [<ffffffff8150326e>] ? do_page_fault+0x3e/0xa0
 [<ffffffff81500625>] ? page_fault+0x25/0x30
epmd invoked oom-killer: gfp_mask=0x201da, order=0, oom_adj=0, oom_score_adj=0
epmd cpuset=/ mems_allowed=0
Pid: 1321, comm: epmd Not tainted 2.6.32-279.el6.x86_64 #1
Call Trace:
 [<ffffffff810c4971>] ? cpuset_print_task_mems_allowed+0x91/0xb0
 [<ffffffff811170e0>] ? dump_header+0x90/0x1b0
 [<ffffffff812146fc>] ? security_real_capable_noaudit+0x3c/0x70
 [<ffffffff81117562>] ? oom_kill_process+0x82/0x2a0
 [<ffffffff811174a1>] ? select_bad_process+0xe1/0x120
 [<ffffffff811179a0>] ? out_of_memory+0x220/0x3c0
 [<ffffffff811276be>] ? __alloc_pages_nodemask+0x89e/0x940
 [<ffffffff8115c1da>] ? alloc_pages_current+0xaa/0x110
 [<ffffffff811144e7>] ? __page_cache_alloc+0x87/0x90
 [<ffffffff8118fec0>] ? pollwake+0x0/0x60
 [<ffffffff8112a10b>] ? __do_page_cache_readahead+0xdb/0x210
 [<ffffffff8112a261>] ? ra_submit+0x21/0x30
 [<ffffffff81115813>] ? filemap_fault+0x4c3/0x500
 [<ffffffff8113ec14>] ? __do_fault+0x54/0x510
 [<ffffffff81127b2f>] ? free_hot_page+0x2f/0x60
 [<ffffffff8113f1c7>] ? handle_pte_fault+0xf7/0xb50
 [<ffffffff81010ba0>] ? copy_user_generic+0x0/0x20
 [<ffffffff81010bae>] ? copy_user_generic+0xe/0x20
 [<ffffffff8118fbe9>] ? set_fd_set+0x49/0x60
 [<ffffffff811910bc>] ? core_sys_select+0x1ec/0x2c0
 [<ffffffff8113fe04>] ? handle_mm_fault+0x1e4/0x2b0
 [<ffffffff81044479>] ? __do_page_fault+0x139/0x480
 [<ffffffff8103876c>] ? kvm_clock_read+0x1c/0x20
 [<ffffffff81038779>] ? kvm_clock_get_cycles+0x9/0x10
 [<ffffffff8109cd39>] ? ktime_get_ts+0xa9/0xe0
 [<ffffffff8118fb18>] ? poll_select_copy_remaining+0xf8/0x150
 [<ffffffff8150326e>] ? do_page_fault+0x3e/0xa0
 [<ffffffff81500625>] ? page_fault+0x25/0x30
Mem-Info:
Node 0 DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
CPU    1: hi:    0, btch:   1 usd:   0
CPU    2: hi:    0, btch:   1 usd:   0
CPU    3: hi:    0, btch:   1 usd:   0
CPU    4: hi:    0, btch:   1 usd:   0
CPU    5: hi:    0, btch:   1 usd:   0
Node 0 DMA32 per-cpu:
CPU    0: hi:  186, btch:  31 usd:  33
CPU    1: hi:  186, btch:  31 usd:   8
CPU    2: hi:  186, btch:  31 usd: 150
CPU    3: hi:  186, btch:  31 usd:  28
CPU    4: hi:  186, btch:  31 usd:  31
CPU    5: hi:  186, btch:  31 usd:  50
active_anon:539479 inactive_anon:144774 isolated_anon:0
 active_file:5 inactive_file:209 isolated_file:32
 unevictable:0 dirty:3 writeback:0 unstable:0
 free:14247 slab_reclaimable:2867 slab_unreclaimable:13627
 mapped:8838 shmem:8817 pagetables:5063 bounce:0
Node 0 DMA free:12184kB min:224kB low:280kB high:336kB active_anon:1248kB inactive_anon:2048kB active_file:20kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15344kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:12kB slab_unreclaimable:4kB kernel_stack:0kB pagetables:28kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 2990 2990 2990
Node 0 DMA32 free:44804kB min:44828kB low:56032kB high:67240kB active_anon:2156668kB inactive_anon:577048kB active_file:0kB inactive_file:836kB unevictable:0kB isolated(anon):0kB isolated(file):128kB present:3062308kB mlocked:0kB dirty:12kB writeback:0kB mapped:35324kB shmem:35268kB slab_reclaimable:11456kB slab_unreclaimable:54504kB kernel_stack:2104kB pagetables:20224kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:128 all_unreclaimable? yes
lowmem_reserve[]: 0 0 0 0
Node 0 DMA: 4*4kB 4*8kB 3*16kB 2*32kB 4*64kB 0*128kB 2*256kB 2*512kB 2*1024kB 2*2048kB 1*4096kB = 12192kB
Node 0 DMA32: 1303*4kB 771*8kB 421*16kB 228*32kB 113*64kB 43*128kB 16*256kB 3*512kB 1*1024kB 0*2048kB 0*4096kB = 44804kB
9131 total pagecache pages
0 pages in swap cache
Swap cache stats: add 0, delete 0, find 0/0
Free swap  = 0kB
Total swap = 0kB
780284 pages RAM
47313 pages reserved
68602 pages shared
705483 pages non-shared
[ pid ]   uid  tgid total_vm      rss cpu oom_adj oom_score_adj name
[  454]     0   454     2832      272   0     -17         -1000 udevd
[ 1064]     0  1064     2278      125   5       0             0 dhclient
[ 1108]     0  1108     6908       68   2     -17         -1000 auditd
[ 1133]     0  1133    62270      145   5       0             0 rsyslogd
[ 1145]    81  1145     7910       74   4       0             0 dbus-daemon
[ 1157]     0  1157    47287      225   1       0             0 cupsd
[ 1190]     0  1190    16016      168   3     -17         -1000 sshd
[ 1217]    26  1217    53409      793   0     -17         -1000 postmaster
[ 1296]     0  1296    19667      216   0       0             0 master
[ 1301]    89  1301    19730      212   1       0             0 qmgr
[ 1321]   498  1321     2705       38   5       0             0 epmd
[ 1334]     0  1334    27039       50   0       0             0 sh
[ 1336]     0  1336    27039       52   2       0             0 rabbitmq-server
[ 1343]     0  1343    36335       96   0       0             0 su
[ 1347]   498  1347   231690     7677   0       0             0 beam.smp
[ 1400]    26  1400    44162      260   1       0             0 postmaster
[ 1478]   498  1478     1012       21   4       0             0 cpu_sup
[ 1483]    26  1483    53442     8661   4       0             0 postmaster
[ 1484]    26  1484    53409      292   0       0             0 postmaster
[ 1485]    26  1485    53469      303   1       0             0 postmaster
[ 1486]    26  1486    44194      285   5       0             0 postmaster
[ 1487]   498  1487     2696       28   3       0             0 inet_gethost
[ 1488]   498  1488     4278       44   0       0             0 inet_gethost
[ 1555]    91  1555   973792    94613   3       0             0 java
[ 1574]     0  1574    45903      533   1       0             0 httpd
[ 1580]    48  1580   202983   120866   3       0             0 httpd
[ 1581]    48  1581    97567    17050   3       0             0 httpd
[ 1582]    48  1582   116178    33462   1       0             0 httpd
[ 1583]    48  1583    93866    13378   4       0             0 httpd
[ 1584]    48  1584   170141    89649   3       0             0 httpd
[ 1585]    48  1585    93866    13378   5       0             0 httpd
[ 1586]    48  1586   116177    33462   3       0             0 httpd
[ 1587]    48  1587   164341    83616   5       0             0 httpd
[ 1590]     0  1590    29301      157   4       0             0 crond
[ 1606]     0  1606     5362       46   3       0             0 atd
[ 1614]     0  1614    48903     2182   3       0             0 supervisord
[ 1617]   500  1617    83751    12720   4       0             0 paster
[ 1618]   500  1618   165335    93624   0       0             0 paster
[ 1631]     0  1631     1014       24   3       0             0 mingetty
[ 1633]     0  1633     1014       24   5       0             0 mingetty
[ 1635]     0  1635     3096      507   1     -17         -1000 udevd
[ 1636]     0  1636     3096      507   3     -17         -1000 udevd
[ 1637]     0  1637     1014       24   0       0             0 mingetty
[ 1639]     0  1639     1014       24   0       0             0 mingetty
[ 1641]     0  1641     1014       23   4       0             0 mingetty
[ 1662]    26  1662    55455    10540   4       0             0 postmaster
[ 1663]    26  1663    53918     1106   1       0             0 postmaster
[ 1675]    26  1675    54443     9639   2       0             0 postmaster
[ 1679]    48  1679    93866    13378   3       0             0 httpd
[ 1698]    48  1698    93865    13378   2       0             0 httpd
[ 1699]    48  1699    97567    17050   2       0             0 httpd
[ 1708]    26  1708    53931     1063   5       0             0 postmaster
[ 1709]    26  1709    56860    10570   2       0             0 postmaster
[ 1710]    26  1710    56867    10050   1       0             0 postmaster
[ 1711]    26  1711    53930     1023   3       0             0 postmaster
[ 1712]    26  1712    54438     9048   3       0             0 postmaster
[ 1729]    26  1729    56149    10427   5       0             0 postmaster
[ 1730]    26  1730    53930     1024   0       0             0 postmaster
[ 1731]    26  1731    54006     1756   3       0             0 postmaster
[ 1732]    26  1732    54005     1692   2       0             0 postmaster
[ 1733]    26  1733    53930     1025   5       0             0 postmaster
[ 3310]    89  3310    19687      210   0       0             0 pickup
[ 3492]     0  3492     1014       23   4       0             0 mingetty
[ 3493]     0  3493    24453      241   1       0             0 sshd
[ 3497]     0  3497    27074       94   0       0             0 bash
[ 3515]     0  3515    27255      468   4       0             0 watch
[ 3522]     0  3522     2272       24   1       0             0 sh
Out of memory: Kill process 1580 (httpd) score 165 or sacrifice child
Killed process 1580, UID 48, (httpd) total-vm:811932kB, anon-rss:483420kB, file-rss:44kB

Add last harvested time to harvest source and dataset pages

Currently, to see when a dataset was last harvested, you have to go to the dataset's harvest source page and then click on Admin.

Suggest adding this info to the main harvest source page at least, and if possible also to the dataset page.

make "delete harvest source" non-blocking

In our current default install with the harvest extension, deleting a large harvest source may take a long time. This will cause nginx to time out, and even without the timeout a user may get confused, click back, and attempt to delete the harvest source again.

We could make it non-blocking by adding another queue for deletion.
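
As a rough illustration, here is a sketch of that approach with the Redis backend (ckan.harvest.mq.type = redis): the web request only pushes a message and returns immediately, and a separate consumer does the slow work. The queue key and payload format are hypothetical; the extension does not ship a deletion queue.

    # A minimal sketch of a hypothetical deletion queue (not part of
    # ckanext-harvest); the key name and payload format are assumptions.
    import json
    import redis

    conn = redis.StrictRedis(host='localhost', port=6379, db=0)
    QUEUE_KEY = 'site1:harvest_source_delete'  # namespaced by ckan.site_id

    def enqueue_source_deletion(source_id):
        # Called from the web request: O(1), so nginx never times out.
        conn.rpush(QUEUE_KEY, json.dumps({'harvest_source_id': source_id}))

    def deletion_consumer():
        # Runs alongside the gather/fetch consumers; blocks until work arrives.
        while True:
            _key, body = conn.blpop(QUEUE_KEY)
            source_id = json.loads(body)['harvest_source_id']
            # ... perform the actual (slow) deletion of the source's jobs,
            # objects and datasets here ...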

403 error from gather consumer interface

I get the following error messages from the gather consumer interface:

2014-01-08 17:44:44,141 DEBUG [ckanext.harvest.queue] Received harvest job id: a5449e60-0996-49c3-a305-8b4034647cc6
2014-01-08 17:44:44,142 DEBUG [ckanext.harvest.queue] pika connection using {'retry_delay': 2.0, 'frame_max': 10000, 'channel_max': 0, 'locale': 'en_US', 'socket_timeout': 0.25, 'ssl': False, 'host': 'localhost', 'ssl_options': {}, 'virtual_host': '/', 'heartbeat': 0, 'credentials': <pika.credentials.PlainCredentials object at 0x3c33b10>, 'backpressure_detection': False, 'port': 5672, 'connection_attempts': 1}
2014-01-08 17:44:44,695 DEBUG [ckanext.harvest.harvesters.ckanharvester] In CKANHarvester gather_stage ({CKAN_WEBSITE_I_WANT_TO_HARVEST_FROM})
2014-01-08 17:44:44,695 DEBUG [ckanext.harvest.harvesters.ckanharvester] Using config: {u'read_only': True, u'default_tags': [u'POPULATION', u'ACS'], u'remote_groups': u'only_local', u'default_groups': [u'testgroup'], u'user': u'harvest', u'api_key': u'3a3c9e64-45f1-40d5-ab04-c9ddc8157885', u'override_extras': True, u'api_version': 2}
2014-01-08 17:44:44,746 ERROR [ckanext.harvest.harvesters.base] Unable to get content for URL: http://{CKAN_WEBSITE_I_WANT_TO_HARVEST_FROM}/api/2/rest/package: HTTP Error 403: Forbidden
2014-01-08 17:44:44,750 ERROR [ckanext.harvest.queue] Gather stage failed

My harvest configuration is:

{
   "read_only":true,
   "default_tags":[
      "POPULATION",
      "ACS"
   ],
   "remote_groups":"only_local",
   "default_groups":[
      "testgroup"
   ],
   "user":"harvest",
   "api_key":"3a3c9e64-45f1-40d5-ab04-c9ddc8157885",
   "override_extras":true,
   "api_version":2
}

Has anyone run into this same issue? Any hints or suggestions, please?

Thanks
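
One way to narrow a 403 like this down is to replay the failing request outside the harvester. Here is a minimal sketch, assuming the requests library and the same placeholder host as above; the CKAN harvester sends the API key in the Authorization header:

    import requests

    # Replace the placeholder with the real remote host before running.
    remote = 'http://{CKAN_WEBSITE_I_WANT_TO_HARVEST_FROM}'
    api_key = '3a3c9e64-45f1-40d5-ab04-c9ddc8157885'

    # Mirror the harvester's request: the API key goes in the
    # Authorization header.
    resp = requests.get(remote + '/api/2/rest/package',
                        headers={'Authorization': api_key})
    print(resp.status_code)  # a 403 here means the remote site rejects
                             # this key, not a bug in the gather consumer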

500 errors when trying to access some harvested records through http://foo.org/harvest/object/UID

We are working on a large deployment of CKAN and we are seeing 500 errors when trying to access some harvested records (~30% of all harvested records). Our deployment runs on CentOS 6.4.

Here's the error for the 500s:
An error occurred: ['module' object has no attribute 'ParseError']

The error traces back to exactly this line: https://github.com/okfn/ckanext-harvest/blob/master/ckanext/harvest/controllers/view.py#L118

The exception is triggered here: https://github.com/okfn/ckanext-harvest/blob/master/ckanext/harvest/controllers/view.py#L104

The problem is etree and Python versioning.

The ckanext-harvest code uses xml.etree.ElementTree as etree: https://github.com/okfn/ckanext-harvest/blob/master/ckanext/harvest/controllers/view.py#L2

xml.etree.ElementTree.ParseError is only available on Python 2.7+. It is not in the Python 2.6 that ships with CentOS 6.x.

Hence this error.

So the code here: https://github.com/okfn/ckanext-harvest/blob/master/ckanext/harvest/controllers/view.py#L104 is throwing an exception when trying to parse JSON content (via etree.fromstring).

Python 2.6's etree throws xml.parsers.expat.ExpatError.

Python 2.7's etree throws xml.etree.ElementTree.ParseError (which is aliased to etree.ParseError in the code).
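
A minimal compatibility shim along those lines (a sketch of one possible fix, not necessarily the patch that was applied):

    import xml.etree.ElementTree as etree

    try:
        XmlParseError = etree.ParseError  # Python 2.7+
    except AttributeError:
        # Python 2.6: fromstring() raises ExpatError instead
        from xml.parsers.expat import ExpatError as XmlParseError

    def try_parse_xml(content):
        """Return the parsed tree, or None if content is not XML
        (e.g. a JSON harvest object)."""
        try:
            return etree.fromstring(content)
        except XmlParseError:
            return None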

Allow bulk deletion of all datasets when deleting a source

Note: this targets 2.1

This needs some thought as there are some issues:

  • How to efficiently get all the ids of the datasets that need to be deleted, via a) search or b) the db. a) implies that we need to query for the full dict on potentially lots of datasets when we only need the id; b) would mean going through the harvest object table, as in https://github.com/okfn/ckanext-harvest/blob/release-v2.0/ckanext/harvest/logic/action/get.py#L90 (see the sketch after this list)
  • The bulk_update_* functions in CKAN core require an organization
  • We may also need a bulk_update_activate function
  • On the frontend, the confirmation message should add a checkbox for deleting all datasets. The current implementation of the confirm-action module does not allow this, so we either extend it or ship a custom one in the harvest extension (probably easier)
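
For option b), the lookup can stay id-only along these lines (a sketch; the model names follow ckanext-harvest, but treat the exact filters as assumptions):

    from ckanext.harvest.model import HarvestObject

    def harvested_package_ids(session, source_id):
        # Fetch only the ids, never the full package dicts.
        rows = session.query(HarvestObject.package_id) \
            .filter(HarvestObject.harvest_source_id == source_id) \
            .filter(HarvestObject.current == True) \
            .distinct()
        return [row[0] for row in rows]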

/cc @kindly

Allow harvesting of tags without stripping accents and capital letters

Currently, harvesting of tags automatically strips out any accented characters or capital letters. This works fine for English-language sources but looks strange in German or French; see the list of Schlagworte at http://opendata.admin.ch/de/dataset.

This is hardcoded in the function _create_or_update_package in ckanext/harvest/harvesters/base.py. It would be nice to be able to set a flag somewhere to determine whether or not stripping accents and capitals is desired behaviour.
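
A sketch of what such a flag could look like; the option name clean_tags is hypothetical, and munge_tag is the CKAN helper that does the stripping today:

    from ckan.lib.munge import munge_tag

    def normalise_tags(tags, config):
        # 'clean_tags' is a hypothetical option; defaulting to True keeps
        # the current behaviour.
        if config.get('clean_tags', True):
            return [munge_tag(t) for t in tags]  # lowercases, strips accents
        return list(tags)  # keep e.g. 'Bevölkerung' or 'Santé' as-is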

Add source clear command

Sometimes it is useful to clear a source and start all over again. This command would remove all jobs, errors, objects and packages for a particular source.
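
A sketch of the shape such a command could take, keeping the source row itself; the model names follow ckanext-harvest, but treat the exact cascade as an assumption:

    from ckanext.harvest.model import HarvestJob, HarvestObject

    def clear_source(session, source_id):
        # Remove everything the source produced, but keep the source itself.
        # (A real implementation would also delete the gather/object errors
        # and purge the harvested packages before committing.)
        session.query(HarvestObject) \
            .filter(HarvestObject.harvest_source_id == source_id) \
            .delete(synchronize_session=False)
        session.query(HarvestJob) \
            .filter(HarvestJob.source_id == source_id) \
            .delete(synchronize_session=False)
        session.commit()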

TemplateNotFound when trying to get RDF version of a package with a non-default package type

For example, ckanext-harvest creates a package for each harvest source, and the "type" field of these packages is set to "harvest". If you visit the RDF version of the package's page, e.g. http://publicdata.eu/dataset/ckan-italia.rdf, you get:

File '/usr/lib/ckan/src/ckan/ckan/controllers/package.py', line 362 in read
  return render(template, loader_class=loader)
...
TemplateNotFound: Template "source/read.rdf" not found
