[med-svn] r7377 - in trunk/community/infrastructure/getData: . debian

Steffen Möller moeller at alioth.debian.org
Wed Aug 3 14:07:46 UTC 2011


Author: moeller
Date: 2011-08-03 14:07:46 +0000 (Wed, 03 Aug 2011)
New Revision: 7377

Modified:
   trunk/community/infrastructure/getData/README
   trunk/community/infrastructure/getData/debian/changelog
   trunk/community/infrastructure/getData/debian/control
   trunk/community/infrastructure/getData/debian/rules
   trunk/community/infrastructure/getData/getData
Log:
Series of updates preparing for an upload.


Modified: trunk/community/infrastructure/getData/README
===================================================================
--- trunk/community/infrastructure/getData/README	2011-08-03 10:11:12 UTC (rev 7376)
+++ trunk/community/infrastructure/getData/README	2011-08-03 14:07:46 UTC (rev 7377)
@@ -1,12 +1,30 @@
 getData
 =======
 
-While the community is discussing the possibility to prepare large
-Debian packages for the distribution of biological databases, this
-package instead sets out to prepare such distributions automatically -
-not the Debian package, but the provisioning of local updates and first
-installations of the data.
+Computational biology as a science works without data. Well, but one
+cannot prove anything then or just learn from appling the computer to
+some new data, then.  There are many reasons why Debian cares about
+bioinformatics, but alone for a contribution to the education of our
+students at all levels - something we cannot talk enough about, there
+needs to be an easy straight forward way to get access to data. For the
+professionals then around us, we also need those extra features like
+automated updates and whatever else comes to mind to maintain those data
+files in our daily routine.
 
+There are now various approaches to it. Many say that there should be data
+packages available which are tarballs or some other format of that data to
+be installed. This is very reasonable indeed. But alone for the concept
+of releases that comes with our and basically every other Linux distro,
+this concept is doomed. Nobody wants to use old biological databases
+when a new version is available basically at the very same costs.
+
+This package instead sets out to prepare the installation and maintenance
+of databases automatically - directly from the data creator's websites
+and redistributed as a Debian package.  There may still be packages
+with the name of a particular database, like 'uniprot', but those should
+then merely provide the instructions to this new 'getData' tool on how
+to deal with the database for download, indexing and updating.
+
 The download of the databases listed in getData should all
 remain functional. The major challenge is the integration with the
 post-processing of the data. In this respect well performing should
@@ -16,11 +34,23 @@
 that have the EMBOSS tool kit installed, also the respective indexing
 is performed.
 
-Once the package has hit unstable, we shall have the configuration
-files with database-specific tools or database install packages.
-Please help with your direct contributions and/or feedback.
+Once the package has hit unstable, we shall have the configuration files
+with database-specific tools or database install packages.  Please help
+with your direct contributions and/or feedback.
 
+'getData' can be successful only when there is a strong communication
+among ourselves concerning new tools on the horizon that should possibly
+be added. And when there is a new URL for a particular pathway, then
+this should be updated in some community effort.  So, please, should at
+us by saying "reportbug getdata" whenever there is something to report.
+While the development of this tools was indeed seeded by the Debian Med
+community, there is some strong hope that folks at Fedora and OpenSuSE
+would adopt this package from us. We'll see.
 
+Many thanks!!!
+
+Steffen and Charles
+
 Acknowledgements
 ----------------
 

Modified: trunk/community/infrastructure/getData/debian/changelog
===================================================================
--- trunk/community/infrastructure/getData/debian/changelog	2011-08-03 10:11:12 UTC (rev 7376)
+++ trunk/community/infrastructure/getData/debian/changelog	2011-08-03 14:07:46 UTC (rev 7377)
@@ -3,6 +3,7 @@
   [ Steffen Moeller ]
   * Initial release (Closes: #514984)
   * source format 3.0 (quilt)
+  * Suggesting biomaj
 
   [ Charles Plessy ]
   * Renamed getData.pl getData (.pl extensions are unwelcome in /usr/bin)
@@ -14,4 +15,4 @@
     /local/databases/mirrored.
   * Recommends perl-doc as getData --help needs it.
 
- -- Steffen Moeller <moeller at debian.org>  Fri, 05 Nov 2010 14:51:58 +0100
+ -- Steffen Moeller <moeller at debian.org>  Wed, 03 Aug 2011 16:00:22 +0200

Modified: trunk/community/infrastructure/getData/debian/control
===================================================================
--- trunk/community/infrastructure/getData/debian/control	2011-08-03 10:11:12 UTC (rev 7376)
+++ trunk/community/infrastructure/getData/debian/control	2011-08-03 14:07:46 UTC (rev 7377)
@@ -5,7 +5,7 @@
 DM-Upload-Allowed: yes
 Uploaders: Charles Plessy <plessy at debian.org>
 Build-Depends: cdbs, debhelper (>= 7)
-Standards-Version: 3.9.1
+Standards-Version: 3.9.2
 Vcs-Browser: http://svn.debian.org/wsvn/debian-med/trunk/community/infrastructure/getData/?rev=0&sc=0trunk
 Vcs-Svn: svn://svn.debian.org/svn/debian-med/trunk/community/infrastructure/getData
 Homepage: http://debian-med.alioth.debian.org
@@ -14,13 +14,19 @@
 Architecture: all
 Depends: ${shlibs:Depends}, ${misc:Depends}, perl
 Recommends: perl-doc
+Suggests: biomaj
 # perl-doc is needed for getData --help.
 Description: management of external databases
- Many scientific communities share the problem of regularly
- updating external database and keeping internally created
- indices updated. This tool addresses this issue.
+ Many scientific communities share the problem of regularly updating
+ external databases. With every update, also various tasks need to
+ be performed for the update of indices that need to be recreated.
+ This work depends on the tools that are available locally and is
+ not always completely simple.
  .
- Since some activities do depend on the installation
- of additional tools, getdata needs to be aware of 
- packages being installed to the system and hence 
- will be tailored to Debian Linux.
+ This package provides the getData Perl script, which in some not
+ so complicated manner performs the invocation to wget to download
+ data and then knows how to perform the indexing. There is only
+ a hash table to be filled with the commands to be executed. 
+ Maintainers of scientific packages that are strongly coupled to
+ public datasets are invited to add a runtime dependency to this
+ package and add instructions for getData to follow.

Modified: trunk/community/infrastructure/getData/debian/rules
===================================================================
--- trunk/community/infrastructure/getData/debian/rules	2011-08-03 10:11:12 UTC (rev 7376)
+++ trunk/community/infrastructure/getData/debian/rules	2011-08-03 14:07:46 UTC (rev 7377)
@@ -7,3 +7,8 @@
 	$(MAKE) getData.1
 
 install:: getData.1
+
+UPSTREAMVERSION=$(shell dpkg-parsechangelog | grep ^Version|cut -f2 -d\ | cut -f1 -d-)
+get-orig-source:
+	cd .. ; \
+	tar --exclude=debian --exclude-vcs -czvf getdata_$(UPSTREAMVERSION).orig.tar.gz getData

Modified: trunk/community/infrastructure/getData/getData
===================================================================
--- trunk/community/infrastructure/getData/getData	2011-08-03 10:11:12 UTC (rev 7376)
+++ trunk/community/infrastructure/getData/getData	2011-08-03 14:07:46 UTC (rev 7377)
@@ -12,11 +12,11 @@
 =cut
 
 # This script shall help maintaining sets of frequently changing databases
-# of various sorts. It is motivated by demands in bioinformatics and
-# astronomy.
+# of various sorts. It is motivated by demands in computational biology
+# and astronomy.
 
-# Copyright (c) 2008 Steffen Moeller <moeller at debian.org>
-# Copyright (c) 2008 Charles Plessy <plessy at debian.org>
+# Copyright (c) 2008-2011 Steffen Moeller <moeller at debian.org>
+# Copyright (c) 2008-2011 Charles Plessy <plessy at debian.org>
 # 
 # This program is free software; you can redistribute it and/or
 # modify it under the terms of the GNU General Public License




More information about the debian-med-commit mailing list