[Soc-coordination] Report 0 for Improvements to the Debian Search Project

Aarsh Shah aarshkshah1992 at gmail.com
Fri Jun 21 12:54:39 UTC 2013


*Improvements to Debain Search - Report 0

*
Mentor : Olly Betts

*Work done in the previous week:*

   - The first stage out of the three stages that I have planned for this
   project (Implement novel weighting schemes in Xapian, Implement Query
   expansion in Xapian using novel schemes, test and deploy on Debian search)
   is to implement a set of state of the art weighting schemes of the
   Divergence From Randomness Framework.
   - In the previous week, I coded,tested and documented the Inl2 weighting
   scheme of the DFR framework and have sent as pull request for it on github.
   The link to the code is here :  https://github.com/xapian/xapian/pull/11
   It is currently in the process of being merged.
   - I also implemented a method in the weight API of xapian to obtain the
   collection frequency of a word i.e the total times a word occurs in the
   whole database
   - I also cleaned,corrected,documented and wrote tests for the DPH
   weighting scheme and the PL2 Weighting scheme ,the code for which I had
   written in a very shabby way before GSOC started. I have sent pull requests
   for these two schemes as well.

*Work I am doing right now:*

   - I am currently working on implementing the DLH,IfB2 and the In2B2
   weighting schemes of the DFR franework. I will write code for them this
   week.

*Work I will do in the next week:*

   - In the next week, I will clean the code of the weighting schemes I am
   writing right now with the help of my mentor's feedback and I shall also
   write tests for these weighting schemes.
   - I will work on merging the weighting schemes that I have already
   implemented into Xapian.

   -Regards
   -Aarsh
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/soc-coordination/attachments/20130621/2d70a9e5/attachment.html>


More information about the Soc-coordination mailing list