Publications
The following are some of the recent articles, research papers and other articles which I have written. You can download most of them by clicking on the headers:
Academic Papers:
Baldridge, J., S. Chatterjee, A. Palmer, and B. Wing. 2007. VisCCG: Wiki and Programming Paradigms for Grammar Engineering. Grammar Engineering cross Frameworks 2007. Stanford University
Abstract: We present a suite of tools for simplifying the creation and maintenance of grammars for the OpenCCG parsing and realization system. The core of our approach relies on a terse but expressive textual format, DotCCG, for
declaring CCG grammars. It supports powerful string expansions that allow grammar developers to eliminate redundancy in the declaration of both morphology and category definitions. Grammars written in this format are converted into the XML utilized by OpenCCG using the ccg2xml utility, which — like a programming language compiler– provides information regarding errors in the grammar, including the type of error and the line number on which t occurs. DotCCG grammars can be edited with VisCCG, a graphical interface which provides visualization of various components of the grammar and allows local editing of information in a manner inspired by wikis. We also
report on resources developed to facilitate wide use of the OpenCCG tool suite presented in this paper and on recent uses of the tools in both academic research and classroom environments.
Porting Nachos to the CISC Architecture: An unpublished draft research paper that I have written, based on my final year undergraduate project. To cut a long story short, we have been able to modify a toy OS, by eliminating a cross-compiler, replacing a virtual MIPS system by an 8086 CPU, and also ported existing NachOS code to an Intel machine without any trace of the MIPS system that existed. I hope that this one comes up on the ACM SIGOPS in their January issue.
Class project reports:
Some of the classes that I’ve attended have led to some promising efforts which I’m trying to work on towards a more publishable result. In the meantime, here are two reports from such class projects:
In my recent Computational Linguistics class (offered by Dr. Katrin Erk), I tried to use dependency based semantic spaces published very recently by Sebastian Pado and Mirella Lapata (Computational Linguistics, 33:2, 2007) for word sense disambiguation. I used this dependency based word vector space instead of a simple co-occurrence based vector space and compared the two systems in word sense disambiguation using a similar approach as Hinrich Schutze in his paper, Automatic Word Sense Discrimination, (Computational Linguistics, 24:1, 1998). A copy of the class project report I submitted is here, titled Comparing Collocation-based and Dependency-based word vector spaces for unsupervised Word Sense Disambiguation.
I have also worked earlier in my Neural Networks class with Dr. Risto Miikkulainen. In this class, I tried to extend Q-learning based routing algorithms by trying to implement Probabilistic Confidence-based Dual Reinforcement Q - learning, or PCDRQ routing in short. Our final project report is here. I am interested in taking the project further to see if the new protocol can be implemented and scaled to real-time network routing, and am currently working on this.
Older work:
Managing Unmanageable Code: A small guide, or rather a collection of handy tips and tricks that you might find useful if you have suddenly decided to tweak a little of the code of your favorite open source music player. Again, one of the articles I could come up with due to my tryst with NACHOS.
The Penguin@Play!: I am an ardent Linux enthusiast, and so had quickly penned down this one to keep his mouth shut, and maybe to help other newbies switch to Linux as well. It was published in “Linux For You”, Issue: December 01, 2005 Page: 29-31
Comparison between OWB and Datastage: A whitepaper that I wrote for our DW/BI capability here in Accenture. It was a basic comparison of the various functionalities of the two widely used ETL tools in the data warehousing arena, Oracle Warehouse Builder and Ascential Datastage.