Check this message in Nutch-dev mailing list archive, where Tejas Patil picked issues for beginners that are looking to contribute to Apache Nutch.
Apache Nutch is an open source web-search software project. Nutch is a project of the Apache Software Foundation and is part of the larger Apache community of developers and users