Creating a universal, free to read, digital library containing all the books ever published is technically feasible today. Google, Yahoo and Microsoft have all announced their intention to scan and make available books of interest to public. Unfortunately many of these will be in languages inaccessible to over 80% of the world's population. Even when books in non-native languages become available online, their content will remain incomprehensible to most people. Natural Language Processing Technology is not yet perfect but promises to provide a way out of this conundrum. In this talk, we will discuss some unique research problems in text ming and discovery that arise digital libraries and other online content such as multi-lingual search, translation and summarization.