How will this corpus grow over time?

At the moment we are sitting on generous donations that we are in the process of transforming into training-ready formats. This is a long, labor-intensive process. If you'd like to volunteer, we'd love your support.

Our goal is to continue to grow this text over time, and specifically to increase its representativeness. At the moment, like Urdu publishing, this corpus over-represents works by male authors and publications from the city of Lahore. We are working to add more diversity to this corpus.