Script name: Split N
Short description: The script splits the input file into multiple parts defined by the token count; currently, it is 1000-word chunks. The size of the chunck is defined in the following line in the code, which can be modified to suit the users' needs: int tokensPerFile = 1000
Version: 1.0
Creator: #LancsBox
External link: http://corpora.lancs.ac.uk/lancsbox