Tweets and web data help track languages around the globe
Two UC academics have developed a unique multilingual data set, based on World Wide Web and Twitter traffic.
“It’s probably the biggest multilingual data set available in the world and we’re making it available for free for other people to use,” says Linguistics Lecturer Dr Jonathan Dunn.
University of Canterbury (UC) Linguistics Lecturer Dr Jonathan Dunn and UC Computer Science Senior Lecturer Dr Ben Adams have developed a language mapping website that is one of the largest multilingual data sets available in the world.
Dr Dunn says the language mapping website they’ve created, called earthLings.io, is the result of several years of work.