This data set consists of lexical entries for one hundred concepts, based on the concept lists of Haspelmath and Tadmor (2009) and Swadesh (1971). Entries were translated into twenty-two languages of the Kho-Bwa subgroup of the Sino-Tibetan language family and were annotated with respect to cognacy information.
A tutorial accompanying this data set and providing first steps towards an analysis can be found here.