manipulate vast datasets