Dataset preparation and baseline