More from Less: Learning with Limited Annotated Data in Vision and Language