Spectral-temporal compressive imaging is a technique that allows to sense spatio-spectro-temporal information, known as spectral video from a single low-framerate monochrome measurement. Several optical architectures have been recently developed to capture the spectral features of a dynamic scene or spectral video based on the compressive sensing framework. These spectral-temporal compressive architectures are principally composed by four elements: a main lens, a coded aperture, a dispersive element and an FPA detector. Traditionally, these acquisition systems use block-unblock coded apertures that either block the light rays in the optical path or allow them to pass through. However, the modulation produced by the block-unblock coded apertures is wavelength independent, thus ignoring the highly correlated structure of the spectral information in a dynamic scene. In this work, the block-unblock coded apertures are replaced by colored coded apertures, whose pixels represent a set of specific optical filters such as low, high or band pass filters that modulate particular wavelengths of the scene. An analysis of the variations in the colored coded aperture pattern that allows to obtain improvements in PSNR compared with the block-unblock coded apertures is presented. Simulation results show an improvement up to 2 dB in the reconstruction quality with respect to the block-unblock coded apertures.