One of the more difficult problems facing an analyst wishing to use a simulation is the task of collecting data and transforming it into a correctly formatted scenario. Raw data is often available from a variety of sources: multi-spectral force deployment (MSFD) documents, the electronic warfare integrated reprogramming database (EWIRDB), free text documents such as intelligence reports, pre-existing simulation scenarios, and scenarios taken from other simulations. The task of transforming this data into a usable scenario involves searching for the relevant information, followed by a manual transformation of the original format to the correct simulation format. This problem can be greatly alleviated by using a combination of three technologies: automatic parser generation, repository architectures using extensible markup language (XML), and information retrieval (IR) techniques. Automatic parser generation tools like JavaCC can automatically generate source code capable of reading data sources such as old Joint Integrated Mission Model (JIMM) or Suppressor input files. For simulations that regularly add scenario keywords to support changing needs, this can greatly reduce redevelopment time and cost for supporting tools. The objects parsed by this source can then be encapsulated in XML and stored into a repository. Using information retrieval techniques, objects can then be queried from the repository and transformed into the appropriate format for use in a scenario.