In this paper we propose a method for run-time profiling of applications at the instruction level through the analysis of loops. Instead
of looking for coarse-grained blocks, we concentrate on fine-grained blocks that are nonetheless costly in terms of execution time. Most
code profiling is done in software by inserting code into the application being profiled, which incurs a time overhead, whereas
in this work data on the position of a loop, the size of its body and its number of executions are stored and analysed using a
small, non-intrusive hardware block. The paper describes the mapping of the system onto run-time reconfigurable systems. Synthesis
results for the fine-grained code detector block and the verification of its functionality are also presented. To
demonstrate the concept, the MediaBench multimedia benchmark running on the chosen development platform is used.
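
The kind of per-loop record described above (position, body size and number of executions) can be illustrated with a minimal software sketch. This is not the hardware block presented in the paper. It assumes, for illustration only, that a loop is recognised by a taken backward branch and that instructions have a fixed width of four bytes. The names `LoopRecord`, `detect_loops` and `INSTR_BYTES` are hypothetical.

```python
from dataclasses import dataclass

INSTR_BYTES = 4  # assumption: fixed-width 32-bit instructions


@dataclass
class LoopRecord:
    start: int       # address of the first instruction of the loop body
    size: int        # body size in instructions
    executions: int  # number of times the closing backward branch was taken


def detect_loops(branch_trace):
    """Build a loop table from (branch_address, target_address) pairs.

    Each pair describes one taken branch. A taken branch whose target is
    not above its own address is treated as the closing branch of a loop
    (an assumption of this sketch, not a claim about the paper's detector).
    """
    table = {}
    for branch_addr, target_addr in branch_trace:
        if target_addr > branch_addr:
            continue  # forward branch: not a loop back-edge
        key = (target_addr, branch_addr)
        if key not in table:
            size = (branch_addr - target_addr) // INSTR_BYTES + 1
            table[key] = LoopRecord(target_addr, size, 0)
        table[key].executions += 1
    # Rank loops by a rough cost estimate: body size times execution count.
    return sorted(table.values(),
                  key=lambda r: r.size * r.executions,
                  reverse=True)
```

Given a trace of taken branches, `detect_loops` returns the loops ordered by estimated cost, so that small but frequently executed loops appear near the top of the list. Note that the execution count here is the number of taken back-edges, which is one possible definition among several.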