Approximation by exponential polynomials
This follows from the fact that the set of $n\times n$ matrices with simple spectrum is dense in the space of all $n\times n$ matrices ${\bf M}_n(\mathbb C)$ (or that the set of polynomials of degree $n$ with simple roots is dense in the set of all complex polynomials of degree $n$).
The function $f$ solves the Cauchy problem for a linear ODE with constant complex coefficients $$y^{(N)}+a_1y^{(N-1)}+\dots+a_Ny=0,$$ $$y^{(k)}(0)=f^{(k)}(0),\quad k=0,1\dots,N-1,$$ where $N=\sum_{j=1}^{M} (m_j+1) \leq n$. The exponents $\lambda_j$, $j=1,\dots,M$ are the roots of the corresponding characteristic equation $$P(\lambda)=\lambda^{N}+a_1\lambda^{N-1}+\dots+a_N\lambda=0$$ with the multiplicities, respectively, $m_j+1$, $j=1,\dots,M$. The coefficients $a_1,\dots a_N$ can be found from the relation $$P(\lambda)=\prod\limits_{j=1}^{M}(\lambda-\lambda_j)^{m_j+1}.$$
Now, a small generic perturbation of the ODE's coefficients will produce an ODE $$y^{(N)}+a_1^{\varepsilon}y^{(N-1)}+\dots+a_N^{\varepsilon}y=0,$$ such that the roots of the corresponding characteristic equation are all simple. This implies that a solution to the latter ODE belongs to $E_n$. Finally, we may use a standard result that solutions to linear ODEs depend continuously on the coefficients (in the topology of uniform convergence on finite time intervals).
By induction on $n$. If $n=1$, no problem. If $n \geq 2$, if the polynomials $p_{m_j}$ are all constant, still no problem. So assume $p_{m_1}$ nonconstant. Approximating $f$ is the same as approximating $t \mapsto e^{-\lambda_1 t} f(t)$, so we may assume $\lambda_1=0$. Then we approximate $f'$, using induction (since $\lambda_1=0$, $\sum_{j=1}^M (m_j+1)$ is less for $f'$). Now if $f'-u_0$ is small, with $u_0(t) = \sum_{k=1}^{n-1} c_k e^{\mu_k t}$, then $f-u$ is also small, where $u(t)=f(0)-\sum_{k=1}^{n-1} c_k/\mu_k + \sum_{k=1}^{n-1} \frac{c_k}{\mu_k} e^{\mu_k t}$ (using integration). The only problem here is that maybe $\mu_k=0$ for some $k$. But we can take $\mu_k$ very small but nonzero instead, $u_0$ is still close to $f'$.
This method gives an algorithm: everything is computable, and the only approximations are in fact the choices of "replacements" for zero $\mu_k$ (if you take it to be $1/n$, you get a sequence).
I think the following is another proof. It suffices to approximate a polynomial of degree n by an exponential polynomial of degree n + 1. Now, define $$ g_{\lambda}(x) = \frac{e^{\lambda x}}{\lambda} - 1. $$ It is easy to check that $\sup_{x\in [0,1]} |g_{\lambda} - x| \leq \frac{1}{2} \lambda$. Also $g_{\lambda}$ has order $2$. Furthermore, we can compute $$ |g_{\lambda}(x)^n - x^n| \leq |g_{\lambda}(x) -x| \cdot n \leq \frac{n}{2} \lambda. $$ And note that $g_{\lambda}(x)^n$ has order $n+1$. So a polynomial of degree $n$ given by $$ P_n(x) = \sum_{k=0}^{n} a_k x^k $$ can be approximated by $$ \sum_{k=0}^{n} a_k (g_{\lambda}(x))^k $$ up to order $n^2 \lambda$. Letting $\lambda \to 0$ implies the claim.