Basically, if you think of these things in terms of the rate parameter and the MTBF (mean time between failures), you'll be less likely to make a mistake. The conventions are really just notational things, but the rate parameter and the MTBF will be intrinsic to the system in question.
So if you think of it as "E[T] = inverse of rate parameter = MTBF parameter" and "Var[T] = (inverse of rate parameter)^2 = (MTBF)^2", etc., you'll be less likely to slip up than if you think of it as "Exp(lambda) has mean lambda" (because the latter part requires you to remember notation/convention whereas the former is independent of which convention or notation is used).
E.g. if we're told X ~ Poisson(7), that tells us the "rate parameter" of the Poisson process is 7 (because Poisson only has one convention, so no confusions with this), so then we know T has mean 1/7 (= 1/(rate parameter)), and we also know the pdf of T is 7e-7t (because we are remembering the pdf as "rate parameter * exp(-(rate parameter)*t)).
Similarly, if we're told T is the lifetime of a lightbulb that has expected lifetime 500 hours and is exponentially distributed, we know that the pdf is (1/500)*exp(-(t/500)), because we are remembering the pdf as being "1/(MTBF) * exp(-(t/MTBF))". If we instead just remembered it via the notation method, we could get confused between the two conventions (because we'd need to be sure about whether it's Exp(500), or Exp(1/500), etc.).
[You should still know the conventions though, of course, but just make sure to also understand these in terms of rate parameter and MTBF so that you can get the right answers etc.]