Hi Michael,
These are my take on your highly engaging questions.
Q1: Which server is supposed to hold the shadow message (Exch2/Exch3/Exch4)?
A: Each next hop for the primary message requires separate shadow queues.A transport server may be the primary server for some messages and the shadow server for other messages simultaneously.
So if your message directly flows from Exch1 to Exch3, then its Exch3, but if it hops through Exch2,Exch4 and then Exch3. Then all 3 would be the shadow servers.
A successfully delivered message doesn't need to be kept in a shadow queue, so once the shadow server knows the primary server has successfully transmitted the message to the next hop, the shadow server moves the shadow message from the shadow queue into
Safety Net.
The same concepts about shadow redundancy, including the transport high availability boundary, primary messages, primary servers, shadow messages and shadow servers also apply to Safety Net.
i.e. "A transport server may be the primary server for some messages and the shadow server for others"
Single Message can have multiple Shadow Safety Net servers, depending upon the hops it performed to reach the destination.
{[I'm slightly in doubt on the last line though.]- Its true if message hops through multiple transport high availability boundaries.}
Q2: In case a user sends a message outside his/her organization where would the Shadow Safety Net exist?
A:It would exist in all hops otherthan the primary sending server. Exch3 in your normal case.
Q3: Does it mean lagged database copies can not exist without Safety Net? In other words, can lagged database copies work if Safety Net is disabled?
A: Safety Net will only be triggered if Exchange considers that all other means of recovery has failed. For example, lagged DBs require the Transaction logs from Passive\Active copies to replay and become active. If you already have a Active\Passive copy
why would you activate a lagged copy in normal scenarios. Usually it means all other copies\servers(including logs) have failed and hence only means to recover is SafetyNet.
Even if the data is not available or SafetyNet is OFF, Lagged DB will work, but would be missing the data and cause inconsitencies, might require manual intervention for it to mount.
"The main requirement for successful
resubmission from Safety Net for a lagged copy is...", so there can be failed resubmission too.
"The mailbox database copy being activated
must have all log files to the point in time to which you want to recover it. Keep in mind that database transactions can span multiple log files when determining the point in time to which you want to recover."
Activate a lagged mailbox database copy by using SafetyNet recovery:
Move-ActiveMailboxDatabase
"At this point, the database will automatically mount and request redelivery of missing messages from SafetyNet."
You can see this is one of the options, not the only one.
Below terms are relative to the current message position on the transport pipeline:
They change roles as the message traverses between hops.
Primary server
|
The transport server that's currently processing a message.
|
Shadow server
|
The transport server that holds shadow copies of a message after delivering the message to the primary server.
|
Nice Illustrations:
Understanding Shadow Redundancy