BlackBerry Forums Support Community
              

Closed Thread
 
Thread Tools
Old 12-08-2009, 12:16 PM   #1
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default ...---... - BES, Exchange Failover Scenario - Major Issues

Please Login to Remove!

Hello all!

We recently implemented software that automatically replicates data from our local Exchange 2003 front-end and back-end servers to a pair of Exchange servers at a hosted facility about 50 miles away. In the event of a failure of our Exchange servers or a hole in the ground opens and swallows our building, the special failover software will automatically change DNS entries to point email clients to the new servers and then start Exchange services on them.

A few weeks ago, we did a failover test, and as far as the Exchange portion went, the test was smooth as silk. We shut down our local Exchange servers, and in a matter of ten minutes, the failover servers were up and running, and all was well...except the BES.

We could access email via the OWA portal and Microsoft Outlook. However, the BES REFUSED to talk to the failover servers and continue letting email flow.

The local Exchange servers and the failover servers have different names, but there was a DNS forwarding rule put in place that forwards all traffic sent to the original servers over to the failover servers.

So, email clients had no issues with this. The BES refused to work with this arrangement.

The following errors were in the event log:

The Kerberos client received a KRB_AP_ERR_MODIFIED error from the server host/XXXX-dc03.XXXX.org. The target name used was HOST/DBSI-MAIL. This indicates that the target server failed to decrypt the ticket provided by the client. This can occur when the target server principal name (SPN) is registered on an account other than the account the target service is using. Please ensure that the target SPN is registered on, and only registered on, the account used by the server. This error can also happen when the target service is using a different password for the target service account than what the Kerberos Key Distribution Center (KDC) has for the target service account. Please ensure that the service on the server and the KDC are both updated to use the current password. If the server name is not fully qualified, and the target domain (XXXX.ORG) is different from the client domain (XXXX.ORG), check if there are identically named server accounts in these two domains, or use the fully-qualified name to identify the server.

Also, for each and every Blackberry user, there was one of these:

User Doe, John not started

{jdoe@XXXX.org} MAPIMailbox::MAPIMailbox - OpenMsgStore (0x8004011d) failed, MailboxDN=/o=XXXX/ou=First Administrative Group/cn=Recipients/cn=ppicarie, ServerDN=/o=XXXX/ou=First Administrative Group/cn=Configuration/cn=Servers/cn=XXXX-MAIL/cn=Microsoft Private MDB

We have verified that this is not a permissions issue for the Blackberry administrator account. It does have full rights to the Exchange server and the user mailboxes.

The BES refused to talk to the failover Exchange servers for two hours, and then suddenly started working. However, when we attempted to failover the BES to its mirrored companion at the hosting facility, it wouldnt work at all. We suspect that, since we are getting the same errors on both, its the same issue causing both.

So, what could it be? Any help would be appriciated.
Offline  
Old 12-08-2009, 12:28 PM   #2
MisterGriffiths
Talking BlackBerry Encyclopedia
 
MisterGriffiths's Avatar
 
Join Date: Nov 2007
Location: London
Model: 9780
OS: 5.0.3.31
PIN: N/A
Carrier: O2
Posts: 208
Default

Two hours! Isn't that the default Information Store Caching period in Exchange 2003?

I'm not sure I totally understand whether the actual Exchange server name is changing or not.

Also, what version of BES?
__________________
BES 5.0.3 MR4
Exchange 2010 (SP1 RU3)
SQL 2008 R2
Offline  
Old 12-08-2009, 12:50 PM   #3
mahoward
CrackBerry Addict
 
mahoward's Avatar
 
Join Date: May 2005
Model: 8900
Carrier: T-Mobile
Posts: 560
Default

Is this XOSoft? If so I am very interested in this issue as we are going to be using it for our failover solution as well. Please post any information you discover, thanks!

[Edit]: I just remembered that the XOSoft product also modifies the Service Principal Name (SPN) for the destination Exchange server to allow Kerberos to keep working. Perhaps the product you have is not doing that?
__________________
BESX 4.1.7 on Exchange 2003: 65 Devices
BESX 5.0.3 on Exchange 2003: 2007 Devices

Last edited by mahoward; 12-08-2009 at 12:54 PM..
Offline  
Old 12-08-2009, 03:50 PM   #4
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default

Thanks for the replies...

This is Exchange 2003, BES 5.0, and XOSOFT.

In the failover scenario, the name of the Exchange server DOES change.

Primary Exchange Server=XXXX-MAIL
Failover Exchange Server=DBSI-MAIL
Offline  
Old 12-09-2009, 04:58 AM   #5
freakinvibe
BlackBerry Extraordinaire
 
Join Date: Aug 2008
Location: Basel
Model: Class
PIN: N/A
Carrier: Swisscom
Posts: 1,616
Default

The only Exchange fail-over solution that worked for us with BES was Exchange clustering as it is transparent to the BES. All other solutions we tested didn't really work with BES.
Offline  
Old 12-09-2009, 07:35 AM   #6
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default

Quote:
Originally Posted by freakinvibe View Post
The only Exchange fail-over solution that worked for us with BES was Exchange clustering as it is transparent to the BES. All other solutions we tested didn't really work with BES.
What kinds of issues did you have? Were they similar to the issues described above?
Offline  
Old 12-09-2009, 07:45 AM   #7
freakinvibe
BlackBerry Extraordinaire
 
Join Date: Aug 2008
Location: Basel
Model: Class
PIN: N/A
Carrier: Swisscom
Posts: 1,616
Default

We had connection issues after failover. The implementation of the MAPI/CDO access from BES to Exchange is not very good with sudden name and IP address changes. It is probably a DNS caching issue, but we didn't invest time to find the root cause.
Offline  
Old 12-09-2009, 12:58 PM   #8
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default

Quote:
Originally Posted by freakinvibe View Post
We had connection issues after failover. The implementation of the MAPI/CDO access from BES to Exchange is not very good with sudden name and IP address changes. It is probably a DNS caching issue, but we didn't invest time to find the root cause.
Well, our plan is to fail over the email servers and then run the handheldcleanup.exe -u command to hopefully redirect the BES to the right server for mailboxes.

The key word there is "hopefully".
Offline  
Old 12-15-2009, 05:03 PM   #9
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default

Here is the solution that we are going to attempt tonight:
1. Fail over the Exchange servers.
2. Force AD replication.
3. Run the handheldcleanup.exe -u tool.

We will let you know what happens.
Offline  
Old 12-15-2009, 11:51 PM   #10
Harry Azol
Knows Where the Search Button Is
 
Join Date: Mar 2009
Model: yes
PIN: N/A
Carrier: yes
Posts: 35
Default

Just curious - did you look at the Exchange 2007 geographically disperse CCR cluster option instead of paying for a 3rd party product on 2003?
Offline  
Old 12-18-2009, 03:28 PM   #11
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default

Quote:
Originally Posted by Harry Azol View Post
Just curious - did you look at the Exchange 2007 geographically disperse CCR cluster option instead of paying for a 3rd party product on 2003?
Yep.

The higher-ups wanted email high availability NOW, but they were not willing to shell out the cash for the Exchange 2007 upgrade until NEXT year. So, we are shelling out cash for XOSOFT now to give them what they want now even though we could give it to them a few months from now WITHOUT spending a begrillion dollars on a failover product that will not have any other use for one Exchange 2007 is in place.

Do they teach "common sense" anywhere in business school???
Offline  
Old 12-18-2009, 03:30 PM   #12
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default

Quote:
Originally Posted by LordZordec View Post
Here is the solution that we are going to attempt tonight:
1. Fail over the Exchange servers.
2. Force AD replication.
3. Run the handheldcleanup.exe -u tool.

We will let you know what happens.
The outcome: The BES started to work after 1 hour instead of 2, and when we failed over to the other BES at our remote site, it worked like a charm (vs not working at all - big improvement).

The process isnt perfect yet, but the two key things that made it work were 1) forcing AD replication, and 2) running the handheldcleanup.exe tool.
Offline  
Old 12-23-2009, 11:31 AM   #13
mahoward
CrackBerry Addict
 
mahoward's Avatar
 
Join Date: May 2005
Model: 8900
Carrier: T-Mobile
Posts: 560
Default

Thanks for the updated information, m'lord! We have the EXACT same higher up reasoning as you do except we are waiting to move to 2010 and skipping 2007. Glad to hear it is working for you with the new process.
__________________
BESX 4.1.7 on Exchange 2003: 65 Devices
BESX 5.0.3 on Exchange 2003: 2007 Devices
Offline  
Old 12-23-2009, 11:42 AM   #14
mahoward
CrackBerry Addict
 
mahoward's Avatar
 
Join Date: May 2005
Model: 8900
Carrier: T-Mobile
Posts: 560
Default

BTW, do you have a separate AD site where your failover target Exchange server exists? We are going with a single AD site stretched across the primary and DR physical locations which would reduce the AD replication latency delay.
__________________
BESX 4.1.7 on Exchange 2003: 65 Devices
BESX 5.0.3 on Exchange 2003: 2007 Devices
Offline  
Old 12-28-2009, 04:57 PM   #15
LordZordec
New Member
 
LordZordec's Avatar
 
Join Date: Sep 2009
Model: NA
PIN: N/A
Carrier: T-Mobile, Verizon, AT&T
Posts: 8
Default

Quote:
Originally Posted by mahoward View Post
BTW, do you have a separate AD site where your failover target Exchange server exists? We are going with a single AD site stretched across the primary and DR physical locations which would reduce the AD replication latency delay.
We actually have a separate AD site for our remote failover facility, which I am sure is a factor in this as well.

Hmmmmm...now the wheels in my head are spinnning though...
Offline  
Closed Thread



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


Vintage V-Mac Industries Inc. Pipe Threader Vosper Drophead Threader - READ picture

Vintage V-Mac Industries Inc. Pipe Threader Vosper Drophead Threader - READ

$199.00



Vintage Mac Warehouse  3.5” Floppy Disk Solar Powered Calculator Company Swag picture

Vintage Mac Warehouse 3.5” Floppy Disk Solar Powered Calculator Company Swag

$55.50



MAC USA 18-6 XDM Six Points 18 MM Socket 3/8

MAC USA 18-6 XDM Six Points 18 MM Socket 3/8" Drive Used Vintage Excellent Condt

$14.99



Vintage MAC Tools UVEX Adjustable Safety Glasses Motorcycle Mechanic Lawnmower picture

Vintage MAC Tools UVEX Adjustable Safety Glasses Motorcycle Mechanic Lawnmower

$55.24



Vintage White APPLE IMAC EMC 1857 15

Vintage White APPLE IMAC EMC 1857 15" 20GB HDD Mac OSX 10.2 256MB RAM 500MHz

$85.00



Vintage Mac Tools AW343 Series 1/2 Pneumatic Impact Driver  picture

Vintage Mac Tools AW343 Series 1/2 Pneumatic Impact Driver

$50.00







Copyright 2004-2016 BlackBerryForums.com.
The names RIM and BlackBerry are registered Trademarks of BlackBerry Inc.