BlackBerryForums.com : Your Number One BlackBerry Community
     

»Sponsored Links


BlackBerryApps.com Best Sellers



Closed Thread
 
LinkBack Thread Tools
  (#1 (permalink)) Old
Wakefield103 Offline
Knows Where the Search Button Is
 
Posts: 36
Join Date: Nov 2005
Location: Central Florida
Model: 8130
Carrier: Verizon
Question HELP - BES Delays Delivering Messages - 11-09-2005, 06:26 PM

I need help!! We are experiencing message delivery delays with our BES. Users are receiving messages delayed 5 mins - 2 hours from the time the same messages are received in their Inbox. When messages are received, they are being received in batches (x number of messages received at the same time regardless of when they were sent).

Configuration:
Standalone BES 3.6 (Service Pack 2)
107 Users (mix of T-Mobile, Cingular, Verizon)
Mix of handhelds: oldest is 6210, newest is 7100c
2 Exchange 2003 Servers (SP 1) - one local, one remote
cdo.dll and mapi32.dll are Ex 2003 SP1 version

We first became aware of this problem last Thursday (11/3). The problem seems to come and go depending on the time of day. When the problem is occurring, all users seem to be affected regardless of their carrier or device. The problem seems to appear around 10:00am-12:00pm (eastern time) each day. The problem seems to die down or go away late at night and continue that way through the morning. I know this all sounds like it should point to something in our environment or configuration, but I can’t figure out what that may be.

Our BES and Exchange servers are both within our corporate firewall. We have not discovered any problems on the LAN or WAN. The Exchange servers do not have any abnormal errors or warnings. When within the BES Management utility and watching the User Stats for a given user, we see a delay with the “Pending to handheld” queue incrementing. Once the queue does increment, the messages are received on the handheld almost instantly after.

We have restarted the service. We have rebooted the computer. Neither has resolved the problem (although it MAY have lessened the duration of the delays - we don't have enough data to support that). We have run the IEMSTest utility to verify the connectivity with the Exchange server. We have verified the cdo.dll and mapi32.dll versions on the BES server. We have verified the permissions on each of the Exchange servers. We have verified that PIN to PIN messages work. We have verified that messages sent via the BlackBerry Web Client are delivered promptly.

Your thoughts and suggestions are greatly appreciated.

Dagon
   
Sponsored Links
Please Login or Register to Remove these Advertisements!

  (#2 (permalink)) Old
BBTechGuy Offline
Talking BlackBerry Encyclopedia
 
Posts: 299
Join Date: May 2005
Model: 7100
Carrier: T-Mobile
Default 11-09-2005, 07:52 PM

Does the Application Log on the BlackBerry Server show a lot of Warnings about
Blocked Threads?
   
  (#3 (permalink)) Old
dev Offline
CrackBerry Addict
 
dev's Avatar
 
Posts: 562
Join Date: Apr 2005
Location: Toronto
Model: 8800
Carrier: Rogers
Default 11-09-2005, 10:00 PM

Also, what's the latency between the BES and the remote Exchange server? The latency limitation is <35ms between the BES and Exchange server(s). Anything greater and the users will experience delays in mail redirection and PIM data synchronization.
Are the users with the highest pending count located on the remote Exchange?

Have you used the /3GB switch in the boot.ini file to address the virtual memory address allocation issues? See the following link..http://support.microsoft.com/kb/815372/

Take a look at the event viewer, the BES logs and exchange servers CPU usage during the peak times and compare with non-peak hours.

Are any of your servers connected to a hub? If so use a switch instead to reduce the collisions.
For a 100+ users that type of delay points to system resources or network issues.
   
  (#4 (permalink)) Old
jrbes Offline
Thumbs Must Hurt
 
jrbes's Avatar
 
Posts: 154
Join Date: Jun 2005
Location: Here
Model: 8700
Carrier: Cingular
Default 11-10-2005, 09:32 AM

Have your network guys put a watch on your connection between the BES and EXCH servers for connections. Sounds like you are getting hung threads and they aren't clearing. Moving to BES4.0.1 fixed this issue with us, on 3.6.x we had the servers rebooting twice a day to clear the threads.
   
  (#5 (permalink)) Old
Wakefield103 Offline
Knows Where the Search Button Is
 
Posts: 36
Join Date: Nov 2005
Location: Central Florida
Model: 8130
Carrier: Verizon
Default 11-10-2005, 09:45 AM

Hi BBTechGuy,

Thanks for the response. Actually, we are seeing warnings about blocked threads. The specific warnings have one of the following descriptions:

At least one worker thread seems to be blocked (3) < --- The # in the ( )’s changes
Some worker threads have been blocked for 3 health checks < --- The # changes

I used to think that these warnings were random, but going back over the Event Viewer it does look like we are receiving these around the time we are seeing the delays and are not receiving them when we do not see the delays. What is your suggestion?

Hi dev,

Thanks for your response as well. The latency between the BES and the remote Exchange server is ~120 ms. Yes, I know that is greater than MS recommended MAPI latency limit of <35 ms. But, we have been running in this configuration for over 2 years and have not had this problem before (that I know of). However, we do have more users now. I am not ruling this out…

Looking over the pending count, there does not seem to be a big difference between users on the remote Exchange server and the local Exchange server. I am on the local Exchange server and when the problem occurs, I am definitely seeing it.

The /3GB switch is set on all the Exchange servers. It is not set on the BES (BES has 2 GB of RAM). All the servers are connected to 100 Mb switched Ethernet with a gig backbone. We have a 3 Mb frame connection to our remote site.

Hi jrbes,

Thank you for the response. We have not performed any packet captures between our BES and Exchange Servers. I am not exactly sure what we would be looking for. We have verified that connectivity is consistent between the servers and that MAPI connections can be established. Any thoughts as to what we would look for? BES 4.0 is in our future, but unlikely that management would approve just upgrading to it without full planning, testing, and approved downtime.

All good thoughts. Keep them coming!

Dagon
   
  (#6 (permalink)) Old
noname Online
BlackBerry Extraordinaire
 
noname's Avatar
 
Posts: 1,003
Join Date: Sep 2005
Location: Congested Islet of Foreign Talents.
Model: 8320
OS: 5.0.0.114
PIN: FFFFFFFF
Carrier:
Default .. hey guys, check this out... - 11-10-2005, 09:53 AM

KB-01685 "What Is - Factors that contribute to latency"

Source: http://www.blackberry.com/knowledgec... 801&vernum=0

May be helpful...

-= noname =-
   
  (#7 (permalink)) Old
jrbes Offline
Thumbs Must Hurt
 
jrbes's Avatar
 
Posts: 154
Join Date: Jun 2005
Location: Here
Model: 8700
Carrier: Cingular
Default 11-10-2005, 11:50 AM

Look for MAPI sessions being established but also MAPI connections not being released. Each hung thread that you see is 10 minutes (3 health checks = 30 minutes of delay for messages that are dependant upon those threads).

Are you seeing a lot of rescans in your log files?
   
  (#8 (permalink)) Old
Wakefield103 Offline
Knows Where the Search Button Is
 
Posts: 36
Join Date: Nov 2005
Location: Central Florida
Model: 8130
Carrier: Verizon
Default 11-10-2005, 02:40 PM

Thanks jrbes,

We are seeming some rescans. I am not sure what is normal and what is "a lot".

What would cause hung threads?

Dagon
   
  (#9 (permalink)) Old
jrbes Offline
Thumbs Must Hurt
 
jrbes's Avatar
 
Posts: 154
Join Date: Jun 2005
Location: Here
Model: 8700
Carrier: Cingular
Default 11-10-2005, 02:57 PM

Hung threads are caused by connections (MAPI) to the Exchange server not terminating in a timely fashion - 4.0 fixed this for us as it will reset inactive connections.
   
  (#10 (permalink)) Old
Wakefield103 Offline
Knows Where the Search Button Is
 
Posts: 36
Join Date: Nov 2005
Location: Central Florida
Model: 8130
Carrier: Verizon
Angry 11-11-2005, 09:43 AM

A week now and the problem is still going on.

So does anyone have any other suggestions as to how to prevent blocked\hung threads? I have gone through a day's debug logs and it looks for that given day, all the logged blocked threads were from approximately 10 users. However, I was not one of the users and I was still seeing delays. My understanding of the blocked threads, was that only the user's thread that was blocked would experience the delay. In reality, it seems that if there are any blocked threads (or maybe X number), then it impacts everyone on the BES.

We are not seeing anything on the Exchange server that gives us any indication there is a problem communicating with the BES server.

One thing that concerns me, is we didn't have this problem before. The only change I can think of would be users added or removed from the BES.

Also, does anyone have any suggestions on a good text editor \ parser \delimiter to use to view the debug logs? Idealy I would like to input them into Excel, but the columns do not come in cleanly.

Dagon

Last edited by Wakefield103 : 11-11-2005 at 10:57 AM.
   
  (#11 (permalink)) Old
jrbes Offline
Thumbs Must Hurt
 
jrbes's Avatar
 
Posts: 154
Join Date: Jun 2005
Location: Here
Model: 8700
Carrier: Cingular
Default 11-11-2005, 10:21 AM

A blocked thread can cause delays on anyone's device, not just the person that is attached to the hung thread (this is what RIM told me). Preventing them is based primarily on your network and the speed between your BES and Mail server (especially if you are an Exchange shop - MAPI is a dog with this until version 4.0).

Our hung thread issues went away after upgrading to 4.0 SP1 and applicable HF's. Once on 4.0 you can use the Resource kit to view the logs a little easier but I haven't found anything that makes reading those darn things extremely easy on the eyes.

Hope this helps.


Policies get in the way of fun.
   
  (#12 (permalink)) Old
BB_God Offline
Thumbs Must Hurt
 
Posts: 73
Join Date: Jun 2005
Model: 7100i
Carrier: Rogers
Default 11-13-2005, 10:43 AM

To prevent or stop hung threads your first have to understand what event they are hanging on. Grep your Debug logs for "No Response" (without the quote" and it will show all the hung thread events. The key things that it will show, that are usefull, are:
- The action it is trying to perform
-> RESCAN_SOMETHING
-> NEW_MAILBOX_PACKET
-> NEW_MESSAGE Etc.
Are these the same or different?

- The user for who the thread is hung for
- The WAITCOUNT - 1=10 minutes and after 5 WAITCOUNTS it will print a stack trace.
- The Exchange server the user is on
-> Is this the same or are their any similarities (site etc.)?

If you are still stuck copy a couple of lines from your grep and someone might be able to give you some additional insight.
   
  (#13 (permalink)) Old
BBTechGuy Offline
Talking BlackBerry Encyclopedia
 
Posts: 299
Join Date: May 2005
Model: 7100
Carrier: T-Mobile
Default 11-14-2005, 07:19 AM

Quote:
Originally Posted by jrbes
Look for MAPI sessions being established but also MAPI connections not being released. Each hung thread that you see is 10 minutes (3 health checks = 30 minutes of delay for messages that are dependant upon those threads).

Are you seeing a lot of rescans in your log files?
The BES will perform roughly the same number of rescans in both a healthy and non-healthy environment. The question you should be asking is whether a lot of Messages are being picked up through Rescan which is where you would see Delay come in.
   
  (#14 (permalink)) Old
boma0021 Offline
Talking BlackBerry Encyclopedia
 
boma0021's Avatar
 
Posts: 202
Join Date: Jan 2005
Location: LE
Model: Pearl
Carrier: T-Mobile
Default 11-14-2005, 09:23 AM

Have you looked at this KB : Installing security update MS05-019 or Windows Server 2003 Service Pack 1 may cause network connectivity between clients and servers to fail

http://support.microsoft.com/default...-us;898060#kb2
   
  (#15 (permalink)) Old
Woding_d Offline
New Member
 
Posts: 1
Join Date: Dec 2005
Model: 7290
Default 12-07-2005, 10:50 AM

We have exactly the same problem. Blackberry version 3.6 with about 160 users.

Outside of working hours all is OK but between 09:00h and 12:30h and again between 14:00 and 18:30h the delays to synchronise email to the Blackberry handhelds increase even up to a couple of hours. t the same time we are seeing "hung threads" inthe error logs, usually associated with RELOAD_FOLDERS events. In time the reload folders does complete but can sometimes take over an hour to complete - the same RELOAD_FOLDERS for the same user completes in a matter of seconds when it runs outside of working hours.

For us this has been going on for over a month, despite all efforts to find a solution. Wakefield103, perhaps you could contact me by email so we can discuss this offline.
   
  (#16 (permalink)) Old
BBTechGuy Offline
Talking BlackBerry Encyclopedia
 
Posts: 299
Join Date: May 2005
Model: 7100
Carrier: T-Mobile
Default 12-07-2005, 07:31 PM

Have either of you looked up Microsoft's document for Exchange 2000 Troubleshooting? This will contain PerfMon Counters you'd want to watch and how to interpret the data. I don't know if there's a 2003 version but I'm sure the values would be similar anyways.
   
  (#17 (permalink)) Old
Lurch0001 Offline
New Member
 
Posts: 2
Join Date: Nov 2005
Location: Sydney
Model: 8100
Carrier: Telstra
Default 12-08-2005, 08:37 PM

We're seeing an identical problem, although we have a slightly different config:

Standalone BES 4.0 (Service Pack 3)
102 Users
3 Exchange Servers - one local (2003 SP2, ping <1ms), two remote (1x 2003 Sp2, ping 180ms, 1x 2000 sp3, ping 400ms).

Like you, everything is fine out of hours and on the weekends, but during the day, particularly late morning and mid-afternoon, delays can be anywhere from 20 mins to 1 hour. We've had a similar config for about 12 months, only started noticing the delays in the last 4 weeks or so.

We did a test yesterday where we removed all the users on the remote Exchange servers but this made little difference, if any.

Our local Exchange server is *very* busy (~2000 users, 600GB of stores) and RIM have suggested that the MAPI connections are timing out which is causing the delays. So today I tested that theory by moving my mailbox to a new Exchange server with no other users on it - and now I'm not seeing any delays. So I think our problem is that our Exchange server really is too busy to respond quick enough for BES to handle. Looks like it's time for someone to write a cheque for a new Exchange server...
   
  (#18 (permalink)) Old
technickel Offline
Thumbs Must Hurt
 
Posts: 112
Join Date: Mar 2006
Model: 8703e
Carrier: Sprint
Default 08-15-2006, 02:04 PM

Can anyone give a straight answer as to how to clear blocked threads..
The BES has gone from 1 to 6 heath checks and stops for mail server 3.


Your BlackBerry Did What!!
Exchange 2003 SP2
BES 4.1.2
   
  (#19 (permalink)) Old
Korbyn Offline
Thumbs Must Hurt
 
Korbyn's Avatar
 
Posts: 81
Join Date: May 2005
Location: Calgary, AB
Model: 8330
Carrier: Telus
Default 08-15-2006, 02:42 PM

I found issues, and RIM apparently has confirmed it, with Microsoft security patch MS05-051, and can block the UDP packets that the BES server is waiting for. What I hear you guys describing is that the emails are actually getting delivered to the devices when the BES does a rescan, which it does typically every 20-30 minutes.

If you go into your agent log files (servername_MAGT_1) and search for "Queuing new mail through rescan" (without the "") If you are seeing hundreds if not thousands of those, chances are the UDP packets from the exchange server aren't reaching the BES application, and can be blocked from everything including the firewall, port filtering, or the MS05-051 patch.

I also experience with that MS05-051 patch, smaller groups of users, usually between 22 and 32, getting a hung thread. I removed the patch and didn't have any issues for two weeks... Then I upgraded to 4.1 SP1... But before then I was seeing the hung thread 2 or 3 times a day.

Check your logs, but the is a good chance if you have no firewalls between your bes and exchange server that MS05-051 is the source of your issue.
   
  (#20 (permalink)) Old
||||||| Offline
CrackBerry Addict
 
|||||||'s Avatar
 
Posts: 615
Join Date: Jun 2006
Model: 7100
Carrier: Rogers
Default 08-15-2006, 03:11 PM

Quote:
Originally Posted by technickel
Can anyone give a straight answer as to how to clear blocked threads..
The BES has gone from 1 to 6 heath checks and stops for mail server 3.
Reboot

permanent fix is to decrease Ping times. 30 msec or less is required but you may be fine with under 90msec
   
  (#21 (permalink)) Old
Korbyn Offline
Thumbs Must Hurt
 
Korbyn's Avatar
 
Posts: 81
Join Date: May 2005
Location: Calgary, AB
Model: 8330
Carrier: Telus
Default 08-17-2006, 10:27 AM

I don't think the delay's for someof the people involved in this topic were based on health check failures, I do agree, high ping times are bad, but the 5-20 minute delays in email delivery without Health Check failures points more towards the server not receiving new mail UDP packets;

Anyway, here's what RIM's response on MS05-051:
There are steps listed in the Microsoft KB to apply each fix individually, and we recommend that you apply all except the COM+ fix. The COM+ fix prevents the MAPI sub-system from forwarding the notifications from the Exchange Server regarding new mail to the BlackBerry Enterprise Server software.

I haven't looked at the article to see, just sharing their response...
   
  (#22 (permalink)) Old
BB-Munkey Offline
New Member
 
Posts: 11
Join Date: Mar 2006
Location: Kansas City
Model: 8100
Carrier: Cingular
Default 08-30-2006, 12:28 AM

any tips on uninstalling MS05-051 patch (KB902400)? Add/Remove Software tells me the patch cannot be removed. Thanks.
   
  (#23 (permalink)) Old
fouttyw Offline
New Member
 
Posts: 3
Join Date: Jan 2007
Model: 7120
Default 01-02-2007, 08:21 AM

Do you have any offical documentation of this <35ms delay?
   
  (#24 (permalink)) Old
jhuck Offline
New Member
 
Posts: 3
Join Date: Nov 2006
Model: 8703
Default Any success? - 04-10-2007, 08:52 PM

I was wondering if anyone has had any success resolving this issue. I am having the same problems with delayed message delivery to the blackberries. Latency does not seem to be an issue. Ping time is always less the 1 sec between the bes and exchange. I have windows 2k3 sp1 running on both servers. We are running exchange 2k3 sp2 with about 200 mailboxes. The bes server is 4.0 with sp6 recently loaded on it to try and resolve this situation. The issue didn't seem to occcur until after i loaded the dst upgrades. I also had to upgrade the system manager to sp2 on the bes server. I'm not sure if any of the upgrades caused this. Any suggestions?
   
  (#25 (permalink)) Old
stuwhite Offline
Kneel before Zod!
 
stuwhite's Avatar
 
Posts: 1,263
Join Date: Jan 2007
Location: U to the K
Model: 9000
PIN: 42
Carrier: Most of them, it's a Global Village man!
Default 04-11-2007, 06:08 AM

Similar to Lurch's post, I see from all this info that it looks very much to be an Exch connection prob. Using perfmon on the Exch boxes is a really good idea but a general watch on your Exch boxes to see what they are doing, when is what it will take (unless you have some cool monitoring tools!).

IMHO, the only real way to troubleshoot dodgy exch boxes is to sit and watch them for a few hours and get a feel for what they are doing. You are going to have to go back to troubleshooting basics and sit and watch all cpu, memory and network traffic on the boxes for a while. It will be tedious but I am convinced you will find the prob. You probably know about this already but Process Monitor is top for this stuff.


Oh OK. Well when you put it like that, NO!

{4.1.6 x 2 & Exch 2K3 x many}
   
  (#26 (permalink)) Old
edonin Offline
Thumbs Must Hurt
 
Posts: 110
Join Date: Mar 2006
Model: 8800c
Carrier: Cingular
Default 04-11-2007, 08:22 AM

We had an instance after applying an upgrade where despite decent ping responses between BES and Exchange, we were experiencing hung threads and delivery delays.

TSupport had me open the registry and go to: HKEY_CURRENT_USER\Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging Subsystem\Profiles

They then had me delete the dozens & dozens of stale CDO_ keys buried under there which were stressing the BES and causing the delay issue.

A few babies got thrown out with the bath water. It's by no means a clean solution, but we haven't experienced the delivery delays in the months since.

I don't know that this will help those of you with less than optimal network connectivity between BES and Exchange, but it sure helped us.
   
Closed Thread


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On





Copyright © 2004-2009 BlackBerryFAQ.com, BlackBerryForums.com.
The names RIM © and BlackBerry © are registered Trademarks of Research In Motion Limited.