Re: [ldm-users] Assistance requested for "Gap in packet sequence" log entries from noaaportIngester

  • To: NOAAPORT <noaaport@xxxxxxxxxxxxxxxx>, "ldm-users@xxxxxxxxxxxxxxxx" <ldm-users@xxxxxxxxxxxxxxxx>, Steve Emmerson <emmerson@xxxxxxxx>, "yoksas@xxxxxxxx" <yoksas@xxxxxxxx>, Gregory Grosshans <gregory.grosshans@xxxxxxxx>
  • Subject: Re: [ldm-users] Assistance requested for "Gap in packet sequence" log entries from noaaportIngester
  • From: Raymond Weber <admin@xxxxxxxx>
  • Date: Wed, 17 Jun 2020 19:32:19 +0000
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=ndws.com; dmarc=pass action=none header.from=ndws.com; dkim=pass header.d=ndws.com; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=fIbNIrr3bxkifnq6t1PvmyYFPxjWXKl9q+imGG9aH04=; b=GLEu8IRr/cd7RqK+18N+fUBNB8gi7LNwgiDMm3jDRNV9VFDHLlD1IBWIAQRgUybB+Qkj875dxf2wiqdQWUBzC0ETw6VpbT+h2Y1VFaX+J1c1chJFg7C76SosfGAlHoML+HLVecl3jZ4CLxOnYkTP4/SrVr3IsBA3XFYpEh8SgOI6B3uoLEsJHoGvqQManxgCW5GiXjD8Nzjabhn1BeGb66Fj9M52+bDnev7H2xrGkW8UGHxMIDut7r1GvFmW/o3X7hgyJuD/vRvtfZK23IZYzgPaGLbht46bMEm+SDPn4BKw+4bIHJPiLpr8ohfEyXSDSz/8pBSXAw7Q4fJGNh8lmw==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Hcvxigun45GVTIdrCKVR/hGvLO4wjcNtrBh6xjt6QWvBJm5FQ4/BSI5mzg2QBlK8eeyr9XAKwbYGwIgjxhBKFP0t9gmnJpATLYtkPDEOQvMayLfO2WWcP2pCZPQ+csCky775vnBpt/+Dfs25rzCL8htPJuVtEFg1/9wbnm+qo1IOsAnDN+qALLBQPpkkR+BFcyekYBV/Uk+yE5SJdOrvIAaAdkwfMt51gMLvd9lgxhx1W/OYVoRBQ0a3zdJ50OBnP4s97GgXkrZR0StQZYxitnWiBGlXdoQ1v7Pt2kphE3SnZCvnVqS0Yc3sj79/gKt5POxWusTbv8GPvWufckg+6Q==
  • Authentication-results: unidata.ucar.edu; dkim=none (message not signed) header.d=none; unidata.ucar.edu; dmarc=none action=none header.from=ndws.com;
you need to set these linux parameters to allow out of sequence packets with 
noaaport:

sysctl -w net.ipv4.ipfrag_max_dist=8192
sysctl -w net.ipv4.conf.default.rp_filter=2
sysctl -p

Ray Weber
MA Skywarn
________________________________
From: ldm-users <ldm-users-bounces@xxxxxxxxxxxxxxxx> on behalf of Gregory 
Grosshans via ldm-users <ldm-users@xxxxxxxxxxxxxxxx>
Sent: Wednesday, June 17, 2020 1:34 PM
To: NOAAPORT <noaaport@xxxxxxxxxxxxxxxx>; ldm-users@xxxxxxxxxxxxxxxx 
<ldm-users@xxxxxxxxxxxxxxxx>; Steve Emmerson <emmerson@xxxxxxxx>; 
yoksas@xxxxxxxx <yoksas@xxxxxxxx>
Subject: [ldm-users] Assistance requested for "Gap in packet sequence" log 
entries from noaaportIngester

We are replacing legacy SBN ingest software and spinning up the Unidata 
noaaportIngester (i.e. LDM version 6.13.11) on RHEL7 / RHEL6.  Unfortunately, 
we continue to receive "Gap in packet sequence" entries in various log files 
(e.g. nwstg, nwstg2, nother, goes and polarsat), and in particular the 
polarsat.log file has many of these entries.  Can you please review the 
information below, ask clarifying questions, and hopefully offer suggestions on 
how to determine the cause of the Gap entries and steps to eliminate them?

Thank you for your time,
Gregg


The NOVRA firmware being used is:
V2R15

LDM Version and Server Information:
6GB tmpfs for LDM (6.13.11) queue on a Dell R410, 64 GB, dual Intel Xeon X5667 
@3.07GHz, RHEL7.8
2GB tmpfs for LDM (6.13.6)  queue on a Dell R410, 32 GB, dual Intel Xeon X5667 
@3.07GHz, RHEL6.10


The SBN Dish is a typical NWS SBN dish, feeding a splitter with multiple NOVRA 
boxes on the other side of the splitter.

I've checked with another National Center and they are not receiving these Gap 
entries, and we have tried their noaaportIngester executable on our SPC system 
and still receive the Gap entries.

We have tried multiple NOVRA boxes on the RHEL7 server and continue to receive 
Gap entries.  These same NOVRA boxes work on other systems at SPC with no 
issues.  Different NOVRA boxes have been tried with no success (i.e. in terms 
of having no Gap entries in the log files).


There are very infrequent instances of Gaps in packets appearing on other 
systems at the same time, for example on the AWIPS cpsbn1 server in the LDM 
noaaportIngester log file.  Thus indicating the Gap is in multiple systems, 
using different NOVRA boxes and connections, perhaps on the SBN uplink, or 
perhaps downlink if there is local weather (e.g. perhaps LTG) causing 
interference.


I've worked with several of my IT colleagues at SPC and we have eliminated the 
possibility of the Gaps in packets as a result of the NOVRA box, and 
connections between pre/post NOVRA box.  This leads us to believe the errors 
are a result of something on the RHEL6 and RHEL7 servers, to possibly include 
the noaaportIntester.


noaaportIngester is being invoked via LDM with the following entries from 
ldmd.conf:


EXEC    "noaaportIngester -I 10.0.5.50 -m 224.0.1.1  -n -c -u 3 -r 1 -s NMC"

EXEC    "noaaportIngester -I 10.0.5.50 -m 224.0.1.2  -n -c -u 4 -r 1 -s GOES -f"

EXEC    "noaaportIngester -I 10.0.5.50 -m 224.0.1.3  -n -c -u 5 -r 1 -s NMC2"

EXEC    "noaaportIngester -I 10.0.5.50 -m 224.0.1.4  -n -c -u 6 -r 1 -s 
NOAAPORT_OPT"

EXEC "noaaportIngester -I 10.0.5.50 -m 224.0.1.5  -n -c -u 7 -r 1 -s NMC3"

EXEC "noaaportIngester -I 10.0.5.50 -m 224.0.1.6  -n -c -u 4 -r 1 -s ADD"

EXEC "noaaportIngester -I 10.0.5.50 -m 224.0.1.7  -n -c -u 7 -r 1 -s ENC"

EXEC "noaaportIngester -I 10.0.5.50 -m 224.0.1.8  -n -c -u 7 -r 1 -s EXP"

EXEC "noaaportIngester -I 10.0.5.50 -m 224.0.1.9  -n -c -u 4 -r 1 -s GRW"

EXEC "noaaportIngester -I 10.0.5.50 -m 224.0.1.10 -n -c -u 4 -r 1 -s GRE"



LDM settings:


[ldmcp@sbn1 ~]$ regutil

/delete-info-files : 0

/hostname : sbn1.spc.noaa.gov<http://sbn1.spc.noaa.gov/>

/insertion-check-interval : 300

/oess-pathname : /home/ldmcp/etc/OESS-account.yaml

/reconciliation-mode : do nothing

/check-time/enabled : 1

/check-time/limit : 10

/check-time/warn-if-disabled : 1

/check-time/ntpdate/command : /usr/sbin/ntpdate

/check-time/ntpdate/servers : ntp.spc.noaa.gov<http://ntp.spc.noaa.gov/> 
ntp1.spc.noaa.gov<http://ntp1.spc.noaa.gov/> 
ntp2.spc.noaa.gov<http://ntp2.spc.noaa.gov/>

/check-time/ntpdate/timeout : 5

/metrics/count : 4

/metrics/file : /home/ldmcp/logs/metrics.txt

/metrics/files : /home/ldmcp/logs/metrics.txt*

/metrics/netstat-command : /bin/netstat -A inet -t -n

/metrics/top-command : /bin/top -b -n 1

/log/count : 7

/log/file : /home/ldmcp/logs/ldmd.log

/log/rotate : 1

/pqsurf/config-path : /home/ldmcp/etc/pqsurf.conf

/pqsurf/datadir-path : /home/ldmcp/data

/scour/config-path : /home/ldmcp/etc/scour.conf

/surf-queue/path : /home/ldmcp/queues/pqsurf.pq

/surf-queue/size : 2M

/server/config-path : /home/ldmcp/etc/ldmd.conf

/server/enable-anti-DOS : TRUE

/server/ip-addr : 0.0.0.0

/server/max-clients : 256

/server/max-latency : 3600

/server/port : 388

/server/time-offset : 3600

/queue/path : /ldmcp/data/queues/ldm.pq

/queue/size : 6000M

/queue/slots : default

/pqact/config-path : /home/ldmcp/etc/pqact.conf

/pqact/datadir-path : /home/ldmcp/data/data

[ldmcp@sbn1 ~]$



Gap entries in various log files starting with new logs starting at ~1537Z 
(from Monday June 15):


[ldmcp@sbn1 ~/logs]$ grep Gap *log | more

goes.log:Jun 15 15:37:55 sbn1 noaaportIngester[3457]: 
productMaker.c:439:pmStart() Gap in packet sequence: 513870098 to 514102655 
[skipped 232556]

goes.log:Jun 15 15:38:01 sbn1 noaaportIngester[3456]: 
productMaker.c:439:pmStart() Gap in packet sequence: 551431874 to 551639014 
[skipped 207139]

goes.log:Jun 15 15:38:08 sbn1 noaaportIngester[3449]: 
productMaker.c:439:pmStart() Gap in packet sequence: 72489 to 72516 [skipped 26]

goes.log:Jun 15 15:38:08 sbn1 noaaportIngester[3453]: 
productMaker.c:439:pmStart() Gap in packet sequence: 72489 to 72516 [skipped 26]


nwstg2.log:Jun 15 15:37:53 sbn1 noaaportIngester[3450]: 
productMaker.c:439:pmStart() Gap in packet sequence: 1469313177 to 1469734315 
[skipped 421137]

nwstg2.log:Jun 15 15:37:53 sbn1 noaaportIngester[3450]: 
productMaker.c:439:pmStart() Gap in packet sequence: 1469734334 to 1469734336 
[skipped 1]

nwstg2.log:Jun 15 15:37:53 sbn1 noaaportIngester[3450]: 
productMaker.c:439:pmStart() Gap in packet sequence: 1469734343 to 1469734345 
[skipped 1]

nwstg2.log:Jun 15 15:37:53 sbn1 noaaportIngester[3450]: 
productMaker.c:439:pmStart() Gap in packet sequence: 1469734353 to 1469734355 
[skipped 1]

nwstg2.log:Jun 15 15:37:53 sbn1 noaaportIngester[3450]: 
productMaker.c:439:pmStart() Gap in packet sequence: 1469734355 to 1469734361 
[skipped 5]

nwstg2.log:Jun 15 15:37:53 sbn1 noaaportIngester[3450]: 
productMaker.c:439:pmStart() Gap in packet sequence: 1469734362 to 1469734367 
[skipped 4]

nwstg2.log:Jun 15 15:37:53 sbn1 noaaportIngester[3450]: 
productMaker.c:439:pmStart() Gap in packet sequence: 1469734367 to 1469734377 
[skipped 9]


nwstg.log:Jun 15 15:37:54 sbn1 noaaportIngester[3448]: 
productMaker.c:439:pmStart() Gap in packet sequence: 560043371 to 560207084 
[skipped 163712]


polarsat.log:Jun 15 15:37:56 sbn1 noaaportIngester[3452]: 
productMaker.c:439:pmStart() Gap in packet sequence: 51060086 to 51060113 
[skipped 26]

polarsat.log:Jun 15 15:37:58 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175690361 to 175761926 
[skipped 71564]

polarsat.log:Jun 15 15:38:07 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175761926 to 175761990 
[skipped 63]

polarsat.log:Jun 15 15:38:11 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175761990 to 175761992 
[skipped 1]

polarsat.log:Jun 15 15:38:41 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175761992 to 175762207 
[skipped 214]

polarsat.log:Jun 15 15:38:44 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175762207 to 175762269 
[skipped 61]

polarsat.log:Jun 15 15:39:11 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175762269 to 175762624 
[skipped 354]

polarsat.log:Jun 15 15:39:17 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175762624 to 175762626 
[skipped 1]

polarsat.log:Jun 15 15:39:47 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175762626 to 175763154 
[skipped 527]

polarsat.log:Jun 15 15:39:50 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175763154 to 175763349 
[skipped 194]

polarsat.log:Jun 15 15:39:50 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175763349 to 175763351 
[skipped 1]

polarsat.log:Jun 15 15:40:17 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175763351 to 175763945 
[skipped 593]

polarsat.log:Jun 15 15:40:19 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175763945 to 175763947 
[skipped 1]

polarsat.log:Jun 15 15:40:30 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175763947 to 175764006 
[skipped 58]

polarsat.log:Jun 15 15:40:30 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175764006 to 175764037 
[skipped 30]

polarsat.log:Jun 15 15:40:38 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175764037 to 175764251 
[skipped 213]

polarsat.log:Jun 15 15:40:49 sbn1 noaaportIngester[3455]: 
productMaker.c:439:pmStart() Gap in packet sequence: 175764251 to 175764283 
[skipped 31]


... 2200+ lines of polarsat Gap entries not listed and the last few at ~2140Z:


Jun 15 21:39:05 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177780466 to 177780804 [skipped 337]

Jun 15 21:39:18 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177780804 to 177780838 [skipped 33]

Jun 15 21:39:22 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177780838 to 177780976 [skipped 137]

Jun 15 21:39:31 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177780976 to 177781214 [skipped 237]

Jun 15 21:39:31 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177781214 to 177781260 [skipped 45]

Jun 15 21:39:45 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177781260 to 177781537 [skipped 276]

Jun 15 21:39:45 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177781537 to 177781629 [skipped 91]

Jun 15 21:39:52 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177781630 to 177781632 [skipped 1]

Jun 15 21:40:05 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177781632 to 177782030 [skipped 397]

Jun 15 21:40:09 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177782030 to 177783125 [skipped 1094]

Jun 15 21:40:09 sbn1 noaaportIngester[3455]: productMaker.c:439:pmStart() Gap 
in packet sequence: 177783125 to 177783202 [skipped 76]

[ldmcp@sbn1 ~/logs]$ !! |wc

grep Gap po*log | wc

   2254   33810  302849

[ldmcp@sbn1 ~/logs]$



Have you seen Gaps in packet log entries in the past and if so how did you 
mitigate them and what do you suggest to eliminate these extraneous entries?

--
=====================================================================
Email seems to be generating increasing inefficiencies in organizations.  I 
learned from a manager a Stanford Computer Science professor no longer uses 
email for communication, but uses SNAIL mail, telephone calls, and person to 
person visits.  I'm considering the same.

Storm Prediction Center
120 David L. Boren Blvd, Suite 2330
Norman, OK 73072
  • 2020 messages navigation, sorted by:
    1. Thread
    2. Subject
    3. Author
    4. Date
    5. ↑ Table Of Contents
  • Search the ldm-users archives: