Flex Lockup on 1.10.16

  • 8
  • Problem
  • Updated 2 years ago
  • In Progress

Hi,

I had successfully run my Flex 6500 on the previous full release, 1.9.13, for a few months, often leaving it running for 24 hrs without any lockups.

I could not use the first recent Beta release due to numerous lockups with continuous audio tone requiring a long button press to reset.

I updated to 1.10.16 on release and was lockup free until tonight, when I experienced a lockup whilst listening to the radio. Same symptoms: continuous audio tone and long button press required to reset.

Has anybody else had lockup issues on 1.10.16, and does Flex acknowledge that the problem still exists?

Not what I really want for the WPX contest.

Regards Andy M5ZAP

Andy M5ZAP

  • 179 Posts
  • 36 Reply Likes

Posted 2 years ago

kk4x

  • 27 Posts
  • 0 Reply Likes

I have had 2 lockups since upgrading, one about a week ago and one yesterday. Same symptoms: continuous audio tone and long button press required to reset.

73

Ed

kk4x

Jim Ervin

  • 4 Posts
  • 0 Reply Likes
I have been having the same problem with my 6300 since I upgraded.  Hopefully Flex will find a fix for the problem before I sell the 6300.

Steve Gw0gei

  • 193 Posts
  • 50 Reply Likes
I have still not had a lock up since the latest public release. Used in contests on hf and 4m over the last two weeks and radio left on for 24hrs plus several times.

Network set up here is similar to M5ZAP above, with a BT Home Hub 5 router downstairs feeding DHCP via Cat 5 cable into the shack upstairs and then into an old 16 port network switch in the shack. My 6300 is plugged into the switch, as is my new faster W10 tower PC, and this week I have also plugged in an 8x2 Antenna Genius SO2R switch, which arrived and is under test in the shack before being relocated in the cupboard behind the shack wall.

Network performance monitored via ssdr and always shows full green lights and less than 1 ms latency.

Given that I have Dimension 4 running all the time and some people are getting lockups without D4, it looks like that is not the problem?

Could it be an issue with the newer, faster Gb switches, given that I am using a very old (15-20 yr old) 16 port switch? Or of course, given that I work full time and mainly use the radio for week night contests and the odd bigger contest at weekends, maybe I am not using the radio as much as a retired person or an active daily DXer and haven't put enough hours in to trigger an event. My previous lockups were on my old Dell i3 laptop, and the new W10 tower PC now in use for running SSDR in contests has not yet had the issue.

Hopefully everyone feeding in their set up will help in tracking down the cause of the issue.

73
Steve gw0gei / gw9j

Andy M5ZAP

  • 179 Posts
  • 36 Reply Likes

Hi Steve,

I have been running since last weekend without a lock-up, after previously experiencing the lock-ups. If there are contributing factors, either hardware or software, outside of the Flex software platform, then a matrix of setups for users both with and without lockups would help identify the root cause.

All software and hardware across the X axis and Callsign down the Y axis

Done in Excel as a pivot table would be ideal
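
For anyone who would rather script it than use Excel, the matrix described above could be sketched roughly as below. This is only an illustration: the station names and setup items are hypothetical placeholders, not real reports from this thread, and the function names are made up for the example.

```python
# Hypothetical reports: station -> (set of setup items, lockups seen?).
# None of these rows are real data from the thread.
reports = {
    "STATION-A": ({"Windows 10", "Dimension 4", "ATU"}, True),
    "STATION-B": ({"Windows 10", "Gigabit switch"}, True),
    "STATION-C": ({"Windows 10", "Dimension 4"}, False),
}


def build_matrix(reports):
    """Return (columns, rows): setup items across, callsigns down."""
    columns = sorted({item for items, _ in reports.values() for item in items})
    rows = {
        call: [item in items for item in columns]
        for call, (items, _) in reports.items()
    }
    return columns, rows


def common_to_all_lockups(reports):
    """Setup items present at every station that reported a lockup."""
    affected = [items for items, locked in reports.values() if locked]
    return set.intersection(*affected) if affected else set()


if __name__ == "__main__":
    columns, rows = build_matrix(reports)
    print("callsign".ljust(12) + " | ".join(columns))
    for call, flags in rows.items():
        cells = ["x".center(len(c)) if f else " " * len(c)
                 for c, f in zip(columns, flags)]
        print(call.ljust(12) + " | ".join(cells))
    print("Common to all lockup reports:", common_to_all_lockups(reports))
```

An item that shows up for every affected station is only a candidate, not a cause, since it may be just as common among stations without lockups (Windows 10 in the placeholder data, for instance).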




 

Steve Gw0gei

  • 193 Posts
  • 50 Reply Likes
I must have tempted fate - left the radio on for the last 24 hrs and came into the shack this evening to find it had disconnected. Single button push wouldn't close the radio down (no beep present though). I did the long button press and it rebooted ok and ssdr showed it without the need for a power supply shut down on this occasion. The radio had been left on rx only on 80m but with the antenna genius switched off so all antennas were grounded with no signal present. Set up as above and my new w10 tower pc was not in sleep mode. Hey ho, not a major issue at present for me, but clearly there is a problem which appears to be random in nature. I will be leaving the pc and radio on again over the next 24 hrs, without rebooting the pc etc., to see if it happens again.

Jason NR0X

  • 17 Posts
  • 7 Reply Likes
I'm having what appears to be the same lockup problem on my 6700.  It's happened 5 or 6 times in the last 3 weeks or so.  The last one happened sometime during the night when I accidentally left ssdr running.

SW v1.10.16.174
HW v1.10.16.91

I don't use VOX
I don't use Dimension4

happy to provide as much info as I can, if anyone wants it.

Jason NR0X

Dave

  • 97 Posts
  • 13 Reply Likes
Adding my two cents. My radio has locked up twice in receive mode with continuous tone from radio, connection lost to radio, had to do a "hard" reboot of the radio i.e. hold power button in until radio turns off.

It actually ran for quite a while before I started having problems. I started having problems about 4 days ago. The second time was last night, 3/29.

No problem with 1.9.13 software

Radio 6300
SW  v1.10.16.174
HW v1.10.16.91
Only one slice up; mode is CW with break-in on

Radio does recover on reboot. 

Any more info you require?

Dave

Michael Aust

  • 135 Posts
  • 32 Reply Likes
Same thing here, so now I just stay with 1.9.13 on my 6700.

Dave

  • 97 Posts
  • 13 Reply Likes
Just an update: NO CRASHES since 3/29. BTW, Windows 10 32 bit. There have been several Windows updates since my last crash. Maybe a Windows issue?

Steve Gw0gei

  • 193 Posts
  • 50 Reply Likes
Just had another lock up of my 6300 - this time unfortunately in the last fifteen minutes of a domestic 80m contest :-(  

I was running for the previous 150 mins on the same freq on 80m - using footswitch for ptt into the radio as usual - and mouse clicking on the voice recorder button to send my cq contest and call. Around 5 mins prior to the radio crash I noticed that my audio was breaking up (monitor is always on here with Heil headset) and then sure enough the radio stopped its connection. I was monitoring the network indicator and it never moved off the full green bar score (am on cat 5 lan here in the shack).

I closed ssdr down and opened it up again, but no radio showing. I pushed the off button and no response. I did the long press and it went off, and then I pressed it on again (without power supply shut down) and the radio showed up again and resumed ok with persistence to my original 80m run freq. Luckily it's a slow contest and I didn't lose my run freq.

The radio had been on for around 48 hours as it's a weekend and I usually leave it on. No issues with the pc, which is a few weeks old fast i5 gaming machine with NVidia 10 graphics card and running w10, so I don't think it's a pc memory issue. I was using n1mmplus as usual, latest (today) version as upgraded just before the contest. Had no issues with n1mm and it just reconnected to the 6300 when it came back online.

I guess the short term lesson for me is I should reboot the radio and remove the power supply as part of my pre contest hour set up routine, to see if that avoids the problem occurring during a contest and losing me valuable points.

I will be happy to provide any more required info and/or to try any newer version in test mode. The sooner this gets sorted the better as I need a reliable radio for contesting.

73

Steve gw0gei / gw9j

Steve Gw0gei

  • 193 Posts
  • 50 Reply Likes

Further to my report above, the only thing to note (and it may be unrelated) is that I was also experiencing an intermittent swr indication on the flex - kept going end of scale swr and back again when running 100w barefoot - when running with the tl922 amp switched in the swr problem seemed to disappear. Checking the 80m antenna this afternoon I couldn't find any problem and it is working fine now barefoot or via the amp. This may be a totally unrelated issue and could have been an owl or something sitting on the antenna (HI) as it was dark.

I left the 6300 running all night after the post crash reboot and it has remained ok all day today on 80m net this morning and on rx rest of the day. I have another domestic 80m contest this evening (cw this time) so I will leave the radio running still and won't reboot it prior to the contest so that it will have been running 24hrs plus by the end of the contest, given that I am just giving away points in this contest and it won't be a major issue if I get a crash mid contest. I run n1mm+ in so2r mode with intermittent use of the second slice on my 6300 for moving up and down rbn spots on my second bandmap when my run rate goes slow.

73

Steve gw0gei / gw9j

Al / NN4ZZ

  • 1852 Posts
  • 672 Reply Likes
Hi Steve,
Do you have the ATU in your 6300?  And was it engaged when the lockup occurred?

I've had a few lockups since 1.10.16 here and they seem related to the ATU.  I normally don't use the ATU but wanted to see if this is a valid observation before I post the details.

Regards, Al / NN4ZZ  
al (at) nn4zz (dot) com
SSDR / DAX / CAT/ 6700 -  V 1.10.16
Win10

  
Steve Jones

  • 104 Posts
  • 23 Reply Likes
Hi Al
No atu in my 6300, as all resonant antennas here.
73
Steve

Al / NN4ZZ

  • 1852 Posts
  • 672 Reply Likes
Hi Steve,
Thanks. It may just be a coincidence then that the lockups I've seen only happen for me when the ATU is engaged. If anyone here or at FRS would like the details of the tests I did, let me know and I'll post them.

Most of my antennas are resonant or close to it and I also have an external Xmatch antenna tuner so I rarely use the ATU in the radio.

Regards, Al / NN4ZZ

Wayne VK4ACN

  • 150 Posts
  • 22 Reply Likes
I've had a few lockups in the last few days but I don't use the ATU. Feeding directly into an SPE amp.

Rudy HB9MHB

  • 12 Posts
  • 0 Reply Likes

Hi,

I have had my Flex 6700 (from a colleague) for about six weeks, running V 1.10.16, and at least twice a week I lose the connection, normally during the night when I am not active. Flex and computer (i7/6700, Win10) are on fixed IP. I reset with a long button press and by removing power. Sometimes when I try to connect with my iPhone (Smart SDR App. V. A1.2.8:0.50, Database D1.14 - 27.2.2017) I can hear a click from a relay and then the Flex locks up.

Rudy HB9MHB

Steve Jones

  • 104 Posts
  • 23 Reply Likes
Just finished another hour long 80m ssb domestic contest this evening and no problems this time. Radio and pc have been on since the lock up on Sunday evening, so it is now running on its third 24hr period without crash or reboot. I haven't heavily used the radio during the three days but it's been on rx during the day and I have done an 80m cw contest of 90 mins on Monday evening and 60 mins ssb this evening, with some 80m morning ssb nets on two of the mornings. Maybe the trigger is time spent in tx rather than the length of time switched on? Dimension 4 still running all the time and I manually synced the time ten minutes before the contest start this evening.

73 Steve gw0gei / gw9j

NM1W

  • 136 Posts
  • 24 Reply Likes
I've had lockups before .16; since upgrading to .16 I can get at least one lockup a week; I've had 3 in the past 3 days... Once mid qso (cw at 5w);  between dax corrupting when using digital modes and the random lockups I'm not feeling much love....

Steve Gw0gei

  • 193 Posts
  • 50 Reply Likes
Hopefully a more stable version will be released soon. My 6300 has been on without a crash since last Sunday evening so I am leaving it on to see how long it lasts. A version with the performance improvement Gerald referred to, once it's been alpha tested, will be something to look forward to.

Kevin - KS0CW

  • 90 Posts
  • 11 Reply Likes
I left mine on for weeks without a crash... went back to shutting down nightly & powering down the 12 VDC source without change... then I got 3 crashes out of the blue within a 60 minute window after powering back up... It appears it stabilized after I loaded my saved config after the third crash. I noticed I was getting some scrambled representation of what appeared to be one of my configs, with settings that didn't fit the intended config it resembled.

Kevin - KS0CW

  • 90 Posts
  • 11 Reply Likes
I had 1 lockup weeks ago then three yesterday inside of  a 60 minute window... 1  was in the middle of a jt65 qso with radio being idle for the other two... Only one of the three produced the continuous tone the others did not...

NM1W

  • 136 Posts
  • 24 Reply Likes
I just got another lockup after the rig was on for an hour; was just listening to 40m cw...  4 in 4 days now. 

Tony C kc2dis

  • 79 Posts
  • 5 Reply Likes
Still on the beta version here. Afraid to upgrade because of all the problems 

David Warnberg

  • 692 Posts
  • 91 Reply Likes
For the first time I too experienced a lockup this morning. I simply started the radio like I always do, tuned to a local repeater utilizing a 2m transverter and was simply listening, no transmission, nothing, just listening.. suddenly the SmartSDR software stopped responding, the radio emitted a continuous tone and I could do nothing... the only recovery was to hold the power button on the front of the radio until power off.

I have since recovered everything and am listening once again.. will update if this happens again... radio and computer were freshly started this morning and the lockup occurred approximately 2 hours after startup.

Running version:
Windows 10 pro
HW version:  v1.10.16.91
SW version:  v1.10.16.174

Thanks

David Warnberg

  • 692 Posts
  • 91 Reply Likes
FYI... I see some updates as to "Network potential issues".  My Computer and Radio are on the same gigabit Netgear Switch.. about as close to directly connected to each other as possible... there is another PC on the same switch that I work from, and it has never experienced a lockup or loss of connection through the switch.

So in my case I would not say this was network related and it has only happened the one time.

Just an update

Thanks

Tim - W4TME, Customer Experience Manager

  • 9186 Posts
  • 3548 Reply Likes
We have had reports that KB4015217 is causing network packet loss.

David Warnberg

  • 692 Posts
  • 91 Reply Likes
I am running Windows 10 Version 1607, OS build 14393.1066.... and KB4015217 was installed on 4/11/2017... so the timing is about right Tim..

4/11/2017 install of KB and lockup occurred 4/12/2017...  Nothing since though

Norm - W7CK

  • 757 Posts
  • 163 Reply Likes
I hope some folks find this interesting reading.....

I have a single desktop computer that I built back in 2014 which is running Windows 10 Professional.  It has KB4015583 installed and I am having NO problems with SmartSDR v1.9.13.173.   I didn't want to run the Beta or the most recent release due to the COM port "in use" issue.   I didn't want to deal with a work-around or take the chance of any system corruption and have decided to wait for an actual fix from the folks who write and provide the FlexVSP driver software.  This might sound silly but that's me....

It is really remarkable what this desktop machine is actually running.  Here's a run down.

This is an i7 with 16 GB of RAM, Samsung SSD for OS and apps. Logitech wireless keyboard and mouse. There is a single 1 TB internal drive for video/photo storage and processing and 2 external USB3 drives for backups and video surveillance processing and storage.  This single machine is very rarely rebooted and even more rarely turned off.  Hasn't been turned off in over 3 months.  Rebooted for updates only. This single machine is running a lot of software including:  iSpy security camera surveillance with 4 IP cameras (motion detection done via iSpy on the computer), Remote Hams RCForb to an Icom IC-7100 that is available for remote HF/VHF/UHF operations to 60+ people via the Internet, Cobian Backup, CloudStation real-time synchronisation of various shared drives to a private Synology Diskstation cloud, SmartCAT, DAX, SmartSDR v1.9.13.173, DDUtil (control of Flex, Elecraft KPA500 and KAT500 via USB), FRStack for squelch and 2m scanning of my Flex 6700, HRDLogbook, Digital Master, HRD Sat Track, Malwarebytes (premium), SDRBridge / CWSkimmer, SoftEther VPN, Teamviewer when I am traveling, MS Office mostly using Word and Excel, and a few other programs that just aren't popping into my head at the moment.

A precaution I've taken is to NEVER EVER use a browser on my Windows 10 machine to browse the Internet.  Instead, I use Oracle VirtualBox (Virtual Machine Hosting) where I have Ubuntu 64 running.  This is my sandbox for all Internet browsing via FireFox. This is how I also check my email.  Web browsers within Windows 10 are not allowed. I do not install any applications from the Internet unless absolutely necessary and well supported.  Within VirtualBox I also have a copy of Windows XP installed where I can install junk applications and not take the risk of corrupting my primary OS.  Though I very rarely ever use the XP VM.

That is a boat load of applications that are installed.  Many of them are running all of the time and others are run quite frequently.  I do NOT have a separate GPU (Graphics Card) installed; instead, I have decided to use the built in GPU of the i7.   I am able to run 8 panadapters and 8 slices with waterfalls and view 4 IP cameras along with any other software I want all at the same time and without issues. The i7 GPU is driving 2 HD monitors.  I'd love to go with SHD but haven't made the leap yet.  I was running an Asus AMD HD-7870 GPU but it just put out too much heat. I pulled it out and started using the onboard GPU and realized it performed just fine, so I sold the 7870.

Differential backups are done once a week to an external device.  Once a month or prior to any software updates or installs, I run the Windows 10 backup utility and make a System Image.  I catalogue these and am able to restore a system image and recover from any corruption or catastrophic failure in 15-20 minutes.  I highly recommend anyone running Windows to make system images of their stable system, name them appropriately and actually practice doing an image restore.

My network is wired Ethernet on POE Gb switches with fiber via GBIC to a small switch for the radio room and my computer.  This isolates the Internet router, wifi router and POE security camera network from my shack in case of near hit lightning strike.  I had this happen once and lost a POE switch, wifi router and some other gear in the utility room.  Luckily it stopped there and didn't mess up my computer, Flex or any other gear. 

Wireless for the laptops and the MAESTRO is via a very old Cisco WRT54G router running customized Tomato software.  I have never had a problem running my Maestro or laptops on this wifi router, and I live in a fairly dense neighbourhood with several other routers on the 2.4 GHz band, although I have picked a channel that no one else is currently using.  My wife is constantly streaming Pandora or some movie off of Netflix.

I leave my 6700 running most of the time.  The last time it was rebooted was between 5-6 days ago (OS Update).  6700 typically runs for 10+ days between voluntary reboots.  I have not experienced any lock-ups, no issues with dropped packets or anything else. 

It's amazing what these computers will run these days.

So, maybe it is a combination of the new Windows update and the latest release of SmartSDR.  Works fine on the older release!

Hope I didn't bore anyone.....

David Warnberg

  • 692 Posts
  • 91 Reply Likes
UPDATE... this morning I experienced another lockup however I did some testing..  Computer frozen but still had audio from Radio... I was browsing the net while listening (Google Chrome browser) when the lockup occurred.

I then took my tablet, opened the iOS SmartSDR client and forced a connection to the radio. The radio was still working fine and sound transferred to the tablet without a hiccup.  The computer was still locked up; after a forced reboot the computer performed sluggishly.  I then proceeded to remove update KB4015217; with this uninstall a reboot is required.  The computer is now working much better and connected back to the radio without ever shutting down the Flex 6500.

Will continue testing

NOTE (This PC would fail to update to the latest version of Windows, the new Creators update)

David

Eric - KE5DTO, Official Rep

  • 880 Posts
  • 323 Reply Likes
How many of you that have experienced this problem have one or more XVTRs configured?

NM1W

  • 136 Posts
  • 24 Reply Likes
no transverter here.

David Warnberg

  • 692 Posts
  • 91 Reply Likes
Eric, I have 2 transverters configured, 70cm and 2 meter, both Elecraft..

Rob Fissel

  • 270 Posts
  • 48 Reply Likes
Not sure if this is relevant to your question, Eric, but I experience this issue using both ANT1 as well as XVTR for antenna input (magloop RX antenna for HF).

Dave - WB5NHL

  • 285 Posts
  • 64 Reply Likes
no xvtr here

Wayne VK4ACN

  • 150 Posts
  • 22 Reply Likes
No xvtr here
I do use dim 4
I don't use vox
I was using maestro wired to lan at time
Happens when on rx not tx, so far

Walt - KZ1F

  • 3040 Posts
  • 645 Reply Likes
If this just started happening with the GA of 1.10.16 and did not happen prior to the beta of 1.10, what were the diffs? I caught someone was on a Maestro so that might well eliminate SSDRfW.

How much volatility occurred in the radio SSDR over the last several releases?

Just in case, you know, it is a place not looked at yet. It's been my experience problems usually show up in the very last place ya look.

 The 2.x work is on a separate branch, right?

Steve Jones

  • 104 Posts
  • 23 Reply Likes
I have a 4m Kuhne TR70H tvtr in line here for 4m, and one of my crashes was on rx only on 4m wsjt version 10. However, the latest crashes have been on antenna 1 to 80m dipole. When not in use the 4m tvtr is always switched off and power removed from it.

My 6300 has been on for over a week and a half now constant with no crash. I will probably power the pc and 6300 down tonight or tomorrow and then see how it fares after that.  There is a domestic 80m ssb contest here tomorrow night and a 6m contest on Thursday night which I may play in.

Need to sort this reliability issue out before I commit to my planned upgrade to a 6700 for next winter contest season. Sorting out some rx antennas in the meantime as the weather has improved :-)
73 Steve gw0gei / gw9j  

KA9CFD

  • 19 Posts
  • 0 Reply Likes
I have had occasional lock ups and drop outs with my Flex 6500 and SmartSDR v1.10.16.174. I am beginning to think the problem is in my router. I have my computer and the Flex hooked up to 2 ethernet ports on the back of my Netgear R6300v2 router. I would like to know what others are using for a router, and if there are some settings in the router that should be looked at. de ka9cfd

Dave

  • 97 Posts
  • 13 Reply Likes
Mine is directly connected to my computer's network connector. I use a USB wireless adapter for my internet connection.

KA9CFD

  • 19 Posts
  • 0 Reply Likes
I have changed my set up and now have the radio connected directly to the ethernet port on the back of the computer. Then using wifi from the computer to connect to the router. I will have to watch and see if there are any more disconnects. I am thinking about getting a faster wifi router to replace the Netgear R6300v2.

Bill W2PKY

  • 528 Posts
  • 87 Reply Likes
Perhaps using a 5 port switch to connect the radio, computer and router together might be more stable, rather than using the router as the switch??

Rick Hadley - W0FG

  • 600 Posts
  • 130 Reply Likes
I've had occasional random lockups since V1.1, so it's nothing new.  The ones with the long tone have happened with the last three or four releases.  The others are random connection drops, and more frequently a total lockup of the computer running SSDR and this laptop, so I'm sure the latter are router related.  It used to be just the main computer that got them, but now somehow, those have been more frequent on the non-radio laptop and only once in a while on the main box.  Saturday morning I came down to the shack and the big box had tried to do the massive new W10 upgrade, which failed, and the radio had shut off.  I tried to turn it back on and it appeared that my power supply had failed simultaneously, but further checking showed that something in the 6500 main power bus had failed, causing any power supply to crowbar off.  Probably coincidence, but the 6500 is now on the way back to Austin for repair.  So if you hear W0FG, it's just my little 4-watt HK1A QRP rig until the 6500 comes home.

Steve K9ZW, Elmer

  • 1525 Posts
  • 762 Reply Likes

Main Station has been on for over three months solid.

There seems to be some merit in getting everything not needed to run your Flex-6000 Station off of your PC.

Win-10, i7 DXer built by Neal just completed yesterday 100 days running without a blip or hiccup, including switching back and forth from SmartSDR for Windows, SmartSDR for iOS and Maestro.

I did not have this sort of stability with the Win7 machine the purpose built PC replaced.

Pleased that the video driver issues that disappeared when I swapped from the original card Neal was using have not been an issue (they caused SmartSDR to puke irregularly but frequently.)

73

Steve K9ZW

Rick Hadley - W0FG

  • 600 Posts
  • 130 Reply Likes
I know that part of my problem is with the Nvidia GT610 video card that is one of two in my main box.  I really should replace it as it's been troublesome since day one, both in Win7 and now Win10.

Frédéric Furrer

  • 1 Post
  • 0 Reply Likes
The lockups here started with the 1.10.15 beta. By lockups I mean that only a long power button press brings the 6700 back to life. On the beta I had it almost once/day. Now with the 1.10.16 version it is less frequent, but I cannot really tell you how often because I do not run my 6700 24/7. The problem is identical on two different computers, one brand new, one about 4 years old. Both are running Windows 10. Software updates on Windows 10 do not seem to make a difference. Important: I DID NOT HAVE LOCKUPS with the pre-beta 1.9.xx version. My router is a Netgear R 7500 (may have a different name in the U.S.). I am eagerly awaiting a solution for this, because it really spoils all my remote activities. Thanks a lot and 73, Frederic, HB9CQK

James Skala

  • 103 Posts
  • 24 Reply Likes
Flex 6300 I also get lockups on 1.10.16.  I have a ticket open with Flex Support.  They are having me run without any peripheral software or USB cables.  We will see what happens

P.S. glad I am not alone.

KY4JLS

Tim - W4TME, Customer Experience Manager

  • 9186 Posts
  • 3548 Reply Likes
Thanks to everyone who has been providing data related to this issue.  We appreciate the feedback.

I need to clarify something definitively that will help us a lot in analyzing these types of issues.  Please read the following carefully.

When the radio and client (SmartSDR and Maestro) are properly communicating, they exchange a "keepalive" packet to ensure the connection is established.  If either the client or the radio does not hear from the other within a prescribed period of time (15s), then the client and the radio will gracefully close the connection.

For example, if there is a network issue that occurs, but the PC and radio are operating normally, both the radio and the client will properly close the IP connection between them after the prescribed period of time.  In this case, the client pops up a message that there has been a disconnect.  The client does not know the nature of the disconnect, only that it can no longer communicate with the radio.  If the network communication issue resolves itself, then you can simply restart SmartSDR and the radio should appear in the chooser and you can reconnect. This is NOT a lockup or a crash, but a simple communications failure since there was no other mitigating action needed other than restarting SmartSDR.  We do not need these reports.
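
As a rough illustration of the keepalive timeout described above, here is a minimal sketch. It is not FlexRadio's implementation: the class and method names are invented for the example, and only the 15 s figure comes from the post.

```python
import time

KEEPALIVE_TIMEOUT_S = 15.0  # the "prescribed period" quoted in the post


class KeepaliveMonitor:
    """Hypothetical helper: tracks when the other side was last heard from."""

    def __init__(self, timeout_s=KEEPALIVE_TIMEOUT_S, clock=time.monotonic):
        self._timeout_s = timeout_s
        self._clock = clock          # injectable so the logic can be tested
        self._last_heard = clock()

    def packet_received(self):
        """Call whenever a keepalive packet arrives from the peer."""
        self._last_heard = self._clock()

    def should_disconnect(self):
        """True once the peer has been silent for longer than the timeout."""
        return self._clock() - self._last_heard > self._timeout_s
```

Each side would run the same check independently, which is why a plain network dropout ends with both the radio and the client closing the connection cleanly rather than with a lockup.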

Now, if the radio experiences an event where the firmware or other internal software process experiences a serious issue where the radio stops communicating, then as before, the client will display a message that there has been a disconnect.  And as before, the client does not know why.  If you restart SmartSDR and the radio does not show up in the chooser after several seconds, then there are two possibilities: you have a continuing network issue or the radio has experienced a firmware failure of some sort.  If your PC can communicate with other devices on the same network as the radio, then you can for all intents and purposes rule out a network issue.  To rule out anything related to the PC, you should shut it down and reboot.  If after the reboot you can connect to the radio, then the issue has something to do with your PC environment.  If neither of these actions results in reconnecting with the radio, then do a normal power off on the radio by pressing and releasing the power button.  Give it a good 15-30 seconds to respond. If it does not power down normally, then press and hold the power button on the radio until it powers down.  This indicates that the SmartSDR firmware did not receive the control signal to shut down and we had to instruct the radio to shut down Linux using the long power button press and hold.  This behavior IS a radio lock up or crash and we do want these reports.  We would like to know what you were doing at the time of the crash, how long the radio had been powered up and any other information of anomalies you observed just prior to the radio disconnecting from the client.
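
The checks above can be written down as a small decision function, which may help when deciding whether an event is worth reporting. This is a sketch only; the names are hypothetical, and the last branch (radio powers off normally but was unreachable) is not classified in the post, so it is marked as such rather than guessed.

```python
from enum import Enum


class Verdict(Enum):
    COMMS_FAILURE = "communications failure; no report needed"
    NETWORK_ISSUE = "continuing network issue likely"
    PC_ENVIRONMENT = "something in the PC environment"
    RADIO_LOCKUP = "radio lockup or crash; please report"
    UNCLASSIFIED = "not covered by the post"


def classify_disconnect(radio_in_chooser_after_restart,
                        other_devices_reachable,
                        connects_after_pc_reboot,
                        normal_power_off_works):
    """Apply the checks in the order the post gives them."""
    if radio_in_chooser_after_restart:
        return Verdict.COMMS_FAILURE      # restarting SmartSDR was enough
    if not other_devices_reachable:
        return Verdict.NETWORK_ISSUE      # PC cannot see the radio's network
    if connects_after_pc_reboot:
        return Verdict.PC_ENVIRONMENT     # a PC reboot fixed it
    if not normal_power_off_works:
        return Verdict.RADIO_LOCKUP       # long press and hold was needed
    return Verdict.UNCLASSIFIED           # assumption: post does not say
```

For example, `classify_disconnect(False, True, False, False)` corresponds to the "long button press required" case reported throughout this thread.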

If the SmartSDR client crashes, in just about all cases there will be an unhandled exception error displayed on the screen.  We always want these errors and we need the error detail to properly debug why the SmartSDR application has crashed.  A screen capture of the error details is sufficient along with a description of what you were doing just prior to the crash.  I'd prefer you open a HelpDesk ticket to provide us this information on the outside chance that I or another FlexRadio employee happens to miss your report.

ka7gzr

  • 218 Posts
  • 36 Reply Likes

Tim

Have you considered debug software to capture critical items just prior to the lockup?

Jim

Tim - W4TME, Customer Experience Manager

  • 9186 Posts
  • 3548 Reply Likes
Yes, and we can do application level and console logging, but if the issue crashes the process running the logging facility before the log can be written, then logging is moot.  This is what has been happening while debugging this particular issue.  The only way to capture it is by using a serial console connection, which is not feasible for in-the-field debugging.

ka7gzr

  • 218 Posts
  • 36 Reply Likes
The other thing that I have found useful is to bring in a third party (or parties) to review the issues and the code. Sometimes it is useful to get a fresh set of eyes on it.

Dewey

  • 24 Posts
  • 11 Reply Likes
Tim, I've had 2 lock-ups in the last 3 weeks (had not previously occurred, and I can't associate any changes made with the lockups). No PC involved, as I run Maestro and 6700 both connected to a 100 Mbps switch (never seen a packet dropout registered on the Maestro menu screen and I check it regularly). Maestro reboot doesn't fix the lock-up, but it is operational after a 6700 reboot. Both times occurred with the units in idle mode, for a few minutes in one case and about an hour in the other. I operate 90% CW and 10% phone, no digital modes.

James Skala

  • 103 Posts
  • 24 Reply Likes
Tim I would be willing to work with you on the serial debugging

Jason NR0X

  • 17 Posts
  • 7 Reply Likes
So what is now turning out to be just as funny is how the problem just stopped happening.  I've had the radio on for over 3 weeks straight, and haven't had a single problem, where it was sometimes failing twice a day before.  I'm not sure what I did that may have had any effect on this, but I may have reset to the default profile during that time.  Something to try anyways.

Tim - W4TME, Customer Experience Manager

  • 9186 Posts
  • 3548 Reply Likes
We REALLY hate that when it happens.  It makes finding root cause even more challenging.

Norm - W7CK

  • 757 Posts
  • 163 Reply Likes
Tim,

Same here.  I have not been able to duplicate a crash.  If you look back through my posts, I was able to take screen prints of the memory usage and sort of document what I thought was happening.  Now, all seems to be working fine again!

Now when I open 8 slices and make a bunch of band changes the memory usage continues to climb while I'm making the changes.  Once I stop making band changes, some of the memory does seem to get released and usage stabilises.  When I was experiencing the crashes, the memory usage would continue increasing even after I stopped making changes and would eventually crash.

I'm guessing there was some sort of update from MS that conflicted with SSDR for a short period of time and MS must have made another change and problem resolved?  Who knows?  

Like you said, this makes it really challenging finding the root cause!

Norm

James Skala

  • 103 Posts
  • 24 Reply Likes
Just curious, is anyone else seeing the increase in memory usage on the smartsdr process when performing band changes?  Note the memory on the smartsdr process, run through the bands twice, and then note the memory on the smartsdr process again.
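
If you want to compare readings rather than eyeball them, something like the sketch below would do. It assumes you type in memory figures (in MB) noted by hand from Task Manager; the function names and the 1 MB tolerance are arbitrary choices for illustration, not anything from SmartSDR.

```python
def growth_mb(before_mb, after_mb):
    """Change in process memory between two readings, in MB."""
    return round(after_mb - before_mb, 1)


def still_climbing(idle_samples_mb, tolerance_mb=1.0):
    """True if every reading taken while idle is higher than the one before.

    Needs at least two readings; tolerance_mb ignores small jitter.
    """
    if len(idle_samples_mb) < 2:
        return False
    pairs = zip(idle_samples_mb, idle_samples_mb[1:])
    return all(later - earlier > tolerance_mb for earlier, later in pairs)


if __name__ == "__main__":
    # Readings quoted further down this thread: before closing both
    # slices, after closing them, and 15 minutes later.
    thread_readings = [571.8, 567.3, 542.0]
    print(growth_mb(thread_readings[0], thread_readings[-1]))
    print(still_climbing(thread_readings))
```

A falling series like that one would not trip the "still climbing" check on its own; judging whether memory is being retained also needs the reading from before the band changes, which is the number to note first.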

James Skala

  • 103 Posts
  • 24 Reply Likes
after closing both slices

James Skala

  • 103 Posts
  • 24 Reply Likes
sorry other way around.  567.3 after and 571.8 before closing both slices

James Skala

  • 103 Posts
  • 24 Reply Likes
in 15 mins I will send another image

Eric - KE5DTO, Official Rep

  • 880 Posts
  • 323 Reply Likes
Interesting.  Well, that definitely looks like some kind of a leak.  We will look into this.

James Skala

  • 103 Posts
  • 24 Reply Likes
After 15 min and both slices closed, it is still at 542meg
