.
To get more detailed infomation send email to:
vxworks_archive@ncar.ucar.edu
The message body must read:
send index
send index from vx
send index from unix
A summary of the archives is periodically posted to comp.os.vxworks.
Some of the usual titles available:
ansi, ansilib, benchmarks, bitcnt, c++builtin, c++headers, camaclib, cbench
cntsem_class, crc, deadman, dhrystones, dirlib, dt1451, fcompress
flags_class, force, gcc+68040, getdate, hkv30extintutil, ivecalloc, joblib2
lclflag, libX11, loadmeter, math, monitor, msgque_class, ntpvx, ping, pipe
poolLib, ring, semCnt, ss1, stevie, string, syslog, task_class, taskmon, tod
tp41, ty335, veclist, vtape, vwcurses, vx_cplusplus, vxrsh, wdog_class, shar
vxtool, vxview, xc
6. What application notes are available from Wind River?
|
List of Wind Technical Notes
Motorola MV147 Slave Base Control 9-1
System hang during lkup( ) output 10-1
Reserving Memory 11-1
Serial IO Vanishing 13-1
NFS: problems with writing files 14-1
Which interrupts does VxWorks use? 15-1
Debugging ftp problems 18-1
Interrupt handlers, floating point 19-1
Booting From a Memory Board 22-1
Changing Network Interfaces 23-1
Writing Non-buffered Sockets 24-1
RPC and VxWorks 25-1
Using SCSI devices with VxWorks 5.x 26-1
The Select Facility in VxWorks 28-1
Using on-board Serial Ports 29-1
Problems with ls() 30-1
SCSI 31-1
Trouble Shooting Booting Problems 32-1
7. How can I join the VxWorks user's group?
|
For User Group info contact WRS or Eric Rabinowitz of Panoramic Systems at:
THIS INFORMATION is OBSOLETE
elr@netcom.com or ericr@wrs.com
phone: 408-289-7757
8. Is comp.os.vxworks also available via a mailing list?
|
Lawrence Berkeley Labs maintains an automated mailing list which is
bi-directionally gatewayed to comp.os.vxworks
It is called the 'VxWorks Exploder'.
Mail to vxwexplo@lbl.gov is automatically mailed to a number
of sites, including Wind River.
Send subscription request to vxwexplo-request@lbl.gov.
9. Can I use gcc or g++ with VxWorks?
|
WRS's gcc GNU Toolkit distribution can be reshipped in its entirety.
WRS charges are for media and support, so obviously copies thereof
don't matter to them.
Lots of customers use g++ as provided by the "net" --
see the VxWorks Archive information below.
You can get a generic freely distributable GCC/G++ and compile your
code for VxWorks. Or you can just get a copy from someone who has a
working GCC cross-development setup from WRS.
WRS worked with Cygnus to polish up their release of GCC but a generic
GCC distribution works just fine as well.
For MC68K targets:
- You can also use native SunOS 4.X cc running on Sun-3's.
- You can also use various other free m68k C compilers
such as lcc, sozobon, etc. provided that you're willing
to hack on them.
Richard Neitzel (thor@thor.atd.ucar.edu) writes,
Thanks to some feedback I've corrected the archive instructions on how to
build gcc, libgcc and libg++ for VxWorks. To make my life simpler the
patches referenced are no longer included in the vx_cplusplus file. Instead
there are now seperate patch files for the effected parts:
libg++-2.5-src.patch: Patches libg++/src.
libgcc2-2.5.0.patch: Patch libgccc2.c for gcc-2.5.0.
libgcc2-2.5.2.patch: Patch libgcc2.c for gcc-2.5.2.
libio-2.5.patch: Patch the stream library.
See VxWorks Archive information in this FAQ for details on how to
get these files.
10. Other C/C++ Compiler tools for VxWorks?
|
WRS re-sells (re-engineered?) CenterLine cfront product and WindC++
Gateway for CenterLine ObjectCenter. They come with browsers, etc.
Not free.
Wind C++ Gateway for ObjectCenter sold by WRS:
$995 / user 1-4 copies
$875 / user 5-9 copies
Note that this is the cost over and above ObjectCenter.
11. Which cross-debuggers can I use with VxWorks?
|
GDB & other more expensive tools from WRS, MicroTec Research, etc.
WRS sells a lightly modified version of xxgdb which has a lousy GUI
interface. In 5.1 xvxgdb may have been slightly improved -- but the
ObjectCenter C++ with VxWorks solution provides better GUI interface.
With a little bit of hacking, regular GDB works just fine. Personally,
I find GUI to a debugger gets in the way of real work. I use GNU Emacs
GDB interface which works well and can be easily customized.
There might be some old VxWorks customers that also use VxWorks-aware
dbxtool on Sun machines. This used to be maintained and sold by SUN
Consulting.
If you're only interested in debugging your "application" on VxWorks,
the vxgdb approach (using RPC) works just fine.
If you are rather more interested in the guts of the system as well as
your application you might want to spend some time building
cross-debugging tools from generic GDB distribution into VxWorks.
12. What are differences between traditional UNIX and VxWorks?
|
They're both hacks. UNIX has a larger installed base. :-)
Seriously though, similarities end there, IMHO. VxWorks does have a
lot of UNIX "compatible" routines in the user libraries. So porting a
UNIX application is not that hard. But there are enough differences to
make such a port take longer than normally expected.
VxWorks runs in one mode. No protected vs. user mode switching is
done. Running in supervisor mode on most processors, and not using
traps for system calls, VxWorks can achieve minimal overhead on a given
piece of hardware than UNIX. Programming on VxWorks can be more tricky
than UNIX for the same reason.
UNIX provides resource reclamation; by default, VxWorks does not. [
editorial: using deleteHooks or whatever, you could implement this on
your own.] Instead programmers write what they need as needed. As a
result, the context switch time in VxWorks is on the order of a few
micro-seconds (since there is a lot smaller context to save and
restore). VxWorks does not have full "process"; it only has tasks, or
"threads", or light weight processes as some people like to call them.
Like any other multi-threaded environments (or MP environments), care
should be taken when writing multi-tasking code. Each routine should
be written carefully to be re-entrant (if it is going to be called from
multiple contexts), semaphores are used a lot for this. And static
variables are frowned upon. Sometimes, when porting a UNIX
application, you may need to add "task variables" for this reason (as
done for rpcLib in VxWorks).
VxWorks: minimal interrupt latency (e.g. spl's are quasi-implemented
as semaphores). Traditional UNIX: high interrupt latency (e.g. spl's
are implemented as interrupt lock and unlock calls).
VxWorks: priority interrupt-driven preemption, optional round-robin
time-slicing. Traditional UNIX: prioritized round-robin preemptive
time-slicing. Since VxWorks is just a glorified "program" it can be
changed and customized pretty easily. Task scheduling can be
customized as desired, for example.
VxWorks networking code, however, is very UNIX compatible [editorial:
it is essentially "ported" version of BSD UNIX TCP/IP code -- tahoe
release ]. It is relatively easy to port socket based code to
VxWorks. [ editorial: except the not-so-compatible hostLib routines,
etc.]
VxWorks most definitely is not a "realtime UNIX", or a varient of UNIX
as often misunderstood by some people. The confusion perhaps is due to
the fact that UNIX hosts are used most widely to develop applications
for VxWorks (and VxWorks itself).
There are a lot more differences! In short, UNIX is a nice system to
run emacs on. VxWorks is much better at playing pin-ball game
machines.
Having said all this, I should also note that there are Realtime capable
UNIX systems out there. Most of these systems do not come close to
the capabilities and performance of VxWorks in realtime processing.
An exception to this might be QNX, which is a very well designed
Realtime POSIX operating system.
13. What are the performance/benchmark numbers for WIND kernel?
|
A WRS VxWorks 5.1 Benchmark Report hot off the press:
Benchmark numbers based on: mv167-25Mhz, 5.1
Cache Cache
Key Measurements Enabled Disabled
Raw Context Switch Time 4 us 14 us
Cyclic Test Time 172 us 638 us
Suspend/Switch/Resume/Switch 23 us 86 us
Kernel Timings
Task Related
taskSpawn 124 us 370 us
taskInit 58 us 181 us
taskActivate 12 us 33 us
taskDelete 101 us 303 us
Task Create / Delete 249 us 684 us
taskLock
CASE 1: no lock 3 us 4 us
CASE 2: lock exists 2 us 5 us
taskUnlock
CASE 1: no lock 2 us 12 us
CASE 2: lock exists 5 us 6 us
taskSuspend
CASE 1: ready task 11 us 30 us
CASE 2: pended task 9 us 19 us
CASE 3: suspended task 8 us 19 us
CASE 4: delayed task 9 us 19 us
taskResume
CASE 1: ready task 6 us 19 us
CASE 2: pended task 10 us 19 us
CASE 3: suspended task 13 us 30 us
CASE 4: delayed task 9 us 18 us
Semaphore Related
semBCreate 66 us 152 us
semCCreate 46 us 150 us
semMCreate 45 us 139 us
semDelete
Binary 49 us 157 us
Counting 49 us 163 us
Mutual Exclusion 48 us 157 us
semGive
CASE 1: tasks in queue
Binary 18 us 44 us
Counting 20 us 46 us
Mutual Exclusion 25 us 59 us
CASE 2: no tasks in queue
Binary 4 us 8 us
Counting 5 us 11 us
Mutual Exclusion 6 us 15 us
semTake
CASE 1: semaphore available
Binary 4 us 9 us
Counting 5 us 11 us
Mutual Exclusion 5 us 13 us
CASE 2: semaphore unavailable
Binary 10 us 25 us
Counting 11 us 27 us
Mutual Exclusion 4 us 12 us
Semaphore Give / Take
Binary 7 us 15 us
Counting 9 us 21 us
Mutual Exclusion 10 us 26 us
semFlush
Binary 11 us 20 us
Counting 11 us 20 us
Mutual Exclusion 10 us 16 us
Miscellaneous Operating System Timings
Message Queue Related
msgQCreate 93 us 280 us
msgQDelete 71 us 229 us
msgQSend
CASE 1: task pending 39 us 102 us
CASE 2: no tasks pending 23 us 64 us
CASE 3: queue full 14 us 45 us
msgQReceive
CASE 1: message available 20 us 62 us
CASE 2: message unavailable 15 us 41 us
Memory Related
malloc 28 us 81 us
free 32 us 104 us
Watchdog Related
wdCreate 42 us 106 us
wdDelete
CASE 1: timer started 48 us 160 us
CASE 2: timer not started 44 us 150 us
wdStart
CASE 1: timer in queue 20 us 70 us
CASE 2: no timer in queue 18 us 55 us
wdCancel 11 us 34 us
Floating-Point
robot application 18 sec 51 sec
[ editorial:
If you care enough to count pico-seconds in comparing realtime
kernels, you might want to actually get each of the evaluation copies
from various vendors and test them yourself. But remember benchmarks
are misleading and almost always biased and inaccurate. Given similar
benchmark numbers, give or take a few microseconds, etc., your dollars
are better spent in getting something you'll enjoy using. ]
14. What are the performance/benchmark numbers for VxWorks TCP/IP?
|
According to WRS, using VxWorks 5.1 on mv167-25mhz (i82596 ethernet)
w/ cache w/o cache enabled
TCP/IP Throughput (KB/sec) 859 KB/sec 682 KB/sec
No numbers available on latency.
Using a reasonably fast processor 25Mhz MC68040 and a reasonably well
made ethernet chip like SONIC or LANCE put together on a reasonable board
design will achieve TCP throughput close to full bandwidth of ethernet.
[ editorial:
This, of course, is rather slow in comparison with other fast
implementations, since a 16Mhz MC68020 with onboard LANCE
or a PeeCee with an ethernet board can easily do the same.
I know at least one implementation based on el-cheapo i486-50mhz/EISA/SONIC
that does: 1170KB/sec. ]
15. What is the VxSim VxWorks Simulator?
|
Propaganda from WRS:
It really is VxWorks running under UNIX! So sure it
is not realtime, although all tasks and resources interact
in the same way -- great for prototyping "high-level" code.
Using the simulator saves wear and tear on h/w. It (only)
allows sytem level debugging with native GDB.
Portably written object code compiled for VxSim (for SunOS SPARC)
will usually load without recompilation on a SPARC target. And,
BTW under 5.1 switching from one architecture to anthoer really is
pretty painless.
16. Can I use one boot EPROM for multiple boards on the same net?
|
WRS provides EPROMs with a default bogus bootline, virtually all
boards come with non-valiatile RAM which is set as soon as the user
fills in his parameters (which include CPU #). Therefore 1 EPROM may
be duplicated and used in all boards at a site. If the board does not
contain nvram then ROMs have to be specially blown, unless a custom
scheme for reading some switches or something is coded to index into a
bootline table. In 5.1 BOOTP is supported -- no more repeated EPROM
burning is necessary.
17. What's the deal with 68881 FPU code in interrupt handlers?
|
In general, FP context is optimally saved only when the scheduler
notices that the new task coming in also uses the fpu (VX_FP_TASK).
ISRs don't. If no tasks are using the FPU then ISRs may go ahead,
unless different levels of ISRs could interrupt each other and again
cause a protocol violation.
And Stan Schneider says,
You have to set the "VX_FP_TASK" option flag when you spawn your task. You
also need to make sure you don't use the FPU in any interrupt service
routines. Even if your code uses no floating point, some (brain-dead)
compilers save some FPU registers at the start of all routines if there's any
floating point in the file. That's not usually a problem if you're using the
gcc compiler (at least with the "-O" flag on). A sure way to check is via the
dissassembler.
And Leonid Rosenboim says,
This problem is quite common, and really simple to fix, all you have to
do is make sure that all tasks that do a float operation ever, will
be spawned with the VX_FP task option set. This is the best and only
solution. Also, if you run floating operations in ISRs care must be taken,
to call fppSave() and fppRestore(). Also if you are using 5.1 on a 68040,
there is a bug in the compiler that you must work around:
If you write an ISR that uses float instructions, it is not
enougth just to call fppSave()/fppRestore(), since the compiler
for 68040 inserts FP instructions BEFORE your first line
of C code, hence you need to write a dummy ISR that:
dummyIsr()
{
fppSave( .. );
yourRealISR();
fppRestore( ... );
}
And Kent Long says,
This was indeed a real problem in the context switch code in 5.0.x which
was corrected in 5.1.
In both OS versions, there is an optimization that causes the FP context
to be swapped only when the incoming task has been spawned with the
floating point (VX_FP) flag set. In 5.0.x, the copying-in of the FP context
was done via an fppSave() call. This created problems if a new FP
task was created after a previous FP task had been pre-empted by a non-FP
task in the middle of an instruction. The new task ended up with a
mid-instruction context (which causes the protocol violation), and the
old swapped-out FP task ultimately ended up with its context set to IDLE
(which is equally incorrect).
In 5.1, the FP context initialization was changed so that when a task
is created, a pre-defined IDLE frame is copied into that tasks's context.
Since there is no assumption about current FP state (as with fppSave),
task creation is now decoupled from the regular switch logic.
18. Why does ls() not work on netDrv devices?
|
Because the way directory information retrieval IOCTL calls are not acompatible
between different types of "filesystems" in VxWorks. Another reason why
some think VxWorks filesytem does not exist; they're just a collection of
ioDevice drivers, and there is not a real consistent "design" to it.
The lsOld() should work on "filesystems" that does not support ls().
Chuck Mead proposes the following special routine in case lsOld() does not
work for you:
#include "vxWorks.h"
#include "bootLib.h"
#define RSHD 514
STATUS lsHost(path)
char *path ;
{
char *lsString;
int dataSock ;
int n ;
char nextChar ;
extern BOOT_PARAMS sysBootParams ;
extern char *sysBootHost ;
if (path == (char *) NULL)
{
lsString = (char *) malloc(4) ;
strcpy(lsString, "ls") ;
}
else
{
lsString = (char *) malloc(strlen(path) + 5) ;
sprintf(lsString, "%s%s", "ls ", path) ;
}
dataSock = rcmd (sysBootHost, RSHD, sysBootParams.usr,
sysBootParams.usr, lsString, (int *) NULL) ;
if (dataSock == ERROR)
{
printf("Error opening socket") ;
return (ERROR) ;
}
n = fioRead(dataSock, &nextChar, 1) ;
while (n == 1)
{
printf("%c",nextChar) ;
n = fioRead(dataSock, &nextChar, 1) ;
}
close(dataSock) ;
}
19. Why can't I do ".." at top level directories or NFS mount points?
|
Because, again, VxWorks does not really have a "filesystem" as most people
understand it. The top level directories are just implemented as device
driver "node", which is used to identify the ioDev associated with the
specific VxWorks "filesystem". Since there is no underlying filesystem
layer, the story ends there. When you're at the top of the directory
hierarchy within a given ioDev/filesytem, you simply cannot do "..".
20. Why do I have trouble using relative symbolic links with NFS?
|
See [Q: Why can't I do ".." at top level directories or NFS mount points?]
above. This is just another problem caused by the fact that a real filesystem
does not exist for VxWorks. NFS client implementation actually does implement
the symbolic links correctly, using lookup and readlink. The problem is
due to the fact that, for some relative links that use ".." or whatever, that
crosses over filesystems, VxWorks cannot have underlying subsystem that will
handle file pathname to device mapping.
Using absolute symbolic links work just fine (i.e. full path name from top).
WRS has a product called windX which supports Motif. There is also
libX11 contribution in the VxWorks Archive. This package is perhaps
fairly old and out of date. Essentially, to port X stuff to VxWorks
you'll need to do make sure code is re-entrant everywhere. There is a
"multi-thread" safe version of Xlib available somewhere on the net, one
might try porting that. There are also vendors that have built X servers
using VxWorks. Jupiter Systems, in Alameda, makes high-end X server
machines based on VxWorks. Other X terminal vendors (HP?) also use VxWorks.
22. IEEE-488 (GPIB) driver for VxWorks
|
- National Instruments has lots of GPIB stuff
- THEMIS computers has TSVME-409 whic hincludes a GPIB interface.
- APLABS probably has some GPIB stuff too.
23. How does one disable NFS client caching?
|
VxWorks caches read and write requests in NFS client code.
To completely disable read and write cache, set nfsCacheSize to 0.
To just flush the write cache as needed, use nfsCacheFlush() or
FIOSYNC ioctl().
24. Why doing a lot of slipInit()/slipDelete() cause routing table corruption?
|
This is due to a bug in slipDelete() and/or if_dettach(). slipDelete()
calls if_dettach() to clean up after itself (SLIP network interface driver).
Not only is if_dettach() misspelled, it also doesn't do a complete job.
One deficiency is that it does not delete routes that are pointing to the
interface being deleted. This is remedied via another function that deletes
all routes for a give netif device driver. [ ifRouteDelete() ??? ]
slipDelete() does call this routine to delete routes. Another problem is
that if_dettach() does not delete a pointer to the netif device driver
structure in the global in_ifaddr linked list. The in_ifaddr list is
used by the network kernel code to find IP addresses of available network
interfaces, among other things. This lack of proper cleanup turns out to
be a rather hard-to-find memory corruption problem in network code, and
usually manifests itself as routing table corruptions.
To fix this add the following routine, and call it right after calling
slipDelete:
void in_ifaddr_remove(ifp)
struct ifnet *ifp; /* ifp = ifunit("sl0") before slipDelete() called */
{
struct in_ifaddr *ia, *prev_ia;
if (!ifp)
return;
prev_ia = 0;
for (ia = in_ifaddr; ia; ia = ia->ia_next) {
if (ia->ia_ifp == ifp) {
if (prev_ia)
prev_ia->ia_next = ia->ia_next;
else
in_ifaddr = ia->ia_next;
return;
}
prev_ia = ia;
}
}
This, along with the route cleanup, should be incorporated into if_detach().
25. How does one get better network I/O performance?
|
Most of the overhead is due to socket to network core interface overhead.
The copy that happens between the socket layer and network core code
can be avoided by using the routines in uipc_socket.c (as in BSD tahoe
release code available on various archive sites) and using mbufs directly.
You can also try using raw etherLib routines. However, etherLib also
does copying between user application and network driver.
If you must use the socket interface (sockLib), make sure you tune the
socket level buffers sizes to optimal values using setsockopt() calls
SO_SNDBUF and SO_RCVBUF. You might also just try changing the globals
that control the following default parameters to larger numbers (all
the way upto 48K):
tcp_sendspace (default 4K)
tcp_recvspace (default 4K)
udp_sendspace (default 2K)
udp_recvspace (default 4K)
To get around extra latency in some cases, you might turn on TCP_NODELAY
option on TCP sockets.
26. How does one troubleshoot a backplane driver malfunction?
|
There are a few rules of thumb:
1) Try the simplest case first -- use polling instead of
bus or mailbox interrupts and software test-and-set
instead of hardware test-and-set. See if this
works first. And then try hardware test-and-set
and then the desired mailbox or bus interrupt.
2) Use bpShow() to see what's up. Also look for magic code
0x1234 in share mem area being used for messages,
and verify heartbeat is being incremented.
At the "anchor" you should see the magic code (4 bytes)
followed by a long word which should be incrementing (the
heartbeat) every second.
3) Verify all memory mapping and make sure there's no
address conflicts on the bus, and the anchor area is
properly set up. If the anchor and ring buffer area
is on a CPU, make sure the sysLib.c:sysHwInit()
does the right thing to allow access to on-board memory
by other CPU's in the chassis. Be careful, as some
VxWorks BSP's turn off on-board memory access by other CPU's
if the CPU is not processor 0. This should be changed
if your anchor CPU is not processor 0 (first CPU in
VME chassis/backplane) -- this is a boot time configuration
parameter [ based on the assumption that bpInit() will be done
by the processor 0 ].
4) Make sure bus controller is functioning properly.
Some combinations of boards might not work well
especially if your system controller board
arbitrates the bus in one way and other boards
expect to be arbitrated in a different way.
Sometimes you might need to use a separate
system controller. Of course, also make sure
you only have one bus master. And that your VME
bus strappings (BREQ, IACK daisychains) are right.
5) Call Wind River's VxWorks tech-support...
27. How do I add select support to my driver?
|
#include "selectLib.h"
...
xxx_init(...)
{
...
selWakeupListInit(&xxxdev->selwakeupList);
...
}
xxx_ioctl(...)
{
...
switch(request_type) {
...
case FIOSELECT:
selNodeAdd(&xxxdev->selwakeupList,
(SEL_WAKEUP_NODE *)request_arg);
if ((selWakeupType((SEL_WAKEUP_NODE*)request_arg) == SELREAD)
&& readable_condition_is_met)
selWakeup((SEL_WAKEUP_NODE*)request_arg);
if ((selWakeupType((SEL_WAKEUP_NODE*)request_arg) == SELWRITE)
&& writable_condition_is_met)
selWakeup((SEL_WAKEUP_NODE*)request_arg);
break;
...
case FIOUNSELECT:
selNodeDelete(&xxxdev->selWakeupList,
(SEL_WAKEUP_NODE*)request_arg);
break;
...
}
}
And, add calls to selWakeup() as appropriate in your interrupt handlers
and read/write routines as selective conditions are toggled or satisfied.
28. bind() gets EADDRINUSE, how do I fix it?
|
Fix: do setsockopt() SO_REUSEADDR
29. Common errors in interrupt handlers with floating point co-proc hardware
|
Don't forget to use:
fppSave() and
fppRestore()
Interrupt handler encapsulation code doesn't always save fpp registers for you.
30. Finding entry point of a given module using its name
|
Example from a poster in vxworks newsgroup (who?):
FUNCPTR start; /* found entry point goes here */
UINT8 symType;
int tid;
if(symFindByName(sysSymTbl,"_nlos_start",(char**)&start,&symType)==OK){
/* taskSpawn(name,priority,options,stacksize,entryAddress,arg1,..) */
tid = taskSpawn("nlos",TASK_PRI_NLOS,SPAWN_OPTS,STACK_SIZE_NLOS,start,
arg1,arg2,.....);
}
31. The problem with irint() in earlier (5.0.2 ?) releases
|
The problem:
/* Include Files */
#include "vxWorks.h"
#include "math.h"
long irint_count = 0;
sinTest()
{
int sinTable;
while(1)
{
sinTable = irint(sin(4./1024.*(2.*3.14))*10.);
irint_count++;
}
}
% cc68k -I/vxworks/vw/h -c sinTest.c
-> ld < sinTest.o
/* 0x08 Option = VX_FP_TASK within taskLib.h */
-> taskSpawn ("sinTest", 100, 0x08, 4000, sinTest)
/* OR without the Floating Point Option */
-> sp sinTest
-> irint_count
Bus Error
Program Counter: 0xb0ac0124
Status Register: 0x3004
Access Address : 0xb0ac0120
Special Status : 0x01e6
Task: 0xfcb82c "sinTest"
The answer:
Submitted-by wrs!yuba!kent@uunet.uu.net Fri Sep 27 18:55:25 1991
Submitted-by: wrs!yuba!kent@uunet.uu.net (Kent Long)
> In the function irint, there is a bug that sets the floating point
> Exception enable byte register to random values. Here is the
> disasembled code:
>
> _irint:
> 00e034 4e56 0000 LINK .W A6,#0
> 00e038 f227 6b80 FMOVE .X F7,-(A7)
> 00e03c f22e 5780 0008 FMOVE .D ($8,A6),F7
> 00e042 f22e 6380 0008 FMOVE .L F7,($8,A6)
> 00e048 202e 0008 MOVE .L ($8,A6),D0
>
> 00e04c f201 9000 FMOVE .L D1,#
> 00e050 504f ADDQ .W #8,A7
> 00e052 4e5e UNLK A6
> 00e054 4e75 RTS
>
> Line 0e04c is the line that sets the FPCR to some random value,
> as D1 is unknown going into the function. I rewrote the routine,
> without line 0e4c, and everything works fine.
> If anyone out there knows why this line was put in, I would
> appreciate knowing. Hope this may keep someone else from spending
> a couple of days tracking down this problem.
I have confirmed that this is a bug in all 5.x versions of VxWorks for
the 68k. (In fact, it's in 4.0.2 as well.) As Mark correctly observed,
the problem is that the FPCR register is erroneously being set.
This was a simple cut-and-paste error in the VxWorks source module.
The line which sets the FPCR should instead be restoring the value
of FP7, which was saved on the stack earlier (as you can see in the
code above). So, it would appear that another side effect of this
bug is to clobber FP7.
The fix is pretty simple. The following patch scripts should get things
back to what they should be. (You can just include the appropriate lines
in your startup script, or enter them from the VxWorks shell.)
For VxWorks 5.0.1 and 5.0.2:
pMathPatch = mathHardIrint + 0x18;
*pMathPatch = (short) 0xf21f;
pMathPatch = mathHardIrint + 0x1a;
*pMathPatch = (short) 0x4b80;
This bug does NOT affect VxWorks 5.1. The disassembled code for Vx5.1,
(HK68K/V3D) is:
_mathHardIrint:
2042e58 4e56 0000 LINK .W A6,#0
2042e5c f227 6800 FMOVE .X F0,-(A7)
2042e60 f22e 5400 0008 FMOVE .D (0x8,A6),F0
2042e66 f22e 6000 0008 FMOVE .L F0,(0x8,A6)
2042e6c 202e 0008 MOVE .L (0x8,A6),D0
2042e70 f21f 4800 FMOVE .X (A7)+,F0
2042e74 4e5e UNLK A6
2042e76 4e75 RTS
Kent Long further clarifies,
This was indeed a real problem in the context switch code in 5.0.x which
was corrected in 5.1.
In both OS versions, there is an optimization that causes the FP context
to be swapped only when the incoming task has been spawned with the
floating point (VX_FP) flag set. In 5.0.x, the copying-in of the FP context
was done via an fppSave() call. This created problems if a new FP
task was created after a previous FP task had been pre-empted by a non-FP
task in the middle of an instruction. The new task ended up with a
mid-instruction context (which causes the protocol violation), and the
old swapped-out FP task ultimately ended up with its context set to IDLE
(which is equally incorrect).
In 5.1, the FP context initialization was changed so that when a task
is created, a pre-defined IDLE frame is copied into that tasks's context.
Since there is no assumption about current FP state (as with fppSave),
task creation is now decoupled from the regular switch logic.
32. What are +T, +I thingies in the "i" output?
|
The following is an excellent description of all these symbols by many people
on the net, including "Fred J. Roeber" and others:
Description Status symbol
===================================== ===============
and task's priority inherited + I
Delayed and suspended DELAY+S
Pended and suspended PEND+S
Pended and Delayed PEND+T
Pended, delayed and suspended PEND+S+T
The DELAY state indicates that the task has done some sort of delayed
call while PEND means the task has done something that caused it to
block like trying to semTake a semaphore someone else was holding.
PEND+T means that the task is both delaying and pending; it has done a
semTake with a timeout. +I means that the task has (temporarily)
inherited a higher priority through the use of a mutex semaphore.
The priority inheritance protocol also accounts for the ownership of
more than one mutual exclusion semaphore at a given time. A task in
such a situation will execute at the priority of the highest priority
task blocked on any of the owned resources. The task will return to
its normal, or standard, priority only after relinquishing all of the
mutual exclusion semaphores with the inversion safety option enabled.
If you use nested mutex semaphores with priority inheritance turned on then
when a task inherits a high priority due to some inner semaphore it owns,
it doesn't lose that priority until it relinquishes all the semaphores it
holds. This doesn't quite follow the rules for priority inheritance (to
the extent that there really are any rules) in that normally, a task's
inherited priority should decrease as it releases each nested semaphore to
whatever the priority ceiling is for the semaphores it still holds. Getting
this incremental priority reduction to work right in practice, though,
is extremely difficult (some of the SUN papers on the Solaris real time
scheduling indicate that this was one of the hardest things for SUN to
get right in their OS upgrades). Given that VxWorks is a real time embedded
OS, I, for one, don't care if WRS uses the current implementation even
though it isn't "pure" because the result is a more reliable implementation
that runs more deterministically. Anyway, my guess is that you will find
that you have some nested semaphore code where you are doing something
after releasing one of the nested semaphores that shouldn't be done at a
high priority.
33. Gotchas w.r.t watchdogs
|
watchdog handlers run at interrupt level. You should not use
routines that can block in interrupt level code. Frequent mistakes:
using printf() in watchdog routines -- use logMsg() instead.
34. Is it possible to delete a memory partition in VxWorks?
|
No. memPartDestroy() is not really implemented. Perhaps it will
be in the future. Currently it just returns ERROR.
35. rename() does not work in netDrv and nfsDrv filesystems, why?
|
Because rename() is not implemented for netDrv (although it could be),
and nfsDrv does not implement rename() completely either.
Talk to WRS to get these fixed.
36. Free NFS Server for VxWorks
|
A free, incomplete, sample implementation (i.e. hack) of NFS Server
for VxWorks is available in:
ftp.netcom.com:/pub/hjb/vxnfsd.tar.gz
There is a README file there that describes further details. The current
snapshot of this implementation is a result of a couple of days of
hacking, doesn't do everything right, and intended for educational and
further hacking purpose.
There is someone else who's porting the MS-DOS PC-based nfs server (SOSS?)
to VxWorks. Not clear on its availability yet (let me know!)
37. Free SNMP for VxWorks
|
hoff@bnlux1.bnl.gov (Lawrence T. Hoff) reports,
We ported the CMU SNMPv2 code to vxWorks 5.1. This latest round of posts has
prompted me to put it in anonymous ftp (ctrldev1.rhic.bnl.gov --
130.199.96.82).
SNMP Research sells VxWorks compatible port of their SNMP implementation
with support. Their's cost $$$$$, though.
38. What third party products are available for VxWorks?
|
I tried to include the third party products, list of consultants,
services, goodies, etc. available for VxWorks from various sources but...
there are too many to list here. Instead,
the file:
ftp.netcom.com:/pub/hjb/vxworkers.gz
is updated in realtime to contain a list of individuals and companies
that offer help, services (paid or unpaid), and goods for VxWorks.
To get a copy (if you don't have ftp access) or to be listed in this file,
please contact or send info in ASCII to:
hjb@netcom.com.
39. What kind of products have been developed using VxWorks?
|
- Flight simulators
- Radio and optical telescopes
- Automative ABS & realtime suspension
- Naviation systems
- Deep sea instrumentation
- PBXs
- Traffic control systems
- Modems
- Sonar systems
- Comm/weather satellite test equipment
- X terminals
- PostScript laser printers
- Video Editing, Audio Visual systems
- robotics
- NFS protocol accelerator product
- mass storage tape robot systems
40. A complete list of CPU hardware supported by VxWorks
|
Complete list of WRS supported BSP's are available in:
ftp.netcom.com:/pub/hjb/vxbsp.gz
VxWorks runs on a lot of different hardware. Majority of hardware
supported is based on VME bus. Porting VxWorks to a new VME board
based on MC68K takes only a few days, give or take a week, depending
on your karmic condition at the time. A lot of the ports are
initially done by the customers and later "approved" by WRS, for
which they charge, in order to keep them on "supported" list.
Porting to a new "architecture" (new processor) takes longer.
This varies more widely -- from a few months to a few years.
41. A complete list of peripheral devices supported by VxWorks
|
Complete list of WRS supported BSP's are available in:
ftp.netcom.com:/pub/hjb/vxbsp.gz
VxWorks supports a wide variety of devices. A lot of device drivers
are written both by customers and WRS staff. There are device drivers
for almost popular available ethernet chips (except perhaps SEEQ and
Fujitzu, etc.), various serial chips (MC68681 DUART, Zilog 8350 Sync/Async
COMM chip, etc.), little I/O thingies in micro-controllers (MC68302
serial I/O, etc.), SCSI, etc.
Customers of VxWorks, hackers and other hardware vendors (especially
VME) usually have a VxWorks driver for their board. There are drivers
for FDDI boards, GPIB boards, A/D D/A boards, Graphics controllers,
frame grabbers, stepper motors, pin-ball machines, and etch-a-sketch
toy games.
A list of available device driver for VxWorks can be found in:
ftp.netcom.com:/pub/hjb/vxdrivers.gz
42. What's with these unbundled "accessories"?
|
Propaganda from WRS:
The new product/feature doesn't need to wait for the
next OS release. Only the users who want/need it pay for it
lengthens price list which keeps individual items lower
but still enhances WRS revenue growth.
Please Note:
WRS still always adds features to the core product, and
has never taken items out of core product to make them
unbundled. Unlike UNIX vendors and others.
43. How come my 5.0.2 BSP isn't available in 5.1, damn it?
|
Propaganda from WRS:
WRS tries to give customers 1 year warning when any product
may be discontinued. Unfortunately, all the bugs in
the notification system are still to be worked out.
Complain vehemently to your sales rep. if he didn't
keep you informed. WRS BSP obsoletion policy is primarily
based on BSP volume and h/w avalability.
The 5.1 Guide and Release Notes provide a step by step recipe to
upgrade from 5.0.2 -- minimal changes, start by ANISifying. The BSP
Port Kit 1.1 provides extensive info for the masochist.
In general: Not free, in fact, quite the opposite.
- Development License $23.5K (per project?)
- Source $120K.
- Target Licenses from $1000 for single quantity to $10 for 100,000+.
VxWorks is also available as a kernel-only product (MicroWorks 1.0) for
the following processors:
i960, 680x0, 683xx
MicroWorks is -- half the product at a half the price.
It has no network, native debug, shell, or profiling. Comes with
VxMon a very portable ROM monitor to talk with an enhanced vxGDB 2.0
also included -- this is the debug agent which allows true system
level debugging in ISR or wherever. In future, VxWorks may also be
able to work with VxMon.
Development License $12,500.
46. Other Unbundled Products for VxWorks?
|
Other unbundled "accessory" products are: VxMP ($4K) which is an
extended shared memory capabilities for the kernel allowing semaphores
and other objects to be manipulated over the backplane transparently
(really!).
VxVMI ($3K) is a library of virtual memory interface routines allowing
text & kernel data protection. complementary products: BSP Port Kit
1.1 ($2K), VxSim 1.0 ($5K), WindX ($3.5K), WindC++ ($2.5K), WindC++
Gateway for ObjectCenter ($?K's), and Realtime Innovations StethoScope (3K).
47. How can I find out more about VxWorks?
|
Read: comp.os.vxworks
Email: inquiries@wrs.com
Call: 1-800-KIK-WIND
48. What other net.resources are available on real-time systems?
|
There is at least one other newsgroup devoted exclusively to a particular
vendor's real-time operating system:
comp.os.os9 Discussions about the OS/9 operating system.
Here are some other related newsgroups:
comp.arch Computer architecture.
comp.arch.bus.vmebus Hardware and software for VMEbus Systems.
comp.os.misc General OS-oriented discussion not carried elsewhere.
comp.realtime Issues related to real-time computing.
comp.os.os9 Issues related to OS9 and OS9000 realtime OS.
There are too many other newsgroups devoted to computer operating systems
to list here. The interested reader is advised to check the "newsgroups"
file on a local news service machine.
The automatic server for users of pSOS RTOS is now in place.
PSOSUSER - A list intended for the discussion of topics relating to
pSOSystem and other products of Integrated Systems Inc.,
Software Components Group.
Send articles to psosuser@isi.com and administrative requests to
listserver@isi.com.
If you aren't already subscribed and would like to, please send a mail
message to listserv@isi.com containing the following in the body of
the message
SUBSCRIBE PSOSUSER
49. How do i use FIONBIO in 5.0.2 when there is no fcntl()?
|
Use ioctl() instead.
...
{
...
int on = 1; /* turn it on */
...
ioctl(fd, FIONBIO, &on);
...
}
50. Free lex and yacc for use with VxWorks
|
John Winas (winas@phebos.aps.anl.gov) writes,
I just (moments ago) uploaded the two packages to thor.ncar.ucar.edu where
the vxWorks archive is. When ever the maintainer moves it into the release
area they will be available to everyone. I named the file lexyacc.tar.Z
and it contains all of the sources and make files for you to build them.
It all seems to work perfectly on my sun sparc running 4.1.3. The
only thing you have to configure is the full path name to where you wish
to keep flex so that it can find its skeleton file when you use it to
lexify your .l files. Byacc has no skeleton files and simply needs to
be in your path.
This file is now available in:
epics.aps.anl.gov:/pub/lexyacc.tar.Z
I am interested in any bugs found in because we are using them here. Feel
free to email me at winans@phebos.aps.anl.gov.
[ From: kent@wrs.com (Kent Long) ]
In article <9311230139.AA21147@focal.com> bobk@focal.com (Bob Klawuhn) writes:
>I am currently trying to user the timerLib to obtain
>the amount of time that a timer has left before it
>expires. I am trying to use the timer_gettime function.
>The value that it seems to return is always the time
>that the start timer was given, not what is left on the
>timer.
This is indeed a bug in the 5.1 and 5.1.1 VxWorks versions.
It is now being tracked as WRS SPR #2673.
As a workaround, the following could be done following the
timer_gettime() call, to convert the erroneous results into
the desired remainder value:
#include "private/timerLibP.h"
struct timespec timeNow;
(void) clock_gettime (CLOCK_REALTIME, &timeNow);
TV_SUB (timerid->exp.it_value, timeNow);
...which leaves the remainder in it_value.
52. bogus INCLUDE_TCP_DEBUG
|
[ From: hipp@wrs.com (Emily Hipp) ]
>Bus Error
>Program Counter: 0x0001c738
>Status Register: 0x3000
>Access Address : 0xbfbfbfd3
>Special Status : 0x0505
>Task: 0x3dcc54 "tExcTask"
>TCP tracing not enabled (use INCLUDE_TCP_DEBUG).
This is misleading information. INCLUDE_TCP_DEBUG is not supported
as a configAll.h include option.
[ editorial:
INCLUDE_TCP_DEBUG never got integrated into VxWorks config files.
To get around this bug, until WRS fixes it, either unset SO_DEBUG
socket option using getsockopt()/setsockopt(), or call tcpTraceInit() (sp?)
which will drag in tcp_debug.o and set the tcp_trace() routine to
be called when debug option is set on TCP sockets.
]
Is available via anonymous ftp :
ftp.netcom.com:pub/peacefulstar/dab/vpppd-1.0.tar.gz
54. how to disable cache on mc68040 or mc68030 using TT regs?
|
[ From: Steve Morris ]
/**************************/
/* for 68030 (e.g. mv147) */
/**************************/
/* 2 large areas, R/W, cache disabled */
#define TT0_VALUE 0x403f8507 /* from $40000000 -> $77ffffff */
#define TT1_VALUE 0x03018507 /* from $02000000 -> $03ffffff */
test_tt ()
{
register int *pVal;
int ttVal;
pVal = &ttVal;
ttVal = TT0_VALUE;
asm ("pmove %0,tt0" : : "g" (*pVal));
ttVal = TT1_VALUE;
asm ("pmove %0,tt1" : : "g" (*pVal));
}
/**************************/
/* for 68040 (e.g. mv167) */
/**************************/
/* 2 large areas, R/W, cache disabled */
#define TT0_VALUE 0x403f8507 /* from $40000000 -> $77ffffff */
#define TT1_VALUE 0x03018507 /* from $02000000 -> $03ffffff */
test_dtt ()
{
asm ("movec %0,dtt0" : : "r" (DTT0_VALUE));
asm ("movec %0,dtt1" : : "r" (DTT1_VALUE));
}
55. work-arounds for MS-DOS filesystem bug when lseek() past eof
|
[From: georg@sgl.ists.ca (Georg Feil)]
[editorial: This is a workaround for the bug in VxWorks ms-dos
implementation which produces incorrect error return on write() after
lseek() beyond eof. N.B.: VxWorks versions upto 5.1.1 have buggy IO
system layer that does not support "correct" write() to normal files.
When writing to a file via write() expect to check the return value
even if it is not ERROR. Unlike most other systems (e.g. UNIX) VxWorks
write() upto version 5.1.1 will return number of bytes actually written
even when write() was not completely successful on devices that are
not marked non-blocking and/or are subject to flow control. ]
There's been enough heated debate on this so I'm sending out my brute force
workaround. Thanks to Kent Long who managed to let slip enough information
on the bug to identify the prerequisite: lseek() past the end of file.
My workaround simply extends a file by writing 0's on the end whenever there
is a seek past the end. (This may result in a file being extended when it
shouldn't have been, i.e. no write follows the seek, but what the hell.)
Use file_seek() below instead of lseek() to seek. Note that file_seek() is
not meant to be plug-replaceable with lseek(), that feature is not
required in our system. 'zero8k' is a character array of 8192 zero bytes.
int file_seek(int fd, int offset)
/*
* Sets the byte offset for the next write or read from a file.
* Simple interface to lseek() function. This returns ERR_NONE iff the actual
* actual seek offset returned by lseek() exactly matches the desired offset.
* 'fd' is the file descriptor to seek on.
* 'offset' is the absolute file position to seek to in bytes, where 0 is
* the beginning of file.
* Return value is error code (not a VxWorks error code, another type).
*/
{
STATUS st;
struct stat filestat; /* file status info obtained from fstat() */
int aoff; /* actual seek offset returned by lseek() */
long int fillsz; /* (***) for SPR #2739 kludge */
long int fillamt; /* (***) for SPR #2739 kludge */
if (FileDebug && Verbose>=2) {
wprintw(interact,"file_seek(): seeking to offset %d on fd %d\n",
offset,fd);
}
/* (***) workaround for VxWorks 5.1.1 bug SPR #2739 (write() returns with
transfer count too low but errno not set after seeking past current
end of file). Note that this may extend the file prematurely, i.e.
even if no write() calls follow the seek. */
/* get current file size */
st=fstat(fd, &filestat);
if (st==VX_ERROR) {
if (Verbose>0) {
wprintw(interact,"file_seek(): Error performing fstat() on fd %d: %s\n",
fd, vw_errmsg(0));
}
return(ERR_VXIO);
}
/* manually extend file using zero writes if seek offset past end of file
(note: tried ioctl() with FIOTRUNC, but this only works to shorten
files! */
if (offset>filestat.st_size) {
/* seek to the end of the file first */
errno=0;
aoff=lseek(fd, filestat.st_size, SEEK_SET);
if (aoff != filestat.st_size) {
/* returned value should always match 'filestat.st_size' */
if (Verbose>0) {
wprintw(interact,"file_seek(): Incorrect actual file position after seeking to EOF on fd %d (%d, should be %d) (%s)\n",
fd, aoff, filestat.st_size, vw_errmsg(0));
}
return(ERR_VXIO);
}
/* fill file with zeroes to bring length up to 'offset' */
fillsz=offset-filestat.st_size;
while (fillsz>0) {
if (fillsz>8192)
fillamt=8192;
else
fillamt=fillsz;
errno=0;
aoff=write(fd, zero8k, fillamt);
if (aoff == VX_ERROR) {
if (Verbose>0) {
wprintw(interact,"file_seek(): Error writing zeros to %d at pos %d: %s\n",
fd, ioctl(fd,FIOWHERE,0), vw_errmsg(0));
}
return(ERR_VXIO);
}
if (aoff != fillamt) {
if (Verbose>0) {
wprintw(interact,"file_seek(): Bad xfer count writing zeros to fd %d at pos %d (%d, should be %d): %s\n",
fd, ioctl(fd,FIOWHERE,0), aoff, fillamt, vw_errmsg(0));
}
return(ERR_VXIO);
}
fillsz -= fillamt;
}
/* flush the output to disk immediately */
st=ioctl(fd, FIOFLUSH, 0);
if (st!=VX_OK) {
if (Verbose>0) {
wprintw(interact,"file_seek(): Error flushing zeros written to fd %d at pos %d: %s\n",
fd, ioctl(fd,FIOWHERE,0), vw_errmsg(0));
}
return(ERR_VXIO);
}
}
/* (***) end of workaround for VxWorks 5.1.1 bug SPR #2739 */
errno=0;
aoff=lseek(fd, offset, SEEK_SET);
if (aoff == VX_ERROR) {
if (Verbose>0) {
wprintw(interact,"file_seek(): Error seeking to offset %d on fd %d: %s\n",
offset, fd, vw_errmsg(0));
}
return(ERR_VXIO);
}
/* returned value should always match 'offset' */
if (aoff != offset) {
if (Verbose>0) {
wprintw(interact,"file_seek(): Incorrect actual file position after seeking on fd %d (%d, should be %d) (%s)\n",
fd, aoff, offset, vw_errmsg(0));
}
return(ERR_VXIO);
}
return(ERR_NONE);
}
[ From: vanandel@rsf.atd.ucar.edu (Joe Van Andel) ]
Tool Command Language, version 7.0 (TCL7.0) for VxWorks 5.1 is
on thor.atd.ucar.edu:~ftp/pub/vx/tclvx7.0.v4.tar.gz
If you've ever been frustrated that the VxWorks shell is not re-entrant,
and has no control flow (e.g. if then else, switch, case ),
then you will find TCL very useful since it is a very complete language,
that allows you to add your own application specific commands to
the interpreter.
I find it invaluable for system testing, since I register TCL commands for
all major functionality of my real-time application. This allows me
to test most pieces of my data acquisition system from a command line,
and build nice flexible scripts to test and operate my system. As a
matter of fact, I can even invoke specific methods of C++ classes via
TCL.
Also, you can control your real-time system from a Unix workstation by
sending TCL commands from either a TCL or Tk/TCL application (via
tclTCP). I find that sending TCL commands (which are just strings) is
much easier and more flexible than writing a Remote Procedure Call
(RPC) for each piece of functionality that I need to remotely invoke.
A default route is a route table entry with destination field set to 0.
To do the equivalent of "route add default gateway metric" in VxWorks,
do:
routeAdd("0",addr_of_gateway);
58. adjusting network driver MTU size
|
VxWorks network driver are compatible (mostly) with tahoe BSD drivers.
To change MTU you should modify "if_mtu" field of "struct ifnet" you
pass to ether_attach() or if_attach().
59. tcpdump like utility for vxworks
|
Take a look at a hacked up packet trace program in:
ftp.netcom.com:/pub/hjb/vxsniff.c.gz
[editorial: if you have something better, let me know]
60. VxWorks performance on i960 -- unofficial benchmark
|
[ From: djanssen@mswe.dnet.ms.philips.nl (Ton Janssen, 62203 (TSSW)) ]
We did some measurements on a Heurikon HK80/V960E.
Equiped with I960CA on 33MHz. Here they are:
Test-item Conditions Time in us
==========================================================================
- semGive/semTake pair (binary semaphore) 11,2
- taskSpawn (+/- 8 processes active) 790,0
- taskIdVerify 0,72
- taskSuspend/taskResume pair (+/- 8 processes active) 32,6
- lstAdd/lstDelete pair (10 nodes in list) 3,9
- msgQNumMsgs 1,2
- msgQReceive (NOWAIT, no message available) 16,8
- rngBufPut/rnBufGet pair (0x20 bytes) 24,5
- msgQSend/msgQReceive pair (NOWAIT, no arguments) 65,7
- bcopy (Quad aligned data) 0,515/Quad
- mutliply two floats (depends on the value!) 1,6
- multiply two doubles ( '' '' ) 2,6
- Raw context switch (8 processes active) 34,0
61. VxWorks SCSI Performance -- unofficial benchmark
|
[ From: mea@mclean.sparta.com (Mike Anderson) ]
System : Heurikon HK68G/V4D
(33 MHz 68040, 16 MB RAM, NCR53C710 SCSI
w/ Hk SCSI DMA routines in Asynchronous SCSI mode)
VxWorks : 5.1.1 (Yes, using Heurikon VxWorks 5.1 BSP)
Disk : Seagate ST11200N (1 GB SCSI)
File System: VxWorks RAW partition mounted as "/sd1/"
Clock Rate : 60 Hz
Approach:
My application has the data coming in in 16Kbyte chunks. So, I devised
a piece of test code that would allow me to specify how many 16K chunks
I sent to the disk in each write and how many total bytes to write.
The actual number of bytes written is generally a little larger
(typically one more block size) due to the quick and dirty way I wrote
the code, but the calculations are based on the actual number of bytes
written to the disk. The technique I used was to write a block to the
disk starting at the end of the last write and then seek back to
relative zero and write the current pointer (just as I would in real
life to know how many bytes had been streamed total). Also, you
may notice the disk seek time coming into effect in the 200MByte files.
Here are the results:
Buffer Size Total Requested Bytes/sec total secs
16K 1024000 (1 MB) 1,032,192 1.000 secs
10240000 (10 MBs) 1,036,087 9.883 secs
102400000 (100 MBs) 1,049,538 97.566 secs
204800000 (200 MBs) 1,043,123 196.333 secs
32K 1024000 1,613,193 0.650 secs
10240000 1,627,997 6.300 secs
102400000 1,627,980 62.900 secs
204800000 1,610,485 127.166 secs
48K 1024000 1,935,360 0.533 secs
10240000 1,994,712 5.150 secs
102400000 1,992,209 51.416 secs
204800000 1,970,335 103.950 secs
64K 1024000 2,169,467 0.483 secs
10240000 2,244,905 4.583 secs
102400000 2,246,332 45.600 secs
204800000 2,216,450 92.400 secs
96K 1024000 2,495,409 0.433 secs
10240000 2,580,480 4.000 secs
102400000 2,571,534 39.833 secs
204800000 2,533,374 80.866 secs
128K 1024000 2,621,440 0.400 secs
10240000 2,761,250 3.750 secs
102400000 2,771,147 36.983 secs
204800000 2,726,693 75.133 secs
256K 1024000 2,859,752 0.366 secs
10240000 3,130,077 3.350 secs
102400000 3,139,304 32.650 secs
204800000 3,083,428 66.483 secs
512K 1024000 3,145,728 0.333 secs
10240000 3,346,519 3.133 secs
102400000 3,361,846 30.566 secs
204800000 3,298,416 62.150 secs
1024K 1024000 3,311,292 0.316 secs
10240000 3,475,942 3.016 secs
102400000 3,487,345 29.466 secs
204800000 3,416,806 60.150 secs
62. VME bus arbitration gotchas
|
[ From: mea@mclean.sparta.com (Mike Anderson) ]
> Sender: Rob Steele, e-mail: steele@telerobotics.jpl.nasa.gov
>
>
> I have the following VME hardware configuration:
>
> VME MOTES chassis:
> 1 Motorola system controller card,
> 6 Huerikon 68020 CPU boards (HKV2A),
> 1 Micro Memory shared memory cards,
> 1 Bit-3 memory interface card,
> 2 XYCOM parallel boards, and
> 1 Excelan ethernet cards.
>
> VadsWorks 2.0.1 (equivalent to vxWorks 5.0).
>
> I am unable to consistently boot the chassis. I will get bus errors and file
> not present errors during the boot process on different boards. On rare
> occasions it will boot. To get it to boot I halt the boot process on the last
> 4 boards in the chassis and let the first two boot. After that I boot the
> last 4 one at a time.
>
This sounds like the bus arbitration bug biting. If the Heurikon
boards are all at the same bus grant level (they typically come from
the factory set for BG3) then you will experience "random" failures
that manifest themselves as bus errors when a board is requesting
access to the bus and the bus is so busy that the bus watchdog timer
(1.6ms on the HKV2F I think) expires. This is proper behaviour. The
watchdog timer is designed to alert the task that it was unsuccessful
in accessing a VME address in the alotted time and to signal that
failure with a bus error.
Now, there are several ways to fix this problem. One which I heard
about was to modify the boot ROMs to insert a variable length delay so
the first CPU booted immediately, the second booted 30 secs later the
third 60 secs later, etc. This approach was clearly a software hack
trying to get around a hardware problem that was obviously not
understood. This approach *may* get the boards to boot, but with heavy
bus traffic during execution, they will continue to exhibit bus
errors. You see, because the VMEbus is a daisy chain, the first CPU in
the chain has an inherently higher priority than all others that follow
it. The 2nd board gets the next highest priority, etc. This means
that a graph of bus access at a given BG level looks something like
this:
% 60 |
|
A 50 | *
|
C 40 |
|
C 30 |
| *
E 20 |
| *
S 10 |
| *
S 0 |------------------*---*--
1 2 3 4 5 6
# CPUs @ BG level
I.e., it is roughly an exponential decay such that, given that the
level of bus activity is moderately high, the first 1-3 CPUs will hog
85-95% of the bus cycles and CPUs 4 and beyond will exhibit bus
errors. If the bus activity level is very high such as during a boot
up via the backplane, you may not be able to get past CPU 2 before bus
errors manifest themselves.
Now, as I said, there are other ways to circumvent this problem.
Boards equipped with the VIC68 VMEbus interface, for instance, can have
their bus timeouts changed to the point of an infinite wait (i.e., the
bus watchdog is disabled). This is still just a kludge that masks the
real problem of a poorly thought-out arbitration scheme. With this
approach, *real* bus errors resulting from errant pointers and/or
failed boards are never reported and the system just seems to hang.
The REAL solution to this problem is to adjust the arbitration levels
of the CPUs. In the VMEbus, there are 4 levels of arbitration (0-3)
where level 3 is the highest priority. By default, virtually all
manufacturers ship their boards configured for BG3. Therein lies the
problem. What the real-time designer needs to do is to realize this
and make adjustments. Within the 4 BG levels, there are two modes of
operation: 1) priority and 2) round robin. Priority mode means that
requests at level 3 will be honored before requests at level 2. Level
2 before level 1 and so on. This means that with priority mode
arbitration, put your highest priority boards at level 3 and distribute
the other boards at the other levels according to their relative
priorities.
On the other hand, round robin arbitration mode means that all 4 BG
levels are treated equally and each level gets a fair slice of the pie
so to speak. Use this mode when you have lots of boards, but no one
has an inherently higher priority than any other. With this approach,
we've been able to get as many as 16 CPUs running in the same backplane
(4 at each level) without experiencing bus error problems due to
arbitration. Remember, just because the boards are at different BG
levels doesn't circumvent the implied priorities of the CPUs at the
*same* level based on the bus daisy chain.
Typically, these modes are configurable on the system controller either
through a jumper, PAL or software bit twiddling. However, there are
boards that are hardwired for arbitration level 3 and they can't be
changed. The Force SPARC 1E board is one of them. This means that
you'll have to use these boards sparingly in VxWorks chassis (i.e.,
probably no more than 3-4 booting up from the backplane but probably
more if you boot each of them up from local Ethernet). I understand
that the SPARC 2CE board uses the same VMEbus interface as the 1E which
simply means that the bus errors will happen twice as fast. Hopefully,
the 3E board will get a real bus interface on it if it ever comes out.
63. 5.1.X new MMU supporting code gotchas, as explained in the Manual
|
Carl Lacy points out:
1) Programmer's Guide, Chapter 8.3, page 319
How to setup the the MMU for VMEbus address spaces
2) 5.1 Known Problems, Apr '93, page 5
"Several BSPs turn on basic MMU support ... even if
this option is not specified in configAll.h"
64. "Cannot Boot. Error 0x1a9" while downloading/booting
|
This is usually due to the built-in 'feature' inside UNIX inetd program.
The inetd tries to protect itself against clients that tries to connect
too often too quick as a security measure, and disallows more than a
certain number of connections per given period. When this happens, you
will usually see some error messages about "looping..." in syslog (if
your syslog is configured correctly) on your UNIX server (where inetd
is running). You can either try "kill -HUP inetd_process_id" to get
out of this looping mode by sending HUP signal to inetd, or try and
install the patched version of inetd that has larger default limit for
looping detection (some OS vendors will supply this to you), or just
get a copy of BSD UNIX distribution of inetd source code and compile it
with larger default values for looping and use it instead.
65. problem with qsort in some versions of VxWorks
|
Some version of VxWorks are shipped with incorrect qsort(), to remedy
this problem, get a copy of qsort() from BSD UNIX distribution and
compile it.
From: jbrandt@sadira.gb.nrao.edu (JOE BRANDT)
remainder = fmod(value, divisor) - floating point modulus routine.
The fmod math subroutine for MC68040 (mv167) based systems has been a
topic of discussion on the exploder for many months now. The first
discussions centered around the routine's accuracy (or even
correctness). Recently I found that the execution time for the fmod
subroutine is highly dependent upon the ratio of the values passed as
arguments. At a ratio of about 50,000 I was seeing execution times of
30ms! The VxWorks exploder FAQ now includes a proposed replacement
routine for 68040 systems, however, I have examined this routine and
found it to be grossly in error in cases where the divisor is less than
1.0.
Fortunately, I have found public domain source, authored by Sun Micro,
for an alternative algorithm. This algorithm claims an exact result.
It can be found at:
ftp: plan9.att.com in /netlib/fdlibm/e_fmod.c.Z
(I believe ftp.uu.net has copies also).
This appears to be a ieee 754 compliant routine. (I'm not sure what that
means, but it sounds good).
Execution times for this routine are in the range of 7.6 to 100 usec's.
From: vince@rti.com (Vince Chen)
> Submitted-by jbrandt@sadira.gb.nrao.edu Thu Mar 24 12:41:16 1994
> Submitted-by: jbrandt@sadira.gb.nrao.edu (JOE BRANDT)
>
> While I currently am not questioning the accuracy of the fmod() function,
> the performance of it on the mv167 card (68040) is certainly an issue.
>
> Here's some benchmarks from my 5.1/mv167 system:
>
> -> timexN(fmod, 49999.9, 1.0)
> timex: 25 reps, time per rep = 30661 +/- 666 (2%) microsecs
>
> -> timexN(fmod, 2.0, 1.0)
> timex: 50000 reps, time per rep = 9 +/- 0 (0%) microsecs
>
> -> timexN(fmod, 499.9, 1.0)
> timex: 2525 reps, time per rep = 305 +/- 6 (1%) microsecs
>
> Note the dependency on value. Unfortunately I am doing an fmod on the
> modified julian date (which is currently in the 50,000 range). which
> took an amazing 30ms/call !
>
> Wow, can some elaborate on the algorithm being used here. seems like it's
> a straight while loop!
I believe it is a while loop. A while loop is faster in some situations,
but obviously no in yours!
Until WindRiver resolves the problem w/ the Motorola Floating Point
Exception library, you may want to roll your own on this one.
The code below on our 167/25MHz VxWorks 5.1 consistently results in 30 usecs.
double myfmod1( double arg1, double arg2 )
{
return (arg1 - ((int) (arg1 / arg2)) * arg2);
}
67. MVME167 copyback cache and various problems
|
From: georg@sgl.ists.ca (Georg Feil)
Well, I think we now know everything there is to know about the memPartFree()
problem we were having. The original subject here was "memPartFree error with
copyback caching enabled (167)". Other past subjects that possibly related to
the same problem include "Large disk writes w/cache cause MVME167 to freeze".
Apparently the root cause is a snooping problem related to TAS on the
VME bus, and is Motorola's fault.
The solution to the problem is the "MV167 VME TAS patch" from Wind River.
It consists of a number of configuration changes (config.h, usrConfig.c etc.)
and is reproduced below. Strangely, it has not been assigned an SPR number
because, according to Wind River, the new version of the 167 BSP has fixed
the problem so no SPR is necessary(?). This situation may have changed since
the memos I received were written. Our related TSR number was 16443, which may
help others to track things down.
I had already come up with my own patch that worked in a completely different
(and very specific) way, and the WRS patch seemed not to apply to our problem.
However I tested the patch anyway and sure enough it fixes our problem.
As a bonus, SCSI performance seems to have improved slightly.
From: stan@rti.com (Stan Schneider)
Under VxWorks 5.1.x, you can use 'unld "file.o"' (or use the module id,
see "moduleShow"). Under 5.0.x, you can't unload the object & reclaim the
space, but you could remove the symbols from the symbol table with symRemove.
That would prevent unnecessary maiming.
I have drivers/libraries and application software for the Siemens 82532
multiprotocal chip (SYNC/ASYNC serial communication HDLC/SDLC protocol).
Maybe there is some compatibility. This is running well in our inhouse
designed VME module running vxWorks for the mc68332.
Graham Waters waters@reg.triumf.ca
TRIUMF Control Systems Engineer.
Meson Research Facility,
4004 Wesbrook Mall,
Vancouver, B.C. Canada.
V6T 2A3
There are two types of keep alive packets. I don't know how smart snoop
program is, but most analyzers i've seen do not flag a keepalive packet
seperately. KEEPALIVE is not really a part of original TCP spec anyway, so
this is understandable. it was an add-on hack by BSD implementation.
KEEPALIVE packet was implemented in two different ways. The 4.2 BSD
implemented it wrong, and expected a zero length packet. It send out a
packet with sequence number set to snd_una -1 (send unack'ed - 1) and ack
set to rcv_next - 1. This forces a response from a well behaved TCP peer
either in the form of a ACK or RST depending on the peer's status.
The 4.3 BSD implemented it better, and send a packet with the sequence
number set to snd_una -1 and ack set to rcv_next. the same behavior was
expected from a TCP peer.
The keep alive will start after connection has been idle for 75 seconds, and
keepalive probe will continue for ~2 hours. This is in adherence to the
spirit of TCP -- you're not supposed to randomly drop connections. the
keepalive is, in a way, a necessary evil, that provides realistically slow
garbage collection of zombie connections. It was not meant to be used to
detect the other side dying away in realtime. It can be a headache, for
example, over a very high latency link, when you're doing a telnet, if
keepalive randomly or too quickly decided to terminate the connection
because it timed out too quick. You might think 2 hours is excessive, but
in the old days, this was a real problem -- like when you are telneting to a
host in spain over various low quality, high latency, noisy links... type a
character and wait for 10 mintues... that was common.
The only application controllable, expectedly behaving way to detect the
other side going away is to use a seperate connection, or a seperate
connection-less channel.... for out of band status info delivery. a kind of
i'm alive or are you there... many TCP and UDP based standard protocols do
a form of this... FTP, RSH, etc.
SO_RCVTIMEO and SO_SNDTIMEO are not used in reality. they are set and
recorded but never reference for any useful work.
Adjusting keepalive packet intervals is only possible using global
variables. This will affect all TCP sessions on the machine that has
keepalive mode set.
You can play games with the following variables:
tcp_keepidle, tcp_keepintvl, tcp_maxidle.
Please get a copy of BSD UNIX TCP/IP code from FTP sites and look into
tcp_timer.c and other TCP files before you go and change things though. The
keepalive is actually also used in connection setup phase (as a hack).
Changing these values carelessly will result in wierd TCP behavior on all
TCP sockets.
71. Contributions to comp.os.vxworks FAQs.
|
The following net.folks, among others, have contributed to this posting:
[ If you have contributed, but are not listed here please me know. ]
Name email address
------------ ----------------------------
Mike Anderson mea@mclean.sparta.com
Mark Linimon linimon@nominil.lonestar.org
Geoff Espin geoff@wrs.com
Rev. Bob Crispen crispen@foxy.boeing.com
Stan Schneider stan@rti.com
Fred J Roeber fjr@ssd.ray.com
Marc Friedman maf@verdix.com
Joe Van Andel vanandel@ncar.ucar.edu
Emily Hipp hipp@wrs.com
Bob Marino ram@mr.picker.com
Richard Kowalsky cmdorat@tc.fluke.com
Kent Long kent@wrs.com
James Moore james@wrs.com
Chuck Meade chuckm@verdix.com
Patrick T. Pinkowski ppinkow@jupiter.ksc.nasa.gov
D'Anne Thompson dat@noao.edu
Leonid Rosenboim leonid@rst.hellnet.org
David Lim lim@robotics.jpl.nasa.gov
Richard Neitzel thor@thor.atd.ucar.edu
John Winas winas@phebos.aps.anl.gov
Lawrence T. Hoff hoff@bnlux1.bnl.gov
Georg Feil georg@sgl.ists.ca
Steve Morris morriss@smtplink.indigo.co.il
Don Brooks dab@netcom.com
Ton Janssen djanssen@mswe.dnet.ms.philips.nl
Brian Brunner brunner%mother@icarus.ssd.loral.com
Carl Lacy lacy@draper.com
Joe Brandt jbrandt@sadira.gb.nrao.edu
Vince Chen vince@rti.com
Graham Waters waters@reg.triumf.ca