ignatz@chinet.UUCP (ignatz) (10/20/86)
Memory allocation problems exist with both Microport and the SCO Xenix compiler, kids. Just ran into it tonight when trying to get a program working that heavily depends on using sbrk(). FYI, the traditional version of sbrk() will extend the current data space by the requested byte increment, returning the old 'break', or end-of-data-address, value. This memory was contiguous, and could also be released to the OS by using a negative increment in another sbrk() call. This call, and the related brk() call, comprise the OS interface to basic memory managment that is used in more general-purpose packages such as malloc(). A common use of the direct sbrk() interface is to provide direct stack managment in such programs as the shell; you get a memory access violation on a stack push, then do an sbrk() to extend the stack (if you can), and continue processing. The Microport version of sbrk() works properly--or at least, the documentation claims it does--in the small memory configuration. HOWEVER, in the Large memory model, it's got the following exotic behavior (the following from the Microport documentation): For *any* sbrk() call with a positive increment, **all** of the rest of the addresses in the current segment are skipped, and the entire incremental allocation is provided in the *next* segment. Not only that, but you are explicitly warned that addresses will not be necessarily contiguous with previous sbrk() call values, because of this--there may be 'holes' for the skipped memory--and that return values may not necessarily have address relationships with the current break value. The Xenix System V manual is less blatant about the whole thing, but it's there, too. It states that in large model programs, if the requested increment is greater than the number of unallocated bytes remaining in the current segment, then it skips to the next segment for the entire request. While this is a bit nicer than the Microport default of always skipping, it still leaves that bloody non-contiguous window. Also, as for the Microport version, the result of sbrk(0) is not truly indicative of anything more than a marker for the previous memory location. In both cases, handling of the sbrk() call with a negative increment reflects similar deviations from what was previously considered 'normal' Unix(Tm, and all that) behavior. Now, I understand the problems inherent with dealing with a segmented architecture and memory management--I was fighting this, with a MMU, in 1981. STILL--this behavior is both a terrible deviation from the original behavior of the Unix sbrk(), and a violation of the current proposed IEEE Std. 1003.1. Now, granted, the other bible--The Sys V Interface Document, from AT&T--seems to beg the issue of brk() and sbrk() (As I remember--my SVID docs are at work right now, and I'm at home; corrections will, I am sure, be provided!!); but nevertheless, it seems obvious to me that if you're going to try and say it's Unix, then it ought to provide downward compatibility in the behavior of already-extant system services. And the standard, while not accepted, is still pretty reliable. The upshot? SCO and Microport had better get on the bandwagon and provide routines that work. I'm sorry, I know it may not be easy, but then nobody ever said it was going to be easy, did they?... -- Dave Ihnat Analysts International Corporation aicchi!ignatz or wlcrjs!ignatz (w) 882-4673 (h) 784-4544
andrew@amdahl.UUCP (Andrew Sharpe) (10/21/86)
In article <646@chinet.UUCP>, ignatz@chinet.UUCP (ignatz) writes: > Memory allocation problems exist with both Microport and the SCO Xenix > compiler, kids .... > Also, as for the Microport version, the result of sbrk(0) is > not truly indicative of anything more than a marker for the previous > memory location. This is not quite correct. The System V/286 version (for sbrk(0) in large model) will return a memory location that is the _beginning_ of the next segment, so that the pointer may be remembered and used later after an sbrk(+n). Now, Microport is supposed to be using System V/286. This is how it works in the System V/286 kernel, available from AT&T. If Microport has changed the behavior of brk() and sbrk(), that's a shame, because we worked very hard (in conjunction with AT&T) to make the memory management of SystemV/286 act (as much as possible) like the VAX. This was one of the charters of the 286 sanctioned port. -- Andrew Sharpe !{ihnp4,decwrl}!amdahl!andrew -------------------------------------------------------- The opinions expressed above do not reflect the views of the employees, nor the management, of Amdahl Corporation --------------------------------------------------------
ignatz@aicchi.UUCP (Ihnat) (10/25/86)
I recently posted a bit to the net about the apparently anomalous behavior of sbrk() under Microport Sys V and Xenix. I have, as you might guess, received a number of responses to both the primary item in my original posting, and to some incidental comments I made. Possibly the simplest item to address is the observation made in article <4004@amdahl.UUCP> by Andrew Sharpe on my comment that the value returned isn't to be considered more than a marker: This is not quite correct. The System V/286 version (for sbrk(0) in large model) will return a memory location that is the _beginning_ of the next segment, so that the pointer may be remembered and used later after an sbrk(+n). He goes on to comment that they worked quite hard to emulate the Vax behavior, (which I'm sure they did), and that he hopes that Microport didn't change anything; and I'm sure they did not. I simply repeated a caveat in the man page that warned the user not to consider this address to be something the program can use in address calculations, because it won't be meaningful in terms of contiguous memory address calculations with previous values returned by calls to sbrk(). In light of this fact, no program with claims to portability should consider the returned value anything more than a "magic cookie" to be passed to subsequent sbrk(+/- n) calls. Now, as to the other responses I've received via mail...I'd truly forgotten how rapidly people can type; often long before the brain can engage. There's some line noise somewhere between the brain and fingers of some people that translates into sarcastic and abusive aspersions on the intelligence, experience, and professional competence of the target (me). Now, mind you, I did receive intelligent, well-written queries that simply asked if I knew the problems inherent in a memory architecture implemented on a segmented machine, with a memory managment unit that doesn't lend itself to a flat address space. I just want some of the apparently technically competent people who sent the *other* type of response to be aware that you can question someone else's conclusions or statements without turning it into a contest of egos or a professional challenge. Now...back to issues. Yes, I *do* know the problems with the 286, and segmented architecture, and the screwy memory managment addressing. I know just how hard it is/would be to recreate the flat address space that the traditional sbrk() attempted to provide. However, it can be done, and must be if the portable operating system standard is to be followed. No, every reference need not be checked; the huge model Microsoft compiler generates inline code to build the segment/offset reference on the fly, and it's not "500 times slower" than the corresponding small model that assumes that references are within the current 64K segment. It's more inefficient, certainly; but it works well enough to use. I definitely think that the 286 architecture could have used some more thought when it was designed--say, put those two mode bits in the high- order bits of the segment, instead of the low end, and then provide a memory incrementing scheme that allowed the offset increment overflow to bump the associated segment; you could then easily have modelled a flat contiguous address space. But working with what we've got, it would make sense to offer a version that either follows the Xenix model--allocates increments in the current segment until the request would go off the end-- or jumps to a new segment, but guarantees that all addresses from the old break value to the new segment are actually mapped to valid real memory. It would then be the responsibility of the compiler, or the programmer, not to fall into the 64K offset wraparound trap; but at least you could count on contiguous addresses. (Yes, I know--contiguous meaning you go from offset 0-64K, then increment the upper 14 bits of the segment while preserving the two mode bits...) I didn't really want to go into a discussion of the whole issue; I assumed that anyone reading the original posting would realize that I understood the issues, and was pointing out that the model, as implemented, changed the basic assumptions of the "classic" Unix brk()/sbrk() behavior; and, particularly, clobbers the traditional shell stack allocation method. Oh, well...I'll individually answer the people who were either kind enough to write decent letters, or were hasty enough to write flames. But this is my "technical" explanation. I don't intend to say anything more on the net about this; please mail to me if you wish to express any other opinions or ideas. Of course, if something truly significant is brought up, I'll summarize, but otherwise, this is really an issue that's probably run its course in terms of net discussion. -- Dave Ihnat Analysts International Corporation (312) 882-4673 ihnp4!aicchi!ignatz || ihnp4!homebru!ignatz
tim@ism780c.UUCP (Tim Smith) (10/25/86)
sbrk has to behave funny because of program that assume a linear address space. Many program that want N bytes do the following: char * ptr; ptr = sbrk(0); brk(ptr+N); To make these work correctly, sbrk(0) has to return the start of the next segment. For example, here is how brk and sbrk work on the AT&T pc6300+ ( which is a good indication of how AT&T thinks things should work on 80286 based machines ) ( all byte counts and addresses are rounded to a multiple of 512 ) In small model, everything looks like a normal unix system In large model, sbrk(0) returns start of next data segment sbrk(N) N > 0 allocates N bytes from next data segment sbrk(N) N < 0 does this: N = -N while ( N >= size of last segment ) N = N - size of last segment remove last segment remove N bytes from last segment brk(addr) ( where addr == byte N of segment S ) if addr in current last segment, then set segment size to N if addr in next segment, then same as sbrk(N) if addr in an allocated part of a previous segment then throw away everything above addr else error -- member, all HASA divisions POELOD ECBOMB -------------- ^-- Secret Satanic Message Tim Smith USENET: sdcrdcf!ism780c!tim Compuserve: 72257,3706 Delphi or GEnie: mnementh
tony@wldrdg.UUCP (Tony Andrews) (10/27/86)
From Tim Smith sdcrdcf!ism780c!tim > sbrk has to behave funny because of program that assume a linear > address space. Many program that want N bytes do the following: > > char * ptr; > ptr = sbrk(0); > brk(ptr+N); > > To make these work correctly, sbrk(0) has to return the start of > the next segment. Unfortunately, malloc wasn't fixed to properly deal with the screwy behavior of brk and sbrk. For small malloc requests, malloc asks for a 1K chunk and allocates from that. Each time it decides to grab another 1K it gets it from a NEW segment. The net effect is that malloc squanders segments and can run out of memory because of a lack of LDT entries before it even comes close to hitting other limits. Tony Andrews ...!ihnp4!onecom!wldrdg!tony Wildridge Consulting, Inc. Boulder, CO
dan@prairie.UUCP (Daniel M. Frank) (10/29/86)
>Unfortunately, malloc wasn't fixed to properly deal with the >screwy behavior of brk and sbrk. For small malloc requests, >malloc asks for a 1K chunk and allocates from that. Each time it >decides to grab another 1K it gets it from a NEW segment. The >net effect is that malloc squanders segments and can run out >of memory because of a lack of LDT entries before it even comes >close to hitting other limits. In the System V/286 port done by Intel, and further ported by Microport and AT&T (SV/AT and 6300+ Unix), there is an additional malloc library (described in malloc(3x)) which may be used instead of the usual malloc stuff. This library includes a routine called mallopt(), which allows the following parameters to be set: M_MXFAST Allocate all blocks below the size of maxfast in large groups. M_NLBLKS Each large group allocated will contain numlblks. M_GRAIN The sizes of all blocks smaller than maxfast are considered to be rounded up to the nearest multiple of grain. M_KEEP Emulates behaviour of old malloc() by keeping the data in a block intact until that block is used again. By judicious use of these calls, the segment usage behaviour, and most likely the speed, of memory allocation under SV/286 can be significantly enhanced. -- Dan Frank uucp: ... uwvax!prairie!dan arpa: dan%caseus@spool.wisc.edu