HTML Tidy  5.4.0
The HTACG Tidy HTML Project
Memory Allocation

Detailed Description

Tidy uses a user provided allocator for all memory allocations.

If this allocator is not provided, then a default allocator is used which simply wraps standard C malloc/free calls. These wrappers call the panic function upon any failure. The default panic function prints an out of memory message to stderr, and calls exit(2).

For applications in which it is unacceptable to abort in the case of memory allocation, then the panic function can be replaced with one which longjmps() out of the tidy code. For this to clean up completely, you should be careful not to use any tidy methods that open files as these will not be closed before panic() is called.

TODO: associate file handles with tidyDoc and ensure that tidyDocRelease() can close them all.

Calling the withAllocator() family ( tidyCreateWithAllocator, tidyBufInitWithAllocator, tidyBufAllocWithAllocator) allow settings custom allocators).

All parts of the document use the same allocator. Calls that require a user provided buffer can optionally use a different allocator.

For reference in designing a plug-in allocator, most allocations made by tidy are less than 100 bytes, corresponding to attribute names/values, etc.

There is also an additional class of much larger allocations which are where most of the data from the lexer is stored. (It is not currently possible to use a separate allocator for the lexer, this would be a useful extension).

In general, approximately 1/3rd of the memory used by tidy is freed during the parse, so if memory usage is an issue then an allocator that can reuse this memory is a good idea.

Data Structures

struct  TidyAllocator
 An allocator. More...
 
struct  TidyAllocatorVtbl
 An allocator's function table. More...
 

Typedefs

typedef void(TIDY_CALLTidyFree) (void *buf)
 Callback for "free" replacement. More...
 
typedef void *(TIDY_CALLTidyMalloc) (size_t len)
 Callback for "malloc" replacement. More...
 
typedef void(TIDY_CALLTidyPanic) (ctmbstr mssg)
 Callback for "out of memory" panic state. More...
 
typedef void *(TIDY_CALLTidyRealloc) (void *buf, size_t len)
 Callback for "realloc" replacement. More...
 

Functions

Bool TIDY_CALL tidySetFreeCall (TidyFree ffree)
 Give Tidy a free() replacement. More...
 
Bool TIDY_CALL tidySetMallocCall (TidyMalloc fmalloc)
 Give Tidy a malloc() replacement. More...
 
Bool TIDY_CALL tidySetPanicCall (TidyPanic fpanic)
 Give Tidy an "out of memory" handler. More...
 
Bool TIDY_CALL tidySetReallocCall (TidyRealloc frealloc)
 Give Tidy a realloc() replacement. More...
 

Data Structure Documentation

◆ _TidyAllocator

struct _TidyAllocator

An allocator.

The allocator.

To create your own allocator, do something like the following:

typedef struct _MyAllocator {
TidyAllocator base;
...other custom allocator state...
} MyAllocator;
void* MyAllocator_alloc(TidyAllocator *base, void *block, size_t nBytes)
{
MyAllocator *self = (MyAllocator*)base;
...
}
(etc)
static const TidyAllocatorVtbl MyAllocatorVtbl = {
MyAllocator_alloc,
MyAllocator_realloc,
MyAllocator_free,
MyAllocator_panic
};
myAllocator allocator;
TidyDoc doc;
allocator.base.vtbl = &MyAllocatorVtbl;
...initialise allocator specific state...
doc = tidyCreateWithAllocator(&allocator);

Although this looks slightly long winded, the advantage is that to create a custom allocator you simply need to set the vtbl pointer correctly. The vtbl itself can reside in static/global data, and hence does not need to be initialised each time an allocator is created, and furthermore the memory is shared amongst all created allocators.

Data Fields
const TidyAllocatorVtbl * vtbl

◆ _TidyAllocatorVtbl

struct _TidyAllocatorVtbl

An allocator's function table.

The allocators function table.

All functions here must be provided.

Public Member Functions

void *TIDY_CALLalloc (TidyAllocator *self, size_t nBytes)
 Called to allocate a block of nBytes of memory. More...
 
void *TIDY_CALLrealloc (TidyAllocator *self, void *block, size_t nBytes)
 Called to resize (grow, in general) a block of memory. More...
 
 void (TIDY_CALL *free)(TidyAllocator *self
 Called to free a previously allocated block of memory. More...
 
 void (TIDY_CALL *panic)(TidyAllocator *self
 Called when a panic condition is detected. More...
 

Data Fields

voidblock
 
ctmbstr msg
 

Member Function Documentation

◆ alloc()

void* TIDY_CALL* alloc ( TidyAllocator *  self,
size_t  nBytes 
)

Called to allocate a block of nBytes of memory.

◆ realloc()

void* TIDY_CALL* realloc ( TidyAllocator *  self,
void block,
size_t  nBytes 
)

Called to resize (grow, in general) a block of memory.

Must support being called with NULL.

◆ void() [1/2]

void ( TIDY_CALL free)

Called to free a previously allocated block of memory.

◆ void() [2/2]

void ( TIDY_CALL panic)

Called when a panic condition is detected.

Must support block == NULL. This function is not called if either alloc or realloc fails; it is up to the allocator to do this. Currently this function can only be called if an error is detected in the tree integrity via the internal function CheckNodeIntegrity(). This is a situation that can only arise in the case of a programming error in tidylib. You can turn off node integrity checking by defining the constant NO_NODE_INTEGRITY_CHECK during the build.

Field Documentation

◆ block

void* block

◆ msg

ctmbstr msg

Typedef Documentation

◆ TidyFree

typedef void(TIDY_CALL * TidyFree) (void *buf)

Callback for "free" replacement.

◆ TidyMalloc

typedef void*(TIDY_CALL * TidyMalloc) (size_t len)

Callback for "malloc" replacement.

◆ TidyPanic

typedef void(TIDY_CALL * TidyPanic) (ctmbstr mssg)

Callback for "out of memory" panic state.

◆ TidyRealloc

typedef void*(TIDY_CALL * TidyRealloc) (void *buf, size_t len)

Callback for "realloc" replacement.

Function Documentation

◆ tidySetFreeCall()

Bool TIDY_CALL tidySetFreeCall ( TidyFree  ffree)

Give Tidy a free() replacement.

◆ tidySetMallocCall()

Bool TIDY_CALL tidySetMallocCall ( TidyMalloc  fmalloc)

Give Tidy a malloc() replacement.

◆ tidySetPanicCall()

Bool TIDY_CALL tidySetPanicCall ( TidyPanic  fpanic)

Give Tidy an "out of memory" handler.

◆ tidySetReallocCall()

Bool TIDY_CALL tidySetReallocCall ( TidyRealloc  frealloc)

Give Tidy a realloc() replacement.