The hlml from dangmoody

Include .inl files in source files, and .h files in headers

Currently, we include .h everywhere and the .h files will include their respective .inl files (if any) at the bottom of the file. This will affect the user's compile times.

Instead, source files that include HLML files should include .inl files, which include .h files. The header of the user's app can still include HLML headers.

Optimisations to quaternion functions

Had a look through this part of the library not long ago and I think we can optimise some of those functions.

support for half-precision/float16?

I've been thinking about this a bit. Would this even be worth it?

It could be a fair amount of work to even know whether or not this is worth doing.

The generator probably needs better benchmarking support to be able to tell if this would be faster.

Let users generate only the math code they need?

Would it be more beneficial to have users run the generator locally so that they can only have the math types they need in their codebase?

The main advantage of this would be that the user only has the code they care about in their codebase, meaning less code bloat.

This would have to be done via some kind of config file, which specifies which types and features the user wants the generator to generate.

Generate assembly for each function when also generating the code files?

This could be cool to show what the assembly of each function is likely to be (with -O3 and -ffast-math on, for instance).

When making changes to the generated code you'd also be able to see diffs in the assembly too, to see maybe exactly when a function became faster/slower?

C++ standard 20 conflicting definitions of lerp function

On Ubuntu 22.04.3 LTS using G++ and Clang++ with CPP standard 20 there are conflicting definitions of the lerp function. Here is the full error message:

[build] [ 50%] Building CXX object CMakeFiles/testexe.dir/main.o [build] In file included from /home/sigill/Dev/CPP/HLMLTest/./cpp/hlml. [HLML_lerp_conflict_test.zip](https://github.com/dangmoody/HLML/files/14343402/HLML_lerp_conflict_test.zip) h:127, [build] from /home/sigill/Dev/CPP/HLMLTest/main.cpp:1: [build] /home/sigill/Dev/CPP/HLMLTest/./cpp/hlml_functions_scalar.h:170:69: error: ‘float lerp(float, float, float)’ conflicts with a previous declaration [build] 170 | HLML_INLINE float lerp( const float a, const float b, const float t ) [build] | ^ [build] In file included from /usr/include/c++/11/math.h:36, [build] from /home/sigill/Dev/CPP/HLMLTest/./cpp/hlml_functions_scalar.h:41, [build] from /home/sigill/Dev/CPP/HLMLTest/./cpp/hlml.h:127, [build] from /home/sigill/Dev/CPP/HLMLTest/main.cpp:1:

Please find attached the sample project used to generate the above error message:
HLML_lerp_conflict_test.zip

The lerp functions were added in standard 20: https://en.cppreference.com/w/cpp/numeric/lerp but for some reason MSVC does not throw any compilation errors here.

Running Doxygen process doesn't work on Linux/MacOS for Travis VMs

Just FYI

The zip on the releases page isn't actually a zip. I could get it extracted with 7z but Windows complained.

need to do another optimisation pass

Use premake to generate windows solution

It'll be neater than what I'm doing right now.

Negate operator got removed between v1 and v2

Whoops! Need to re-add that!

GitHub Actions CI runners randomly fail to install MinGW and/or other packages for some reason

Do they just need updating or something?

Add negate operator

E.G:

float3 x = float3( 1.0f, 1.0f, 1.0f );

float3 y = -x;

I thought I'd added this before but it looks like I haven't. This wants to get added sooner rather than later because this is used quite often!

In C this will want to be float3_negate( &x ) (using the example above).

some tests randomly take a lot more time than the same test

Sometimes in the tests you'll see a test (float4x4_caddv, for instance) take 0.5 microseconds in one test, and then 10 microseconds in the next one. It would be good to know why that is.

Identical Bool Sizes Between Compilers

The size of a bool can change depending on what compiler is used to compile the program (Or even defined by the user).

HLML ideally needs some way to always guarantee that its bool types are the same size. My use case for this would be sending data to a shader via a uniform buffer, data may become misaligned when using different compilers to target different platforms.

Consolidate SIMD input structs

It's become apparent that with the current way the SIMD functions are laid out that we really only need the following types of input structs:

Single matrix.
LHS/RHS.
Translate a matrix (unique to translate_sse()).
Scale a matrix (unique to scale_sse()).

So the current implementation where we have a separate input struct for each SIMD function should be re-done.

Doing this would make for nicer use, and potentially faster code as the user wouldn't be having to shuffle lots of data/registers around nearly as much.

Major generator refactor needed

The codebase has become a mess over time due to me not being able to see just what the code for the generator would look in it's current state. There's a lot more code than there probably needs to be. No semantic compression happening, etc.

It would be good to refactor the entire generator so that all the code for generating the C files is in one file, all the code for generating the C++ files in another, etc. I think that would be much neater than what we have now.

Then main could just be something like the following:

int main( int argc, char** argv ) {
	// some other pre-existing setup that's probably the same as before...

	Gen_CodeC( ... );
	Gen_CodeCPP( ... );

	// any other shutdown stuff that was here from before

	return EXIT_SUCCESS;
}

I've been trying to treat C and C++ as the same thing with some minor differences, but I think it could much more beneficial to just treat them as two separate languages, and then any similarities can be compressed into helper functions as needed.

If I'm right, this will significantly reduce the amount of code that exists in the generator atm, the codebase would be easier to navigate, and it would be easier to read.

I could be wrong about this, and it could actually be worse but this would be worth for me setting some time aside one day to look at.

Either way, the current state of the generator codebase is a mess and could definitely be done a lot better than how it is now.

Remove "comp_" prefix in favour of just "c"

Typing comp_ every time for a component-wise transformation can be a little bit of a PITA. Typing c would be easier.

For example, instead of typing:

float4_comp_addv( &a, &b );

It would be easier to type:

float4_caddv( &a, &b );

I'm still not 100% sure this is something worth doing. What do we think?

Show performance numbers for scalar and SIMD functions on average!

Even if they're crap to begin with! It's important to be transparent with people about this sort of thing.

File IO on windows is flakey

@Flave229 is seeing folder deletion/creation randomly fail and crash the generator locally for him.

This likely just needs someone adding a bunch more calls to GetLastError() and then working off the returned errors from there.

Travis Windows VM is constantly failing

It just looks like Travis have changed how their Windows VMs work on their end and I need to change some config stuff to get this to work again.

Then again - it's only MSVC; will we really miss it?

Compilation errors in VS 2022

Trying to compile in a cpp project on visual studio 2022 tool chain results in the following two errors:

Error C3861 'assert': identifier not found in bool2.inl:69
Error C4146 unary minus operator applied to unsigned type, result still unsigned in hlml_functions_vector.h:2409

Error c3861 is easily solved by including assert.h at the top of hlml.h. C4146 can be fixed by removing the "-" from the return types of negation operator functions.

Add Quaternion Documentation

better alternative to writing and batch and bash scripts

Every time I need a script I have to write it twice: Once for Batch, and again for Bash. This is annoying and not that easy to maintain. I need a better solution.

I was looking at ODIN recently, could be good?

add more tests

We could do with tests for the following:

constructor tests for each different constructor for vectors and matrices
assignment operator tests
array access operators

Warnings for C4201 on /W4

When warnings Level 4 is enabled for projects including HLML, compiling with MSVC, the following warnings occurs: C4201: nonstandard extension used: nameless struct/union

Rewrite tests for Temper 2.0

Temper 1.0 didn't have things like parametric test support, which Temper 2.0 now does.

We can really start to throw a lot of tests at HLML now to harden it.

This would take a while because nearly every test probably wants to get parameterised.

Optional `#define` for allowing C++ users to use a `hlml` namespace.

I saw the xxHash documentation the other day that they have an optional #define for using the API with namespaces. It's completely optional and seems to 'just work'.

This means that HLML could also do the same thing and provide support for users who have been complaining of name collisions with other libraries/modules.

I'm only going to do an initial investigation for now which looks into how much friction and boilerplate this ends up introducing to the codebase. My guess is quite a bit.

Warning C4244 in hlml_functions_vector.inl

The following error occurs multiple times for many functions in this file:
warning C4244: 'return': conversion from 'int32_t' to 'float', possible loss of data

Pseudo-inverse for non-square matrix types

Currently division for non-square matrix types does a component-wise division. Should do a multiply-by-pseudo-inverse to be consistent with square-matrices (which do a multiply-by-inverse).

add all()

Useful helper function, definitely faster than comparing against a bool vector/matrix constructor of all true. Should definitely go in.

Basically should just look like this:

bool all( const bool2& x ) {
	return x.x && x.y;
}

// and so on for bool3, bool4, bool3x2, etc.

No 'extern "C"' for the C API files

Well, this is embarrassing...

This probably needs to be added.

Add ARM neon SIMD backend

Machines with ARM ISAs are becoming more present so it's probably a good idea to make HLML support it.

Not sure how much work this will be but hopefully it's not going to be a massive amount.

Will testing on a Raspberry Pi be sufficient?

Add float4x4_rotate_quaternion

Will mean I need to rename float4x4_rotate to float4x4_rotate_angle_axis.

Timer on Linux returns incorrect results

I've just checked the build logs on Travis and it looks like the Mac OS/Linux timer implementation is just returning results that are incorrect.

I thought I'd got this correct the first time around. Is it possible that testing via a Linux VM guest was screwing with the results somehow?

I'll sort it.

allow the generator to output generated files to user-specified location driven by command line arg

Make zero-initialisation of maths types optional.

People have said that they don't like the fact that if they initialise any math type they'll have to pay the cost of the zero-initialisation. Therefore this needs to be option (either in the generator or through the code usage itself) that people can opt-in to.

I've initially overlooked the fact that it makes more sense that the code doesn't do anything other than what the programmer tells it to, but something like this should still be given as optional functionality that's minimal and easy to use.

Swizzling with .wzxy() function call syntax not compatible with HLSL

One of the main issues trying to compile a couple of generic HLSL functions with HLML was swizzling. https://github.com/redorav/hlslpp has a templated solution to this and it looks like a perfect thing for autogeneration: https://github.com/redorav/hlslpp/tree/master/include/swizzle :)

dot_lean()?

A friend of mine using the library found that he wanted to do a dot product for 2 float4s but he only cared about dotting the X, Y, and Z components. He suggested a dot_lean() function which would do this.

Sounds like a good idea, but I'm wondering if stuff like this could be added per-application instead of just having in the library; would it add confusion? Would this be bloat?

Sort out primary include headers for users

The main headers that users include are currently:

hlml_functions_scalar.h
hlml_functions_vector.h
hlml_functions_matrix.h

It's not obvious that these are the main headers to include (both by name and documentation).

So either this needs to be documented, or better header names need to be thought of.

Some operators have double reference operators (&&) when there should only be one.

Noticed this in operator+= for instance in some vector types.

"cannot convert from 'int2' to 'float2'" - explicit conversion constructors needed to match HLSL

Hi! Thanks for the great lib. Just trying to compile some HLSL code, stumbled upon this one.

dangmoody / hlml Goto Github PK

hlml's People

Contributors

Stargazers

Watchers

Forkers

hlml's Issues

Recommend Projects

Recommend Topics

Recommend Org