unicode string library
git clone git://
Log | Files | Refs | README | LICENSE

DateCommit messageAuthorFiles+-
2022-08-17 07:55Build tests and benchmarks locally and add separate shared build-flagsLaslo Hunhold2+14-8
2022-08-16 16:56Separate CC into CC and BUILD_CC, add option to disable ldconfig(1)-callLaslo Hunhold2+24-18
2022-08-16 15:43Revert "Use ar(1)'s s-flag instead of invoking ranlib(1)"Laslo Hunhold2+3-1
2022-08-16 14:51Use SIZE_MAX instead of (size_t)(-1)Laslo Hunhold1+1-1
2022-08-16 14:25Move get_codepoint_*()-util-functions to src/util.cLaslo Hunhold6+75-137
2022-07-31 09:46Use (size_t)(-1) instead of SIZE_MAX and fix styleLaslo Hunhold1+3-2
2022-07-31 09:44Rename reallocarray() to reallocate_array() to prevent manglingLaslo Hunhold1+9-9
2022-07-30 21:29Add reallocarray implementationrobert1+10-0
2022-07-20 13:33Use ar(1)'s s-flag instead of invoking ranlib(1)Laslo Hunhold2+1-3
2022-06-29 03:07macro-hygiene: wrap arguments in parenthesisNRK1+1-1
2022-06-24 10:05Only add updated object files to the static libraryLaslo Hunhold1+1-1
2022-06-23 22:45Explicitly use object-files in library-generationLaslo Hunhold1+2-2
2022-06-21 08:45Implement case-conversion and -checkLaslo Hunhold14+48375-72
2022-06-19 09:12Add data-LICENSE fileLaslo Hunhold2+60-10
2022-06-16 22:02Remove some testing residueLaslo Hunhold2+1-38
2022-06-16 21:47Implement line-segmentationLaslo Hunhold16+15015-16
2022-06-10 20:20Fix link in src/sentence.cLaslo Hunhold1+1-1
2022-06-10 20:18Fix comment in src/sentence.cLaslo Hunhold1+1-1
2022-06-10 19:15Add Word-data-filesLaslo Hunhold2+3292-0
2022-06-10 19:11Implement sentence-segmentationLaslo Hunhold9+3995-0
2022-06-09 09:12Remove redundant branchLaslo Hunhold1+1-6
2022-06-06 20:58Remove unused variable in test/util.cLaslo Hunhold1+0-1
2022-06-06 20:16Implement word-segmentationLaslo Hunhold19+827-148
2022-03-01 08:27Properly handle cp == NULL in grapheme_decode_utf8()Laslo Hunhold1+9-0
2022-01-09 16:30Refactor benchmark codeLaslo Hunhold4+97-84
2022-01-09 11:48Explicitly log which class wants to override whichLaslo Hunhold1+3-1
2022-01-09 11:42Homogenize and rename some types/functions in gen/ and test/Laslo Hunhold8+63-66
2022-01-09 11:22Add a general break-test-check-function in test/utilLaslo Hunhold3+56-35
2022-01-08 16:15Remove unused includes in gen/character.cLaslo Hunhold1+0-8
2022-01-08 16:10Rename gen/character-properties.c to gen/character.cLaslo Hunhold3+6-7
2022-01-08 16:08Clean up gen/types.h from the old range-list-typesLaslo Hunhold1+0-10
2022-01-08 16:04Add comment-parameter to benchmark for optional further contextLaslo Hunhold4+15-11
2022-01-08 15:56Rename gen/properties.c to gen/character-properties.cLaslo Hunhold3+6-6
2022-01-08 15:53Update LICENSELaslo Hunhold1+1-1
2022-01-08 15:50benchmark/util.c: Explicitly convert to doubleLaslo Hunhold1+2-1
2022-01-08 15:49Consistently use least-typesLaslo Hunhold4+8-7
2022-01-08 15:45Explicitly cast to uint_least8_t when storing property in stateLaslo Hunhold1+1-1
2022-01-08 14:45gen/util: Add properties-handling and clean up old range-list-functionsLaslo Hunhold4+323-404
2022-01-08 14:44Increase iteration count in character-benchmarkLaslo Hunhold1+1-1
2022-01-07 17:34Silence a few warnings and make it clear that speclen is at most 255Laslo Hunhold1+3-3
2022-01-07 17:30Refactor derived-properties-table-generationLaslo Hunhold1+14-11
2022-01-07 17:16Insert numbers instead of enum-names into mono-tableLaslo Hunhold1+7-9
2022-01-07 17:01Determine maximum value manually in print_lookup_table()Laslo Hunhold1+13-8
2022-01-07 16:41Refactor data-table-generation, use enum-minor-tableLaslo Hunhold2+167-123
2022-01-05 15:52Revert "Use proper iteration-count for benchmark-time-calculation"Laslo Hunhold1+2-2
2022-01-05 15:40Use proper iteration-count for benchmark-time-calculationLaslo Hunhold1+2-2
2022-01-05 15:02Calculate benchmark-times as average time taken for each callLaslo Hunhold4+14-13
2022-01-04 17:56Mark likely branchesLaslo Hunhold2+23-8
2022-01-04 17:47Use "#ifdef" instead of "#if defined"Laslo Hunhold2+2-2
2022-01-04 17:11Add UTF-8 decoder benchmarkLaslo Hunhold2+124-1
2022-01-04 17:29Generate separate utf8proc_int32_t buffer to preserve strict aliasingLaslo Hunhold1+18-2
2022-01-04 17:10Generalize benchmark-function with payload-structLaslo Hunhold3+32-26
2022-01-02 11:48Introduce mostly branchless character break detectionLaslo Hunhold6+163-341
2022-01-02 11:24Create separate comparative benchmark/ folderLaslo Hunhold3+153-0
2021-12-31 14:41Add generator for compressed O(1)-lookup-tableLaslo Hunhold4+370-3
2021-12-25 21:35Quote constructed string in make distLaslo Hunhold1+11-11
2021-12-25 21:33Automatically gzip tarballLaslo Hunhold1+1-1
2021-12-25 19:36Separate benchmark from testLaslo Hunhold8+52-95
2021-12-22 14:39Bump to version 1Laslo Hunhold1+1-1
2021-12-22 14:59Also include README in distribution tarballLaslo Hunhold1+1-1
2021-12-22 14:37Add "clean-data" to PHONY-targetsLaslo Hunhold1+1-1
2021-12-22 14:20Add a remark on standard conformance in READMELaslo Hunhold6+36-5
2021-12-22 12:46Add README and change title and introduction in Makefile and manualsLaslo Hunhold3+52-5
2021-12-22 11:56Call ldconfig after install and uninstallLaslo Hunhold1+2-0
2021-12-19 15:37Alphabetic ordering of variables in config.mkLaslo Hunhold1+1-1
2021-12-19 15:31Fix a few manpage-errors found by the linterLaslo Hunhold2+37-29
2021-12-19 00:36Break some long linesLaslo Hunhold3+51-25
2021-12-19 00:22Match function parameters in code and documentationLaslo Hunhold4+69-69
2021-12-18 23:52Rewrite grapheme_next_character_break() and add size-parameterLaslo Hunhold3+53-42
2021-12-18 19:21Use SIZE_MAX instead of (size_t)-1Laslo Hunhold2+6-6
2021-12-18 18:49Rename API functions to improve readabilityLaslo Hunhold17+407-407
2021-12-18 12:24Consistently refer to "codepoints" as "codepoints", not "code points"Laslo Hunhold9+42-42
2021-12-18 12:13Rename GRAPHEME_SEGMENTATION_STATE to GRAPHEME_STATELaslo Hunhold5+8-10
2021-12-18 11:48Reintroduce the "grapheme_" prefixLaslo Hunhold18+423-421
2021-12-18 11:02Rename functions/defines/files from "grapheme" to "character"Laslo Hunhold12+466-466
2021-12-18 00:44Add lg_utf8_*() manuals to Makefile MAN3-variableLaslo Hunhold1+2-0
2021-12-18 00:26Add manual pages for lg_utf8_*() and refactor lg_grapheme_nextbreak()Laslo Hunhold3+202-3
2021-12-18 00:25Refactor manual pages for lg_grapheme_*()Laslo Hunhold2+13-10
2021-12-18 00:04Improve a small edge-case in lg_utf8_decode()Laslo Hunhold2+62-3
2021-12-17 00:28Rename LG_CODEPOINT_INVALID to LG_INVALID_CODE_POINTLaslo Hunhold4+28-28
2021-12-16 23:52Ensure const-correctness in cast in lg_utf8_decode()Laslo Hunhold1+4-4
2021-12-16 23:34Encourage strict aliasing for library users (uint8_t * -> char *)Laslo Hunhold6+54-45
2021-12-15 12:34Make lists in Makefile more idiomatic and avoid breaksLaslo Hunhold1+13-10
2021-12-15 11:53Refactor Makefile, add dist-target and add test-utilLaslo Hunhold7+72-33
2021-12-15 09:59Refactor manual pages, document lg_grapheme_isbreak()Laslo Hunhold5+160-108
2021-12-14 15:25Reduce number of iterations for grapheme-performance-testLaslo Hunhold1+1-1
2021-12-14 15:23Refactor error-messages and test-outputLaslo Hunhold5+33-25
2021-12-14 14:59Fix memory leaks during header-generationLaslo Hunhold4+27-2
2021-12-14 14:58Update UTF-8-decoder test for modified NULL-caseLaslo Hunhold1+1-1
2021-12-14 13:06Make lg_utf8_*() NULL-agnosticLaslo Hunhold1+8-4
2021-12-14 12:53Reorder CFLAGSLaslo Hunhold1+1-1
2021-12-14 12:45Avoid undefined signed integer overflow in heisenstate_set()Laslo Hunhold1+3-3
2021-12-14 12:41Optimize bsearch-comparison-function for cp-rangesLaslo Hunhold1+8-2
2021-12-14 12:39Change return-type of time_diff to doubleLaslo Hunhold1+6-5
2021-12-14 11:51Coding style and explicit double-castLaslo Hunhold1+4-3
2021-12-13 19:16Cast pedanticallyLaslo Hunhold3+5-5
2021-12-12 15:03Change lg_grapheme_isbreak() return type from int to boolLaslo Hunhold2+9-6
2021-12-12 14:51Improve parsing of hexadecimal stringsLaslo Hunhold1+33-25
2021-12-12 12:10Properly set internal functions as staticLaslo Hunhold1+5-4
45 more commits remaining, fetch the repository