OOM in sme-smj, loops through same rules over and over again (not sure if it ever ends) #97

@unhammer

$ echo '– Lea stáhtahálddašeaddji rolla sihkkarastit ahte boazodoallit,
báikkálaš ja regionálalaš eiseválddit gulahallet ja lea maid
stáhtahálddašeaddji bargu oahpahit aktevrraide boazodoalo
areáladárbbu. Departemeanta lea 2021 vuosttaš jahkebeale gárveme
sierra bagadallama boazodoalu ja plána- ja huksenlobi birra mii galgá
nannet boazodolliid plána- ja huksenlobi gelbbolašvuođa ja mii galgá
nannet fylkkagielddaid ja gielddaid gelbbolašvuođa boazodoalus ja
boazodoallovuoigatvuođain, lohká Skogan.'  | apertium -d . sme-smj_rtx

This hangs.

The same hang can be reproduced directly from input-to-rtx.txt, which avoids building the full pipeline (giella-smj doesn't have updated packages to build with):

$ cat input-to-rtx.txt | rtx-proc --anaphora sme-smj.rtx.bin
^–<punct>$ ^Liehket<vblex><indic><pres><p3><sg>$ ^stáhttaháldadiddje<n><nomag><sg><gen>$ ^roalla<n><sg><nom>$ ^sihkarasstet<vblex><inf>$ ^jut<cnjsub>$ ^ælloniehkke<n><pl><nom>$^,<cm>$
^bájkálasj<adj><attr>$ ^ja<cnjcoo>$ ^regiåvnålasj<adj><attr>$ ^oajválasj<n><pl><nom>$ ^guládallat<vblex><indic><pres><p3><pl>$ ^ja<cnjcoo>$ ^liehket<vblex><indic><pres><p3><sg>$ ^stáhttaháldadiddje<n><nomag><sg><nom>$
^aj<adv>$ ^barggo<n><sg><nom>$ ^åhpadit<vblex><supn>$ ^akterra<n><pl><ill>$ ^ællosujtto<n><sg><gen>$
^areálla<n><cmp_sgnom><cmp>+dárbbo<n><sg><acc>$

and then it hangs.

With --rules we can see it applying the same rules over and over again.
(Could some sort of per-sentence memoisation / dynamic programming be useful?)
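
To make the memoisation idea concrete, here is a minimal sketch in Python. It is a toy and does not reflect how rtx-proc works internally: the grammar (a single ambiguous rule NP NP -> NP), the function names and the call counter are all hypothetical, chosen only to show how caching results per (start, end, symbol) span within one sentence avoids re-deriving the same sub-spans.

```python
from functools import lru_cache

# Hypothetical toy grammar, NOT taken from sme-smj.rtx:
# one ambiguous binary rule, (left, right) -> parent.
TOY_RULES = {("NP", "NP"): "NP"}


def make_counter(tags, rules, memoise):
    """Build a parse counter for ONE sentence; the cache dies with it."""
    stats = {"calls": 0}

    def count(start, end, symbol):
        # Number of ways tags[start:end] can be reduced to `symbol`.
        stats["calls"] += 1  # with memoisation, only cache misses get here
        if end - start == 1:
            return 1 if tags[start] == symbol else 0
        total = 0
        for split in range(start + 1, end):
            for (left, right), parent in rules.items():
                if parent == symbol:
                    total += count(start, split, left) * count(split, end, right)
        return total

    if memoise:
        # Rebinding the closure variable routes recursive calls through the cache.
        count = lru_cache(maxsize=None)(count)
    return count, stats


def run(n_tokens, memoise):
    """Return (number of parses, number of evaluated spans) for n_tokens NPs."""
    tags = ["NP"] * n_tokens
    count, stats = make_counter(tags, TOY_RULES, memoise)
    return count(0, n_tokens, "NP"), stats["calls"]


if __name__ == "__main__":
    for memoise in (False, True):
        parses, calls = run(10, memoise)
        print(f"memoise={memoise}: parses={parses}, evaluated spans={calls}")
```

For 10 tokens both variants find the same 4862 parses, but the naive version evaluates 19683 spans (growing as 3^(n-1)) while the memoised one evaluates only the 55 distinct spans. Whether something like this applies to rtx-proc depends on how its parser shares state between competing analyses, which this sketch does not model.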
