Hi
I have this corpus which looks like this:
LEXEME vehicle
CLASSIFICATION M1x
ARTNR 27405
DEF car ((noun automoblie sks hui)(noun vehicle sks hui)) is ((verb hji
ska) (noun ako hui)) blue ((adj hji hui))
=
LEXEME people
ARTNR 27406
DEF strong ((adj hji hui) (adv ly hui)) is ((verb hji ska) (verb kos
hji)) man ((noun ako hui) (prop hui))
DEF beautiful ((adj hji hui) (adv ly hui)) women ((verb hji ska) (verb
kos hji)) are ((noun ako hui) (prop hui)) successfull ((adj sko hui) (adv ly
hui))
=
and with this program I can transform this corpus into this structure:
CL-USER 3 > (parse-corpus corpus)
(#S(CORPUS-DEFINITION LEXEME VEHICLE CLASSIFICATION M1X ARTNR 27405 DEFS
((#S(TOKEN NAME CAR ATTRIBUTES ((NOUN AUTOMOBLIE SKS HUI) (NOUN VEHICLE SKS
HUI))) #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (NOUN AKO HUI))) #S(TOKEN
NAME BLUE ATTRIBUTES ((ADJ HJI HUI)))))) #S(CORPUS-DEFINITION LEXEME PEOPLE
CLASSIFICATION NIL ARTNR 27406 DEFS ((#S(TOKEN NAME BEAUTIFUL ATTRIBUTES
((ADJ HJI HUI) (ADV LY HUI))) #S(TOKEN NAME WOMEN ATTRIBUTES ((VERB HJI SKA)
(VERB KOS HJI))) #S(TOKEN NAME ARE ATTRIBUTES ((NOUN AKO HUI) (PROP HUI)))
#S(TOKEN NAME SUCCESSFULL ATTRIBUTES ((ADJ SKO HUI) (ADV LY HUI))))
(#S(TOKEN NAME STRONG ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI))) #S(TOKEN NAME
IS ATTRIBUTES ((VERB HJI SKA) (VERB KOS HJI))) #S(TOKEN NAME MAN ATTRIBUTES
((NOUN AKO HUI) (PROP HUI)))))))
To make it more readable I write it like this (I put a # for the DEFS part)
(#S(CORPUS-DEFINITION LEXEME VEHICLE CLASSIFICATION M1X ARTNR 27405 DEFS ((#
# #))) #S(CORPUS-DEFINITION LEXEME PEOPLE CLASSIFICATION NIL ARTNR 27406
DEFS ((# # # #) (# # #))))
Now I want to extract parts of the information. For instance I would like to
have the defs for a given LEXEME ie I would like something like:
(find-defs people) --> ((#S(TOKEN NAME BEAUTIFUL ATTRIBUTES ((ADJ HJI HUI)
(ADV LY HUI))) #S(TOKEN NAME WOMEN ATTRIBUTES ((VERB HJI SKA) (VERB KOS
HJI))) #S(TOKEN NAME ARE ATTRIBUTES ((NOUN AKO HUI) (PROP HUI))) #S(TOKEN
NAME SUCCESSFULL ATTRIBUTES ((ADJ SKO HUI) (ADV LY HUI)))) (#S(TOKEN NAME
STRONG ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI))) #S(TOKEN NAME IS ATTRIBUTES
((VERB HJI SKA) (VERB KOS HJI))) #S(TOKEN NAME MAN ATTRIBUTES ((NOUN AKO
HUI) (PROP HUI)))))
Can this be done???
I know already that
(corpus-definition-defs (car (parse-corpus corpus)))) -->
((#S(TOKEN NAME CAR ATTRIBUTES ((NOUN AUTOMOBLIE SKS HUI) (NOUN VEHICLE SKS
HUI))) #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (NOUN AKO HUI))) #S(TOKEN
NAME BLUE ATTRIBUTES ((ADJ HJI HUI)))))
but this is the defs part of the first element in our corpus (vehicle), the
question is how I can generalize this function to work for any given lexeme?
tnx
ab talebi