Hi
I have this corpus which looks like this:
LEXEME vehicle
CLASSIFICATION M1x
ARTNR 27405
DEF car ((noun automoblie sks hui)(noun vehicle sks hui)) is
((verb hji
ska) (noun ako hui)) blue ((adj hji hui))
=
LEXEME people
ARTNR 27406
DEF strong ((adj hji hui) (adv ly hui)) is ((verb hji ska) (verb
kos
hji)) man ((noun ako hui) (prop hui))
DEF beautiful ((adj hji hui) (adv ly hui)) women ((verb hji ska)
(verb
kos hji)) are ((noun ako hui) (prop hui)) successfull ((adj sko hui)
(adv ly
hui))
=
and with this program I can transform this corpus into this structure:
CL-USER 3 > (parse-corpus corpus)
(#S(CORPUS-DEFINITION LEXEME VEHICLE CLASSIFICATION M1X ARTNR 27405
DEFS
((#S(TOKEN NAME CAR ATTRIBUTES ((NOUN AUTOMOBLIE SKS HUI) (NOUN
VEHICLE SKS
HUI))) #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (NOUN AKO HUI)))
#S(TOKEN
NAME BLUE ATTRIBUTES ((ADJ HJI HUI)))))) #S(CORPUS-DEFINITION LEXEME
PEOPLE
CLASSIFICATION NIL ARTNR 27406 DEFS ((#S(TOKEN NAME BEAUTIFUL
ATTRIBUTES
((ADJ HJI HUI) (ADV LY HUI))) #S(TOKEN NAME WOMEN ATTRIBUTES ((VERB
HJI SKA)
(VERB KOS HJI))) #S(TOKEN NAME ARE ATTRIBUTES ((NOUN AKO HUI) (PROP
HUI)))
#S(TOKEN NAME SUCCESSFULL ATTRIBUTES ((ADJ SKO HUI) (ADV LY HUI))))
(#S(TOKEN NAME STRONG ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI)))
#S(TOKEN NAME
IS ATTRIBUTES ((VERB HJI SKA) (VERB KOS HJI))) #S(TOKEN NAME MAN
ATTRIBUTES
((NOUN AKO HUI) (PROP HUI)))))))
To make it more readable I write it like this (I put a # for the DEFS
part)
(#S(CORPUS-DEFINITION LEXEME VEHICLE CLASSIFICATION M1X ARTNR 27405
DEFS ((#
# #))) #S(CORPUS-DEFINITION LEXEME PEOPLE CLASSIFICATION NIL ARTNR
27406
DEFS ((# # # #) (# # #))))
Now I want to extract parts of the information. For instance I would
like to
have the defs for a given LEXEME ie I would like something like:
(find-defs people) --> ((#S(TOKEN NAME BEAUTIFUL ATTRIBUTES ((ADJ HJI
HUI)
(ADV LY HUI))) #S(TOKEN NAME WOMEN ATTRIBUTES ((VERB HJI SKA) (VERB
KOS
HJI))) #S(TOKEN NAME ARE ATTRIBUTES ((NOUN AKO HUI) (PROP HUI)))
#S(TOKEN
NAME SUCCESSFULL ATTRIBUTES ((ADJ SKO HUI) (ADV LY HUI)))) (#S(TOKEN
NAME
STRONG ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI))) #S(TOKEN NAME IS
ATTRIBUTES
((VERB HJI SKA) (VERB KOS HJI))) #S(TOKEN NAME MAN ATTRIBUTES ((NOUN
AKO
HUI) (PROP HUI)))))
Can this be done???
I know already that
(corpus-definition-defs (car (parse-corpus corpus)))) -->
((#S(TOKEN NAME CAR ATTRIBUTES ((NOUN AUTOMOBLIE SKS HUI) (NOUN
VEHICLE SKS
HUI))) #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (NOUN AKO HUI)))
#S(TOKEN
NAME BLUE ATTRIBUTES ((ADJ HJI HUI)))))
but this is the defs part of the first element in our corpus
(vehicle), the
question is how I can generalize this function to work for any given
lexeme?
tnx
ab talebi