From: ab talebi
Subject: extracting from def-structures
Date: 
Message-ID: <3cb6d481.98156423@news.uio.no>
Hi



I have this corpus which looks like this:



LEXEME         vehicle

CLASSIFICATION    M1x

ARTNR           27405

DEF     car ((noun automoblie sks hui)(noun vehicle sks hui)) is ((verb hji ska) (noun ako hui)) blue ((adj hji hui))

=

LEXEME         people

ARTNR           27406

DEF     strong ((adj hji hui) (adv ly hui)) is ((verb hji ska) (verb kos hji)) man ((noun ako hui) (prop hui))

DEF     beautiful ((adj hji hui) (adv ly hui)) women ((verb hji ska) (verb kos hji)) are ((noun ako hui) (prop hui)) successfull ((adj sko hui) (adv ly hui))

=



With my parser I can transform this corpus into the following structure:

CL-USER 3 > (parse-corpus corpus)

(#S(CORPUS-DEFINITION LEXEME VEHICLE CLASSIFICATION M1X ARTNR 27405
    DEFS ((#S(TOKEN NAME CAR ATTRIBUTES ((NOUN AUTOMOBLIE SKS HUI) (NOUN VEHICLE SKS HUI)))
           #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (NOUN AKO HUI)))
           #S(TOKEN NAME BLUE ATTRIBUTES ((ADJ HJI HUI))))))
 #S(CORPUS-DEFINITION LEXEME PEOPLE CLASSIFICATION NIL ARTNR 27406
    DEFS ((#S(TOKEN NAME BEAUTIFUL ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI)))
           #S(TOKEN NAME WOMEN ATTRIBUTES ((VERB HJI SKA) (VERB KOS HJI)))
           #S(TOKEN NAME ARE ATTRIBUTES ((NOUN AKO HUI) (PROP HUI)))
           #S(TOKEN NAME SUCCESSFULL ATTRIBUTES ((ADJ SKO HUI) (ADV LY HUI))))
          (#S(TOKEN NAME STRONG ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI)))
           #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (VERB KOS HJI)))
           #S(TOKEN NAME MAN ATTRIBUTES ((NOUN AKO HUI) (PROP HUI)))))))
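
The #S(...) notation above is how DEFSTRUCT instances are printed. The parser itself is not shown here, so the following is only a minimal sketch of structure definitions that would be consistent with that output. The slot names are read off the printed form; everything else (no slot options, no types, the *vehicle* variable) is an assumption made for illustration.

```lisp
;; Hypothetical definitions, reconstructed from the printed output only.
(defstruct corpus-definition
  lexeme          ; symbol, e.g. VEHICLE
  classification  ; symbol, or NIL when the entry has no CLASSIFICATION line
  artnr           ; integer article number
  defs)           ; list of definitions, each a list of TOKEN structures

(defstruct token
  name            ; symbol, e.g. CAR
  attributes)     ; list of attribute lists, e.g. ((ADJ HJI HUI))

;; The VEHICLE entry built by hand instead of by the parser.
(defparameter *vehicle*
  (make-corpus-definition
   :lexeme 'vehicle
   :classification 'm1x
   :artnr 27405
   :defs (list (list (make-token :name 'car
                                 :attributes '((noun automoblie sks hui)
                                               (noun vehicle sks hui)))
                     (make-token :name 'is
                                 :attributes '((verb hji ska) (noun ako hui)))
                     (make-token :name 'blue
                                 :attributes '((adj hji hui)))))))
```

DEFSTRUCT generates the accessors automatically, so (corpus-definition-lexeme *vehicle*) and (token-name ...) are available without further code. How the structure is printed (with or without keyword colons before the slot names) differs between implementations.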



To make it more readable, I abbreviate it like this (writing a # for each
token in the DEFS part):



(#S(CORPUS-DEFINITION LEXEME VEHICLE CLASSIFICATION M1X ARTNR 27405 DEFS ((# # #)))
 #S(CORPUS-DEFINITION LEXEME PEOPLE CLASSIFICATION NIL ARTNR 27406 DEFS ((# # # #) (# # #))))
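
The # abbreviation used above is the same notation the Lisp printer produces when *print-level* is bound, so a similar shortened view can be generated rather than typed by hand. A small sketch on plain nested lists follows; the helper name abbreviated-string is made up for this example, and exactly how #S(...) structures are cut off may vary between implementations.

```lisp
;; Hypothetical helper: print OBJECT to a string, replacing everything
;; nested DEPTH levels deep or deeper with #.
(defun abbreviated-string (object depth)
  (let ((*print-level* depth)
        (*print-pretty* nil))   ; keep the result on one line
    (prin1-to-string object)))

;; The outer list is level 0, (2 ...) is level 1, (3 ...) is level 2.
(abbreviated-string '(1 (2 (3 (4)))) 2)   ; => "(1 (2 #))"
```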





Now I want to extract parts of this information. For instance, I would
like to get the defs for a given LEXEME, i.e. something like:



(find-defs people) -->

((#S(TOKEN NAME BEAUTIFUL ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI)))
  #S(TOKEN NAME WOMEN ATTRIBUTES ((VERB HJI SKA) (VERB KOS HJI)))
  #S(TOKEN NAME ARE ATTRIBUTES ((NOUN AKO HUI) (PROP HUI)))
  #S(TOKEN NAME SUCCESSFULL ATTRIBUTES ((ADJ SKO HUI) (ADV LY HUI))))
 (#S(TOKEN NAME STRONG ATTRIBUTES ((ADJ HJI HUI) (ADV LY HUI)))
  #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (VERB KOS HJI)))
  #S(TOKEN NAME MAN ATTRIBUTES ((NOUN AKO HUI) (PROP HUI)))))



Can this be done?

I know already that

(corpus-definition-defs (car (parse-corpus corpus))) -->

((#S(TOKEN NAME CAR ATTRIBUTES ((NOUN AUTOMOBLIE SKS HUI) (NOUN VEHICLE SKS HUI)))
  #S(TOKEN NAME IS ATTRIBUTES ((VERB HJI SKA) (NOUN AKO HUI)))
  #S(TOKEN NAME BLUE ATTRIBUTES ((ADJ HJI HUI)))))



But this only gives the defs part of the first element of the corpus
(vehicle). How can I generalize this so that it works for any given
lexeme?
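
One possible shape for such a generalization, given only as a sketch: search the list of entries with FIND, using the lexeme accessor as the :KEY, and then take the DEFS slot of whatever is found. The structure below is a stand-in with the slot names seen in the printed output, *definitions* is toy data in place of (parse-corpus corpus), and the tokens are simplified to plain symbols; all of these are assumptions for illustration.

```lisp
;; Stand-in for the real structure (slot names assumed from the output).
(defstruct corpus-definition lexeme classification artnr defs)

(defun find-defs (lexeme definitions)
  "Return the DEFS of the first entry in DEFINITIONS whose LEXEME slot is
EQL to LEXEME, or NIL if no entry matches."
  (let ((entry (find lexeme definitions :key #'corpus-definition-lexeme)))
    (when entry
      (corpus-definition-defs entry))))

;; Toy data standing in for (parse-corpus corpus).
(defparameter *definitions*
  (list (make-corpus-definition :lexeme 'vehicle :classification 'm1x
                                :artnr 27405
                                :defs '((car is blue)))
        (make-corpus-definition :lexeme 'people :artnr 27406
                                :defs '((beautiful women are successfull)
                                        (strong is man)))))

(find-defs 'people *definitions*)
;; => ((BEAUTIFUL WOMEN ARE SUCCESSFULL) (STRONG IS MAN))
```

Note that the lexeme has to be quoted in the call, as in (find-defs 'people ...), because an unquoted symbol would be evaluated as a variable. The default EQL test is enough as long as the lexemes are symbols in the same package; if they were strings, a :TEST such as #'string-equal would be needed.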



tnx

ab talebi