dictionary idiom needed

Brandon · Dec 11, 2008

Hi all,

I have a series of lists in format ['word', 'tagA', 'tagB']. I have
converted this to a few dicts, such as one in which keys are tuples of
('word', 'tagB'), and the values are the number of times that key was
found. I need an dictionary idiom whereby I can find all instances of
a given 'word' with any 'tagB', and then subdivide into all instances
of a given 'tagB'. In both cases I would want the value as a count of
all instances found. Can this be done with dictionaries? Or should I
back up and do this with lists? All of the nested for loops I have
tried return replicated results, so I can't trust those values.

Thanks for any pointers,

Brandon

bearophileHUGS · Dec 11, 2008

Brandon:

I need an dictionary idiom whereby I can find all instances of
a given 'word' with any 'tagB', and then subdivide into all instances
of a given 'tagB'. In both cases I would want the value as a count of
all instances found.

If I have understood you well enough, I think you can do with a dict
that has the tuple ('word', 'tagB') as key, and as value has a
collections.defaultdict(int) that maps 'tagB' to its count.

Bye,
bearophile

bearophileHUGS · Dec 11, 2008

bearophile:

you can do with a dict
that has the tuple ('word', 'tagB') as key, and as value has a
collections.defaultdict(int) that maps 'tagB' to its count.

Where's 'tagA'?
Probably I haven't understood your problem well enough. I need a
better example of your data and what you need...

Sorry, bye,
bearophile

Brandon · Dec 11, 2008

Thanks bear -

Some outside advice has me looking at nested dictionaries. But I am
still bogged down because I've not created one before and all examples
I can find are simple ones where they are created manually, not with
loops. Maybe a further example:

data:
POS1 POS2 POS3
['word1','tagA','tagB']
['word2','tagC','tagD']
['word1','tagE','tagB']
['word1','tagC','tagF']

.... and so on. FWIW: I am guaranteed that the set of tags that may
occur in position2 is complementary to the set of tags that may occur
in position3.

Now I want to get an accounting of all the tags that occurred in
position3 in the context of, say, word1. Here I've shown that for
word1, tagB and tagF occurs. I want a way to access all this
information such that nested_dict['word1']['tagB'] = 2, and nested_dict
['word1']['tagF'] = 1.

As I mentioned, I already have dicts such that dictA['word1'] = 3, and
dictB['tagB'] = 2. I used defaultdict to build those, and that seems
to be causing me to get some funky values back from my initial
attempts to build a nested dictionary. I am stumbling at constructing
a "for" loop that automatically creates such nested dictionaries.

I hope that clears things up. If you still have any advice, it's much
appreciated.

Thanks,

Brandon

Brandon · Dec 11, 2008

Smells like homework without a particular application.

@Scott:

Even if that were the case

I'd still like to figure out how to
create nested dictionaries!

Brandon

Arnaud Delobelle · Dec 11, 2008

Brandon said:
Thanks bear -

Some outside advice has me looking at nested dictionaries. But I am
still bogged down because I've not created one before and all examples
I can find are simple ones where they are created manually, not with
loops. Maybe a further example:

data:
POS1 POS2 POS3
['word1','tagA','tagB']
['word2','tagC','tagD']
['word1','tagE','tagB']
['word1','tagC','tagF']

... and so on. FWIW: I am guaranteed that the set of tags that may
occur in position2 is complementary to the set of tags that may occur
in position3.

Now I want to get an accounting of all the tags that occurred in
position3 in the context of, say, word1. Here I've shown that for
word1, tagB and tagF occurs. I want a way to access all this
information such that nested_dict['word1']['tagB'] = 2, and nested_dict
['word1']['tagF'] = 1.

As I mentioned, I already have dicts such that dictA['word1'] = 3, and
dictB['tagB'] = 2. I used defaultdict to build those, and that seems
to be causing me to get some funky values back from my initial
attempts to build a nested dictionary. I am stumbling at constructing
a "for" loop that automatically creates such nested dictionaries.

I hope that clears things up. If you still have any advice, it's much
appreciated.

Thanks,

Brandon

from collections import defaultdict

data = [['word1','tagA','tagB'],

Click to expand...

Click to expand...

.... ['word2','tagC','tagD'],
.... ['word1','tagE','tagB'],
.... ['word1','tagC','tagF']].... d[w][t2] += 1
....

d['word1']['tagB'] 2
d['word2']['tagD'] 1
d['word2']['tagF']

Click to expand...

Click to expand...

0

HTH

Brandon · Dec 11, 2008

d = defaultdict(lambda: defaultdict(int))

Arnaud

Ah... so that's what lambdas are for. Many thanks!

Brandon

Help building a dictionary of lists	1	Nov 12, 2012
Advanced Dictionary	6	Jun 16, 2010
help needed with dictionary	3	Aug 29, 2008
Translater + module + tkinter	1	Feb 16, 2023
Zipping a dictionary whose values are lists	1	Apr 12, 2012
ANNOUNCE: Thesaurus - a recursive dictionary subclass using attributes	9	Dec 11, 2012
accessing dictionary keys	3	Oct 1, 2009
Over 30 types of variables available in python ?	0	Jan 6, 2013

dictionary idiom needed

Brandon

bearophileHUGS

bearophileHUGS

Brandon

Brandon

Arnaud Delobelle

Brandon

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads