Re: [graph-tool] How to access properties (stackoverflow

10 Apr 2012

      Hi Tiago,

Thanks for your advice - in particular, on using defaultdict, will try it out!

M

==========================================
Date: Mon, 09 Apr 2012 15:23:22 +0200
From: Tiago de Paula Peixoto <tiago@skewed.de>
To: Main discussion list for the graph-tool project
        <graph-tool@skewed.de>
Subject: Re: [graph-tool] How to access properties (stackoverflow
        question)?
Message-ID: <4F82E2CA.9030700@skewed.de>
Content-Type: text/plain; charset="utf-8"

On 04/09/2012 01:46 PM, Mikhail Spivakov wrote:
...
Hello Tiago and everyone,
I'd like to bring your attention to a question on stackoverflow, which
I'd also like to know the answer to:
http://stackoverflow.com/questions/9526827/graph-tool-how-to-access-properti...
I've posted an answer there.
...
In particular, this bit:
"How do I get an object [vertex] with a certain title back from the
graph ? One way seems to be to create a vertex filter using
graph.set_edge_filter(...) and apply that - which seems a pretty
expensive operation considering all I want is to get one single object
back. I really don't want to maintain my own object title / vertex
index mapping as IMO, that is one of the tasks of the graph".
Consider, for example, the following simple situation: I've got an
existing set of vertices and then obtain information on edges between
them from somewhere, which may sometimes also involve vertices not yet
in the graph. In networkx, I would simply do: add_edge(x,y), where x
and y are vertex names (that have or have not been defined a priori
using add_node(x)).
Is there a straightforward way to do the same in graph_tool?
This question is asked frequently, so I should probably put it in the
documentation.

Differently from networkx, graph-tool's vertices have only one canonical
"name", which is its index number. One may define arbitrary property
maps, but this mapping is inherently one-way only, since it does not
need to be unique. Thus, in order to find a vertex with a given
property, one needs to search the graph, or maintain a reverse mapping
by hand, with a dictionary object, for instance.

This way of doing things makes the underlying graph data structure very
straightforward, and allows for very efficient algorithms. On the other
hand, it is slightly less convenient for the user, if he needs to
constantly lookup vertices based on property values.

In my use, I've always been able to solve this by creating a single dict
object which implements the reverse mapping, and keeping it together
with the graph. Note that there is no easy way of implementing this in
the library itself, since the property values are often exposed in a
"bare bones" way as C++ vectors to the underlying algorithms, or as
Numpy arrays. In order for the mapping to be consistent, one would need
to check every time a value in the map has been changed, essentially
killing the performance of any algorithm which needs to modify property
values (basically all of them).

If you want to make the lookup more convenient, more or less in the way
you have described, could use a defaultdict, as follows,

    name_map = defaultdict(lambda: g.add_vertex())
    v = name_map["foo"]               # 'foo' gets created
    g.add_edge(v, name_map["bar"])    # 'bar' gets created

In my view, this is sufficiently straightforward... But people may have
different opinions.

Cheers,
Tiago

Mikhail Spivakov

tags

participants (1)