Request for advice: sbml+xml Media Type

5 Jul 2003

      Hi,  

I work on SBML (Systems Biology Markup Language) at Caltech.  We  
are thinking of proposing a new XML MIME media type.  

When learning about the process for getting a new IETF-tree media  
type approved, I was strongly advised to consult the members of  
the ietf-types and ietf-xml-mime discussion lists for advice  
before diving in.  So here I am.  I'll appreciate any advice you  
can give me, especially any that saves us from making some stupid  
mistakes that the biological modeling community will regret for  
years to come.  Please forgive me if I reveal my ignorance in  
some questions below.  

First, a little background.  SBML is an XML format for  
representing systems of biochemical reactions.  Making it a MIME  
media type would enable browser-based simulation tools to  
conveniently download, run, and edit models.  Work is now  
beginning on a web infrastructure to make it easy for biological  
researchers to share models on the web, download models used in  
published papers, etc.  

Two "levels" of SBML have been defined so far.  Specifications,  
including the XML schemas, are at:  

http://www.sbw-sbml.org/sbml/docs/papers/sbml-level-1-version-2/sbml-level-1...  
http://www.sbw-sbml.org/sbml/docs/papers/sbml-level-2-version-1/sbml-level-2...  

We are thinking that the ideal name would be either:  

   application/sbml+xml  

or:  

   model/sbml+xml  

Now, here are a few questions.  

1. Would it be a bad idea if we used RFC3236 (The  
application/xhtml+xml Media Type) as a model for the document w  
write?  I'm hoping that we don't need to explain the full  
semantics of SBML in the RFC, since there are already some  
weighty papers that do that (referenced above).  At only 8 pages,  
RFC3236 seems like a model of simplicity and clarity that we  
would like to emulate.  Or is it possible to get even simpler?  
Some of the docs I found for XML MIME media types seemed to do  
little more than list the name of the type and who submitted it.  

2. We are thinking of including required parameters of "level"  
and "version".  Anything to watch out for here?  Is this a wrong  
idea?  SBML has multiple levels to enable different simulation  
tools to interoperate at different levels of complexity and  
sophistication.  Each level can come in different versions.  More  
levels are planned.  

3. Is it completely stupid to even consider model/sbml+xml?  The  
other model/ media types have been for spatial models.  SBML is  
primarily used for spatial models of reactions that occur within  
biological cells, and has some notions of spatial relation, but  
an SBML model does not necessarily have the minimum 3 orthogonal  
dimensions specified in RFC2077.  We're wondering if SBML is  
still within the spirit of the model/ top-level content type,  
though.  RFC2077 speaks of economic models, behavioral models,  
and so on, and seems to encourage a situation where modeling  
tools might work successfully on models from radically different  
domains.  

4. Any other advice you'd care to offer?  

Thanks in advance for your assistance,  

Ben  

--  
Ben Kovitz  
Systems Biology Workbench (SBW) Development Group, Caltech  
http://www.sbw-sbml.org  
bkovitz at caltech.edu

Ben Kovitz

Mark Baker

Linus Walleij

tags

participants (3)