diff --git a/_freeze/index/execute-results/docx.json b/_freeze/index/execute-results/docx.json index 615a2c6..c93913e 100644 --- a/_freeze/index/execute-results/docx.json +++ b/_freeze/index/execute-results/docx.json @@ -1,8 +1,8 @@ { - "hash": "2434aff51be677b3776a836877ec18a3", + "hash": "7cab29af2e6862d55b0f240d8ed3a19b", "result": { "engine": "jupyter", - "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#977539bd .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", + "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#00506001 .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Also I don't think I'm at the point where I can say that the table is complete/comprehensive but it getting there Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\nFor now the (still essentially pending) workflow/associated code can be found at the following repository [BecksLab/topology_generators](https://github.com/BecksLab/topology_generators)\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", "supporting": [ "index_files/figure-docx" ], diff --git a/_freeze/index/execute-results/html.json b/_freeze/index/execute-results/html.json index 9ab51c0..6ef4972 100644 --- a/_freeze/index/execute-results/html.json +++ b/_freeze/index/execute-results/html.json @@ -1,8 +1,8 @@ { - "hash": "2434aff51be677b3776a836877ec18a3", + "hash": "7cab29af2e6862d55b0f240d8ed3a19b", "result": { "engine": "jupyter", - "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#cdbaa601 .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", + "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#e55a06e1 .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Also I don't think I'm at the point where I can say that the table is complete/comprehensive but it getting there Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\nFor now the (still essentially pending) workflow/associated code can be found at the following repository [BecksLab/topology_generators](https://github.com/BecksLab/topology_generators)\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", "supporting": [ "index_files" ], diff --git a/_freeze/index/execute-results/tex.json b/_freeze/index/execute-results/tex.json index d1497d6..fe3d3e5 100644 --- a/_freeze/index/execute-results/tex.json +++ b/_freeze/index/execute-results/tex.json @@ -1,8 +1,8 @@ { - "hash": "2434aff51be677b3776a836877ec18a3", + "hash": "7cab29af2e6862d55b0f240d8ed3a19b", "result": { "engine": "jupyter", - "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#892f6a0c .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", + "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#32ce74bd .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Also I don't think I'm at the point where I can say that the table is complete/comprehensive but it getting there Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\nFor now the (still essentially pending) workflow/associated code can be found at the following repository [BecksLab/topology_generators](https://github.com/BecksLab/topology_generators)\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", "supporting": [ "index_files/figure-pdf" ], diff --git a/_freeze/index/execute-results/xml.json b/_freeze/index/execute-results/xml.json index ac280e5..16dd690 100644 --- a/_freeze/index/execute-results/xml.json +++ b/_freeze/index/execute-results/xml.json @@ -1,8 +1,8 @@ { - "hash": "2434aff51be677b3776a836877ec18a3", + "hash": "7cab29af2e6862d55b0f240d8ed3a19b", "result": { "engine": "jupyter", - "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#46d2896d .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", + "markdown": "---\ntitle: T is for Topology\nauthor:\n - name: Tanya Strydom\n orcid: 0000-0001-6067-1349\n corresponding: true\n email: t.strydom@sheffield.ac.uk\n roles:\n - Words\n - Nonsense\n - Rascality\n - Visualisation (although absent)\n affiliations:\n - University of Sheffield\n - name: Andrew P. Beckerman\n orcid: 0000-0002-7859-8394\n corresponding: false\n roles: []\n affiliations:\n - University of Sheffield\nkeywords:\n - food web\n - network construction\nabstract: |\n Pending...\nplain-language-summary: |\n We want to know a bit more about the different network topology generators (predict tools) and how they differ - *i.e.,* their strengths and weaknesses\nkey-points:\n - Important point 1\n - Important point 2\ndate: last-modified\nbibliography: references.bib\ncitation:\n container-title: Some fancy journal\nnumber-sections: true\n---\n\n:::{#24f7a3e7 .cell .markdown}\n## Introduction\n\nThe standard run of the mill that we cannot always feasibly construct networks because 1. hard, 2. time (yay dinosaurs, but also the future and impending doom I guess), and 3. probably something else meaningful that's just slipping my mind at the moment. Some of the usual culprits will come in here like: @jordanoSamplingNetworksEcological2016; [@jordanoChasingEcologicalInteractions2016]; @poisotGlobalKnowledgeGaps2021; @strydomRoadmapPredictingSpecies2021 (shameless self plug)\n\nMaybe a brief history of the development of predictive tools/topo generators? Sort of where the theory/body of work was based and how that has changed? IS there a difference between toppo generator and predictive tool - I'm inclined to think that it aligns with the whole debate of high level structure vs node-level perfection\n\nMaybe start here with discussing the core mechanistic differences that models will work at --- some are really concerned about (and thus constrained by) structure, others are more mechanistic in nature *i.e.,* species *a* has the capacity to eat species *b* because traits (read gob size), and then you get @rohrModelingFoodWebs2010 and @strydomFoodWebReconstruction2022 that sit in the weird liminal latent space... Here I will probably get on my (newly discovered) soapbox and wax lyrical about how in certain situations structure is enough (and that will probably be for some high-level things like thinking about energy flows etc., I can also see a world in which maybe you want to do some sort of robustness/extinction work - since then you're usually doing 'random' (within limits) extinctions) but there may be use cases where we are really interested in the node-level interactions *i.e.,* species identity is like a thing we need to care about and also be able to retrieve specific interactions at specific nodes correctly. \n\nAt some point we are going to need to discuss the key differences and implications between predicting a metaweb (*sensu* @dunneNetworkStructureFood2006) and a network realisation. And here I can't help but think about @poisotSpeciesWhyEcological2015 (and probably other papers) that discuss how the local factors are going to play a role and even the same pair of species may interact differently in different points in the landscape.\n\n> Do we need to delve into individual-based networks? (*sensu* Tinker 2012, Araújo 2008) I think its probably a step too far and one starts creeping into apples and pears type of comparisons. Especially since these work off of already existing networks (I seem to recall) and its more about about 'tweaking' those - so not so much *de novo* predictions. Although this might be useful to keep in mind when it comes to re-wiring... Also on that note do we opn the re-wiring door here in this ms or wait it out a bit.\n\n## Data & Methods {#sec-data-methods}\n\n### Overview of topology generators\n\nI know table are awful but in this case they may make more sense. Also I don't think I'm at the point where I can say that the table is complete/comprehensive but it getting there Not sure about putting in some papers that have used the model - totes happy to drop those I think...\n\n| Approach | Reference | Core Mechanism | *e.g.,* uses |\n|-------------------|----------------------------------------------|--------------------|--------------------|\n| Cascade model | @cohenCommunityFoodWebs1990; @cohenStochasticTheoryCommunity1997 | structural | |\n| Niche model | @williamsSimpleRulesYield2000; @williamsSuccessItsLimits2008 | structural | |\n| PFIM | @shawFrameworkReconstructingAncient2024 | mechanistic | Dunhill (in review)|\n| Log-ratio | @rohrModelingFoodWebs2010 | latent trait space | @yeakelCollapseEcologicalNetwork2014, @piresMegafaunalExtinctionsHuman2020 (?)|\n| Nested hierarchy | @cattinPhylogeneticConstraintsAdaptation2004 | | |\n| ADBM | @petcheySizeForagingFood2008 | mechanistic | probably multiple |\n| Stochastic | @rossbergFoodWebsExperts2006 | | |\n| Graph Embedding | @strydomGraphEmbeddingTransfer2023 | latent trait space | @strydomFoodWebReconstruction2022 |\n| Trait-based | @caronAddressingEltonianShortfall2022 | mechanistic | @caronTrophicInteractionModels2023|\n\n: Lets make a table that gives an overview of the different topology generators that we will look at {#tbl-history}\n\n> Might be nice to have a little appendix/supp mat that breaks down the models in detail so that they are all in one place so that someone (grad student being told they need to build networks) some day can go and educate themselves with slightly lower effort. This will also be useful for me should I end up having to do some actual coding - think of this as step one in the pseudo code process.\n\n### Datasets used\n\nHere I think we need to span a variety of domains, at minimum aquatic and terrestrial but maybe there should be a 'scale' element as well *i.e.,* a regional and local network. I think there is going to be a 'turning point' where structural will take over from mechanistic in terms of performance. More specifically at local scales bioenergetic constraints (and co-occurrence) may play a bigger role in structuring a network whereas at the metaweb level then mechanistic may make more (since by default its about who can potentially interact and obviously not constrained by real-world scenarios) *sensu* @caronTrophicInteractionModels2023. Although having said that I feel that contradicts the idea of backbones (*sensu* Bramon Mora (sp?) et al & Stouffer et al) But that might be where we get the idea of core *structure* vs something like linkage density. So core things like trophic level/chain length will be conserved but connectance might not (I think I understand what I'm trying to say here)\n\nI think we should also use the Dunne (I think) Cambrian (also think) network (I was correct and its this one @dunneCompilationNetworkAnalyses2008). Because 1) it gives the paleo-centric methods their moment in the sun and 2) I think it also brings up the interesting question of can we use modern structure to predict past ones? Here one might expect a more mechanistic approach to shine.\n\nDraw the other datasets from `Mangal` because they will be nicely formatted and essentially at point and shoot level\n\n### Comparing different models\n\nFor now the (still essentially pending) workflow/associated code can be found at the following repository [BecksLab/topology_generators](https://github.com/BecksLab/topology_generators)\n\n1. Shortlist/finalise the different topo generators\n2. collate/translate into `Julia`\n * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model\n * others will need to be coded out (the more simpler models should be easier)\n * can also consider `R` but then it becomes a case of porting things left and right depending on how we decide to do the post analyses\n3. Curate networks for the different datasets/scenarios we select - I feel like there might be some scenarios that we can't do all models for all datasets but maybe I'm being a pessimist.\n * Need to also think about where one might find the additional data for some of the models...\n * Body size: @herbersteinAnimalTraitsCuratedAnimal2022 - Although maybe Andrew has strong thotsTM RE the one true body size database to rule them all...\n * Other trait sources: @wilmanEltonTraitsSpecieslevelForaging2014 and @jonesPanTHERIASpecieslevelDatabase2009\n * This is where we'll get the paleo traits from if I'm correct @bambachAutecologyFillingEcospace2007\n * Phylogeny stuff: @uphamInferringMammalTree2019 (what we used for TL but its only mammals...) but I'm sure there will be others\n * Also limitation of scope... *e.g.,* do we even dare to think about including plants/basal producers (see *e.g.,* @valdovinosBioenergeticFrameworkAboveground2023)\n * Taxonomic harmonisation - something to think about and check\n4. compare model performance based on the ideas currently listed in the results section.\n5. Make a pretty picture that summarises things - maybe overlapping Venn circles that showcase which models do well in the different spheres/aspects of life\n\n## Results\n\nHow we want to compare and contrast. I think there won't be a 'winner' and thus we need to think of 'tests' that are going to measure performance in different situations/settings. With that in mind I think some valuable points to consider would be:\n\n* Structural vs pairwise link predictions (graph vs node level)\n * % of links correctly retrieved\n * connectance\n * trophic level\n * generalism vs specialism\n * something related to false positives/negatives\n * intervality\n* Data 'cost' (some methods might need a lot lot of supporting data vs something very light weight)\n* I think it would be remiss to not also take into consideration computational cost\n* something about the network output - I'm acknowledging my biases and saying that probabilistic (or *maybe* weighted) links are the way\n\n> maybe we can put these into broader categories - if we do start doing the venn overlap thing. *E.g.,* local scale predictions, regional scale predictions, pairwise interactions, structural (energetics), computationally cheap, low cost data\n\n## Discussion\n\nI think a big take home will (hopefully) be how different approaches do better in different situations and so you as an end user need to take this into consideration and pick accordingly. I think @petcheyFitEfficiencyBiology2011 might have (and share) some thoughts on this (thanks Andrew). I feel like I need to look at @berlowGoldilocksFactorFood2008 but maybe not exactly in this context but vaguely adjacent.\n\nAn interesting thing to also think about (and arguably it will be addressed based on some of the other thoughts and ideas) is data dependant and data independent 'parametrisation' of the models...\n\nI probably think about this point too much but a point of discussion that I think will be interesting to bring up the idea that if a model is missing a specific pairwise link but doing well at the structural level then when does it matter? I think this is covered with the whole node vs graph level performance but I kind of just want to bring it up here again because also one of those things that I think about a bit too much probably...\n\n> Thinking very long term here and maybe a bit beyond the scope but also thinking about a multi- model approach? So in other words using one model to build an initial network but maybe a second one to constrain it a bit better. I blame this thought on the over-connected PFIM food webs...\n\n## References {.unnumbered}\n\n::: {#refs}\n:::\n:::\n\n", "supporting": [ "index_files/figure-jats" ], diff --git a/index.qmd b/index.qmd index 3742969..9986645 100644 --- a/index.qmd +++ b/index.qmd @@ -52,7 +52,7 @@ At some point we are going to need to discuss the key differences and implicatio ### Overview of topology generators -I know table are awful but in this case they may make more sense. Not sure about putting in some papers that have used the model - totes happy to drop those I think... +I know table are awful but in this case they may make more sense. Also I don't think I'm at the point where I can say that the table is complete/comprehensive but it getting there Not sure about putting in some papers that have used the model - totes happy to drop those I think... | Approach | Reference | Core Mechanism | *e.g.,* uses | |-------------------|----------------------------------------------|--------------------|--------------------| @@ -80,6 +80,8 @@ Draw the other datasets from `Mangal` because they will be nicely formatted and ### Comparing different models +For now the (still essentially pending) workflow/associated code can be found at the following repository [BecksLab/topology_generators](https://github.com/BecksLab/topology_generators) + 1. Shortlist/finalise the different topo generators 2. collate/translate into `Julia` * *e.g.,* some models wil be in SpeciesInteractionNetworks.jl (new EcoNet); I know (parts of) the transfer learning stuff is and the niche model