title | tags | authors | affiliations | date | bibliography | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LCIA Formatter |
|
|
|
28 April 2021 |
paper.bib |
Life Cycle Assessment (LCA) is an established and standardized methodology to comprehensively assess environmental and public health metrics across industries and products [@international_organization_for_standardization_enivronmental_2006]. The United States Environmental Protection Agency (USEPA) is developing an open source LCA tool ecosystem [@ingwersen_lca_2019]. The ecosystem includes tools to automate the creation of life cycle inventory (LCI) datasets, which account for flows to and from nature for steps across the life cycle of products or services, and tools for life cycle impact assessment (LCIA) to support classification and characterization of the cumulative LCI to potential impacts. Impacts are expressed via indicators, either midpoint or endpoint, corresponding to different points on the environmental cause-effect chain model [@unep_glam_2016]. This paper describes a USEPA LCA ecosystem tool 'LCIA formatter' that extracts LCIA information from original source methods and converts the data for interoperability with the rest of the USEPA LCA ecosystem tools.
A simplified algorithm for LCA is given in \autoref{eq:lca}, where
\begin{equation}\label{eq:lca} I = \sum(E*CF) \end{equation}
Both
LCA software generally include LCIA methods to provide impact assessment results for user-created and software-provided LCI, but the elementary flows used in these LCIA methods must be the same data objects as the flows in the LCI data to enable proper impact result calculations. A major challenge to reproducing LCA results and sharing LCA models across platforms is that there is no internationally common list of elementary flows used by either LCI, LCIA, or software providers. A critical review of elementary flow data used in LCA showed that flows in LCIA methods are the least clearly described and therefore least portable and machine-readable among those from LCI, LCIA and software providers [@edelen_critical_2018]. Furthermore, LCA software providers of both proprietary and open source software rarely provide an open process or external peer review of their incorporation of data from LCIA providers (Brightway2 is one exception [@Mutel2017]). This is one potential cause of discrepancies in results across LCA software.
The LCIA formatter is a specific solution to this problem. The LCIA formatter transparently acquires LCIA methods from original provider data portals, maps them to an authoritative flow list, and exports them in common data formats. The LCIA formatter v1.0 uses the Federal LCA Commons Elementary Flow List (FEDEFL) as the authoritative system of elementary flows [@edelen_federal_2019]. This system has been adopted by federal agencies in the U.S. to share data through the Federal LCA Commons (FLCAC). The automation of this process provided by the LCIA formatter is critical, because as elementary flows used in LCI are updated, the LCIA datasets should also be automatically updated, and vice versa, to facilitate their use in LCA.
The LCIA formatter code is written in Python 3 and created as a standard python package called lciafmt
that can be installed using pip. The LCIA formatter primarily uses pandas [@mckinney-proc-scipy-2010] for data wrangling, the Apache parquet format [@apache_pyarrow_2021] for local storage of processed datasets, and olca-ipc [@srocka_olca-ipc_2020] for writing data in a standard LCA data format. The code is stored on a USEPA GitHub repository and is publicly accessible.
The LCIA formatter accesses LCIA datasets directly from the data provider. These datasets are typically provided as Microsoft Excel or Access files. These are downloaded and saved in a temporary local cache. To support the specific functions necessary to access and parse individual methods, each method is processed within its own Python module. Flow names, indicators, characterization factors, and other metadata are compiled in a standard format. Adjustments are made as needed to improve consistency between indicators and across methods. This includes handling duplicate entries for the same elementary flow and data cleaning (such as cleaning string names, adjusting capitalization, formatting of CAS Registry Numbers). Additionally, the LCIA formatter supports the inclusion of non-specified secondary contexts (emission locations) where none are provided. Where methods provide both midpoint and endpoint categories within a single source, the LCIA formatter parses these indicators for separate use. Midpoint indicators are typically calculated at an intermediate point along the cause-effect model (such as final mass of substance in the environment), and endpoint indicators typically correspond to damage from the elementary flows (such as a disability-adjusted life year, or DALY) [@unep_glam_2016]. Therefore, midpoint and endpoint indicators should be evaluated separately. Finally, source flow data are mapped to elementary flows in the FEDEFL [@edelen_federal_2019], through mapping files provided within that package [@ingwersen_fedelemflowlist_2020]. These mapping files correspond to flow names and contexts to a common set of elementary flows generated for LCA modeling by the USEPA. Because flows are provided at more detailed contexts than exists in most LCIA methods currently, mapping to the FEDEFL typically expands the number of characterization factors available. Mapped methods are stored locally as parquet files for future access by LCIA formatter or other tools. Additionally, mapped methods can be exported in JSON-LD format for use in LCA software tools such as openLCA.
The LCIA formatter is structured to easily convert original source data from existing LCIA methods. Version 1.0 of the LCIA formatter converts three commonly used LCIA methods: TRACI 2.1 [@bare_traci_2011], ReCiPe2016 [@huijbregts_recipe_2017], and ImpactWorld+ [@bulle_impact_2019] (\autoref{tbl:methods}), which cover a variety of impact categories. Cumulative LCI indicators based on the FEDEFL are also available as an output of the LCIA formatter.
Method | Indicators | Characterization Factors (Source) | Characterization Factors (Mapped to FEDEFL) |
---|---|---|---|
TRACI2.1 | 9 midpoint | 33,429 | 200,686 |
ReCiPe2016 | 18 midpoint and 22 endpoint across three perspectives | 347,010 | 2,429,470 |
ImpactWorld+ | 18 midpoint and 26 endpoint | 87,804 | 457,983 |
FEDEFL Inventory Methods | 9 indicators | n.a. | 5,649 |
: Impact methods processed by lciafmt [@young_traciv21_2021; @young_recipe2016v11_2021; @young_impactworldv13_2021; @young_fedefl_2021].\label{tbl:methods}
USEPA's Tool for Reduction and Assessment of Chemicals and Other Impacts (TRACI) is widely used for LCA across the U.S. Federal Government and by U.S.-based LCA practitioners [@bare_traci_2011]. The LCIA formatter accesses the TRACI Excel-based source file and characterizes impacts across 9 midpoint indicators for use in U.S.-focused analyses. When available, flow characterization factors are mapped to the release context with the greatest level of detail. For example, an air emission may be linked to the specific release height and population density of release area. If secondary context data is unknown, the LCIA formatter generates an average factor across the possible contexts to ensure the flow will still be captured in model calculations.
ReCiPe 2016 characterizes impacts across 18 midpoint indicators and three perspectives: Individualist, Hierarchist, and Egalitarian [@huijbregts_recipe_2017]. The LCIA formatter generates endpoint impacts through a series of midpoint conversion factors provided for each indicator. As is done for TRACI, where characterization factors are not supplied for secondary contexts, an average factor across the possible contexts is generated. This ensures that users that do not specify a secondary context (e.g. emission to air with no indication of population density) can still obtain a characterization factor for a flow.
ImpactWorld+ v1.3 provides characterization factors for 44 indicators at midpoint or endpoint levels [@bulle_impact_2019]. ImpactWorld+ v1.3 is downloaded as an Access database and read into a pandas dataframe using pyodbc. Only elementary flows with global characterization factors are included at this time, as spatially explicit characterization factors (e.g., country) are not yet compatible with the FEDEFL. Context information is added for water scarcity and availability categories. Flowable name is applied as context for land occupation and transformation categories. Context descriptions are provided in the original source for all other categories.
The LCIA formatter generates life cycle inventory methods based on groups of elementary flows identified in the FEDEFL. For example, an inventory method for energy resource use represents the summation of all instances of these flows within a dataset. Where necessary unit conversions are applied to achieve a consistent indicator unit.
The LCIA formatter also includes a method-agnostic approach to convert indicators (midpoint or endpoint) to monetary values. The primary valuation is based on modified budget constraint modeling [@weidema_valuation_2009], updated to USD2014. A DALY is valued as the global average annual potential economic production per capita; the value of ecosystem damages is calculated from the ratio of global population to terrestrial surface area, and it is also validated based on environmental preservation expenditures in selected countries. The conversions between the different ecosystem impact indicators (e.g., PDF.m2.yr and species.yr) are based on the species density calculations from ReCiPe 2008 [@goedkoop_recipe_2009].
The LCIA methods generated by the LCIA formatter for use with the FEDEFL are hosted publicly on the FLCAC for use by LCA practitioners and researchers. These methods support LCAs performed by many parties, including member agencies for the Federal LCA Commons such as USEPA, U.S. Department of Energy, U.S. Department of Agriculture, U.S. Department of Defense, and others. These methods also enable impact assessment for researchers utilizing the US Life Cycle Inventory (USLCI) Database. LCIA methods from the LCIA formatter are also being used in the standard format as inputs into other USEPA LCA ecosystem tools, including useeior [@ingwersen_useeior_2021] and the Electricity Life Cycle Inventory (electricitylci) [@jamieson_electricitylci_2020]. The system was built to be flexible enough to support creating outputs for LCIA spatially-explicit characterization factors as those become more common. Users and LCA practitioners are encouraged to suggest additional LCIA methods or applications of the lciafmt as described in the package Wiki.
The creation of this software was funded by the SERDP-ESTCP research program under project WP-2757, and conducted via EPA contract EP-C-16-015, Task Order 68HERC19F0292 with ERG. The authors wish to acknowledge additional support from Ashley Edelen, Danny Hage, Troy Hottle, and Catherine Birney.
The U.S. Environmental Protection Agency, through its Office of Research and Development, conducted the research described herein under an approved Quality Assurance Project Plan (K-LRTD-0031522-QP-2). It has been subjected to the Agency’s peer and administrative review and has been approved for publication as an EPA document. Mention of trade names or commercial products does not constitute endorsement or recommendation for use.