Introduction
This Ecma Standard defines the Source Map Format, used for mapping transpiled source code back to the original sources.
The source map format has the following goals:
-
Support source-level debugging allowing bidirectional mapping
-
Support server-side stack trace deobfuscation
The document at https://tc39.es/source-map/ is the most accurate and up-to-date source map specification. It contains the content of the most recently published snapshot plus any modifications that will be included in the next snapshot.
If you want to get involved you will find more information at the specification repository.
The original source map format (v1) was created by Joseph Schorr for use by Closure Inspector to enable source-level debugging of optimized JavaScript code (although the format itself is language agnostic). However, as the size of the projects using source maps expanded, the verbosity of the format started to become a problem. The v2 format [V2Format] was created by trading some simplicity and flexibility to reduce the overall size of the source map. Even with the changes made with the v2 version of the format, the source map file size was limiting its usefulness. The v3 format is based on suggestions made by Pavel Podivilov (Google).
The source map format does not have version numbers anymore, and it is instead hard-coded to always be "3".
1. Scope
This Standard defines the source map format, used by different types of developer tools to improve the debugging experience of code compiled to JavaScript, WebAssembly, and CSS.
2. Conformance
A conforming source map document is a JSON document that conforms to the structure detailed in this specification.
A conforming source map generator should generate documents which are conforming source map documents, and can be decoded by the algorithms in this specification without reporting any errors (even those which are specified as optional).
A conforming source map consumer should implement the algorithms specified in this specification for retrieving (where applicable) and decoding source map documents. A conforming consumer is permitted to ignore errors or report them without terminating where the specification indicates that an algorithm may optionally report an error.
3. References
Normative References
- [ECMASCRIPT]
- ECMAScript Language Specification. URL: https://tc39.es/ecma262/multipage/
- [ENCODING]
- Anne van Kesteren. Encoding Standard. Living Standard. URL: https://encoding.spec.whatwg.org/
- [FETCH]
- Anne van Kesteren. Fetch Standard. Living Standard. URL: https://fetch.spec.whatwg.org/
- [INFRA]
- Anne van Kesteren; Domenic Denicola. Infra Standard. Living Standard. URL: https://infra.spec.whatwg.org/
- [URL]
- URL Standard. Living Standard. URL: https://url.spec.whatwg.org/
- [WEBIDL]
- Edgar Chen; Timothy Gu. Web IDL Standard. Living Standard. URL: https://webidl.spec.whatwg.org/
Informative References
- [BASE64]
- The Base16, Base32, and Base64 Data Encodings. Standards Track. URL: https://www.ietf.org/rfc/rfc4648.txt
- [ECMA-262]
- ECMAScript® Language Specification. Standards Track. URL: https://tc39.es/ecma262/
- [EvalSourceURL]
- Give your eval a name with //@ sourceURL. archive. URL: https://web.archive.org/web/20120814122523/http://blog.getfirebug.com/2009/08/11/give-your-eval-a-name-with-sourceurl/
- [V2Format]
- Source Map Revision 2 Proposal. URL: https://docs.google.com/document/d/1xi12LrcqjqIHTtZzrzZKmQ3lbTv9mKrN076UB-j3UZQ/edit?hl=en_US
- [VLQ]
- Variable-length quantity. reference article. URL: https://en.wikipedia.org/wiki/Variable-length_quantity
- [WasmNamesBinaryFormat]
- WebAssembly Names binary format. Living Standard. URL: https://www.w3.org/TR/wasm-core-2/#names%E2%91%A2
4. Terms and definitions
For the purposes of this document, the following terms and definitions apply.
- Generated Code
-
The code which is generated by the compiler or transpiler.
- Original Source
-
The source code which has not been passed through the compiler.
- Base64 VLQ [VLQ]
-
A [base64] value, where the most significant bit (the 6th bit) is used as the continuation bit, and the "digits" are encoded into the string least significant first, and where the least significant bit of the first digit is used as the sign bit.
Note: The values that can be represented by the VLQ Base64 encoded are limited to 32-bit quantities until some use case for larger values is presented. This means that values exceeding 32-bits are invalid and implementations may reject them. The sign bit is counted towards the limit, but the continuation bits are not.
The string
represents a Base64 VLQ with two digits. The first digit"iB"
encodes the bit pattern"i"
, which has a continuation bit of0x100010
(the VLQ continues), a sign bit of1
(non-negative), and the value bits0
. The second digit0x0001 B
encodes the bit pattern
, which has a continuation bit of0x000001
, no sign bit, and value bits0
. The decoding of this VLQ string is the number 17.0x00001 The string
represents a Base64 VLQ with one digit. The digit"V"
encodes the bit pattern"V"
, which has a continuation bit of0x010101
(no continuation), a sign bit of0
(negative), and the value bits1
. The decoding of this VLQ string is the number -10.0x1010 - Source Mapping URL
-
The URL referencing the location of a source map from the Generated code.
- Column
-
The zero-based indexed offset within a line of the generated code. How this offset is measured can depend on the content type. For JavaScript and CSS based source maps, they are defined to be in UTF-16 code units analogous to JavaScript string indices. That means that "A" (
LATIN CAPITAL LETTER A
) measures as 1 code unit, and "🔥" (FIRE
) measures as 2 code units. For WebAssembly, columns are defined as byte offsets from the beginning of the binary content (and there is only one group representing a line). Source maps for other content types may diverge from this.
5. Source Map Format
A source map is a JSON document containing a top-level JSON object with the following structure:
{ "version" : 3 , "file" : "out.js" , "sourceRoot" : "" , "sources" : [ "foo.js" , "bar.js" ], "sourcesContent" : [ null , null ], "names" : [ "src" , "maps" , "are" , "fun" ], "mappings" : "A,AAAB;;ABCDE" "ignoreList" : [ 0 ] }
-
version
is the version field which must always be the number
as an integer. The source map may be rejected if the field has any other value.3 -
file
is an optional name of the generated code that this source map is associated with. It’s not specified if this can be a URL, relative path name, or just a base name. Source map generators may choose the appropriate interpretation for their contexts of use. -
sourceRoot
is an optional source root string, used for relocating source files on a server or removing repeated values in the sources entry. This value is prepended to the individual entries in the sources field. -
sources
is a list of original sources used by the mappings entry. Each entry is either a string that is a (potentially relative) URL or
if the source name is not known.null -
sourcesContent
is an optional list of source content (i.e., the Original Source) strings, used when the source cannot be hosted. The contents are listed in the same order as the sources. Entries may be
if some original sources should be retrieved by name.null -
names
is an optional list of symbol names which may be used by the mappings entry. -
mappings
is a string with the encoded mapping data (see § 5.1 Mappings Structure). -
ignoreList
is an optional list of indices of files that should be considered third party code, such as framework code or bundler-generated code. This allows developer tools to avoid code that developers likely don’t want to see or step through, without requiring developers to configure this beforehand. It refers to the sources array and lists the indices of all the known third-party sources in the source map. Some browsers may also use the deprecatedx_google_ignoreList
field ifignoreList
is not present.
A decoded source map is a struct with the following fields:
- file
- A string or null.
- sources
- A list of decoded sources.
- mappings
- A list of decoded mappings.
A decoded source is a struct with the following fields:
- URL
- A URL or null.
- content
- A string or null.
- ignored
- A boolean.
To decode a source map from a JSON string str given a URL baseURL, run the following steps:
-
Let jsonMap be the result of parsing a JSON string to an Infra value str.
-
If jsonMap is not a map, report an error and abort these steps.
-
Decode a source map given jsonMap and baseURL, and return its result if any.
To decode a source map given a string-keyed map jsonMap and a URL baseURL, run the following steps:
-
If jsonMap[
] does not exist or jsonMap["version"
] is not 3, optionally report an error."version" -
If jsonMap[
] does not exist or jsonMap["mappings"
], is not a string, throw an error."mappings" -
If jsonMap[
] does not exist or jsonMap["sources"
], is not a list, throw an error."sources" -
Let sourceMap be a new decoded source map.
-
Set sourceMap’s file to optionally get a string
from jsonMap."file" -
Set sourceMap’s sources to the result of decoding source map sources given baseURL with:
-
sourceRoot set to optionally get a string
from jsonMap;"sourceRoot" -
sources set to optionally get a list of optional strings
from jsonMap;"sources" -
sourcesContent set to optionally get a list of optional strings
from jsonMap;"sourcesContent" -
ignoredSources set to optionally get a list of array indexes
from jsonMap."ignoreList"
-
-
Set sourceMap’s mappings to the result of decoding source map mappings with:
-
mappings set to jsonMap[
];"mappings" -
names set to optionally get a list of strings
from jsonMap;"names"
-
-
Return sourceMap.
To optionally get a string key from a string-keyed map jsonMap, run the following steps:
-
If jsonMap[key] does not exist, return null.
-
If jsonMap[key] is not a string, optionally report an error and return null.
-
Return jsonMap[key].
To optionally get a list of strings key from a string-keyed map jsonMap, run the following steps:
-
If jsonMap[key] is not a list, optionally report an error and return a new empty list.
-
Let list be a new empty list.
-
For each jsonItem of jsonMap[key]:
-
Else, optionally report an error and append
to list.""
-
Return list.
To optionally get a list of optional strings key from a string-keyed map jsonMap, run the following steps:
-
If jsonMap[key] is not a list, optionally report an error and return a new empty list.
-
Let list be a new empty list.
-
For each jsonItem of jsonMap[key]:
-
Else,
-
If jsonItem is not null, optionally report an error.
-
Append null to list.
-
-
Return list.
To optionally get a list of array indexes key from a string-keyed map jsonMap, run the following steps:
-
If jsonMap[key] is not a list, optionally report an error and return a new empty list.
-
Let list be a new empty list.
-
For each jsonItem of jsonMap[key]:
-
If jsonItem is a non-negative integer number, append it to list.
-
Else,
-
If jsonItem is not null, optionally report an error.
-
Append null to list.
-
-
-
Return list.
To optionally report an error, implementations can choose to:
-
Do nothing.
-
Report an error to the user, and continue processing.
-
Throw an error to abort the running algorithm. (Infra § 3.6 Control flow)
5.1. Mappings Structure
The mappings data is broken down as follows:
-
each group representing a line in the generated file is separated by a semicolon (
); -
each segment is separated by a comma (
), -
each segment is made up of 1, 4, or 5 variable length fields.
The fields in each segment are:
-
The zero-based starting column of the line in the generated code that the segment represents. If this is the first field of the first segment, or the first segment following a new generated line (
), then this field holds the whole Base64 VLQ. Otherwise, this field contains a Base64 VLQ that is relative to the previous occurrence of this field. Note that this is different from the subsequent fields below because the previous value is reset after every generated line.; -
If present, the zero-based index into the sources list. This field contains a Base64 VLQ relative to the previous occurrence of this field, unless it is the first occurrence of this field, in which case the whole value is represented.
-
If present, the zero-based starting line in the original source. This field contains a Base64 VLQ relative to the previous occurrence of this field, unless it is the first occurrence of this field, in which case the whole value is represented. Must be present if there is a source field.
-
If present, the zero-based starting column of the line in the original source. This field contains a Base64 VLQ relative to the previous occurrence of this field, unless it is the first occurrence of this field, in which case the whole value is represented. Must be present if there is a source field.
-
If present, the zero-based index into the names list associated with this segment. This field contains a Base64 VLQ relative to the previous occurrence of this field, unless it is the first occurrence of this field, in which case the whole value is represented.
Note: The purpose of this encoding is to reduce the source map size. VLQ encoding reduced source maps by 50% relative to the [V2Format] in tests performed using Google Calendar.
Note: Segments with one field are intended to represent generated code that is unmapped because there is no corresponding original source code, such as code that is generated by a compiler. Segments with four fields represent mapped code where a corresponding name does not exist. Segments with five fields represent mapped code that also has a mapped name.
Note: Using file offsets was considered but rejected in favor of using line/column data to avoid becoming misaligned with the original due to platform-specific line endings.
A decoded mapping is a struct with the following fields:
- generatedLine
- A non-negative integer.
- generatedColumn
- A non-negative integer.
- originalSource
- A decoded source or null.
- originalLine
- A non-negative integer or null.
- originalColumn
- A non-negative integer or null.
- name
- A string or null.
To decode source map mappings given a string mappings, a list of strings names, and a list of decoded sources sources, run the following steps:
-
Validate base64 VLQ groupings with mappings.
-
Let decodedMappings be a new empty list.
-
Let groups be the result of strictly splitting mappings on
.; -
Let generatedLine be 0.
-
Let originalLine be 0.
-
Let originalColumn be 0.
-
Let sourceIndex be 0.
-
Let nameIndex be 0.
-
While generatedLine is less than groups’s size:
-
If groups[generatedLine] is not the empty string, then:
-
Let segments be the result of strictly splitting groups[generatedLine] on
., -
Let generatedColumn be 0.
-
For each segment in segments:
-
Let position be a position variable for segment, initially pointing at segment’s start.
-
Decode a base64 VLQ from segment given position and let relativeGeneratedColumn be the result.
-
If relativeGeneratedColumn is null, optionally report an error and continue with the next iteration.
-
Increase generatedColumn by relativeGeneratedColumn. If the result is negative, optionally report an error and continue with the next iteration.
-
Let decodedMapping be a new decoded mapping whose generatedLine is generatedLine, generatedColumn is generatedColumn, originalSource is null, originalLine is null, originalColumn is null, and name is null.
-
Append decodedMapping to decodedMappings.
-
Decode a base64 VLQ from segment given position and let relativeSourceIndex be the result.
-
Decode a base64 VLQ from segment given position and let relativeOriginalLine be the result.
-
Decode a base64 VLQ from segment given position and let relativeOriginalColumn be the result.
-
If relativeOriginalColumn is null, then:
-
If relativeSourceIndex is not null, optionally report an error.
-
Continue with the next iteration.
-
-
Increase sourceIndex by relativeSourceIndex.
-
Increase originalLine by relativeOriginalLine.
-
Increase originalColumn by relativeOriginalColumn.
-
If any of sourceIndex, originalLine, or originalColumn are less than 0, or if sourceIndex is greater than or equal to sources’s size, optionally report an error.
-
Else,
-
Set decodedMapping’s originalSource to sources[sourceIndex].
-
Set decodedMapping’s originalLine to originalLine.
-
Set decodedMapping’s originalColumn to originalColumn.
-
-
Decode a base64 VLQ from segment given position and let relativeNameIndex be the result.
-
If relativeNameIndex is not null, then:
-
Increase nameIndex by relativeNameIndex.
-
If nameIndex is negative or greater than or equal to names’s size, optionally report an error.
-
Else, set decodedMapping’s name to names[nameIndex].
-
-
If position does not point to the end of segment, optionally report an error.
-
-
-
Increase generatedLine by 1.
-
-
Return decodedMappings.
To validate base64 VLQ groupings from a string groupings, run the following steps:
-
If groupings is not an ASCII string, throw an error.
-
If groupings contains any code unit other than:
-
U+002C (,) or U+003B (;);
-
U+0030 (0) to U+0039 (9);
-
U+0041 (A) to U+005A (Z);
-
U+0061 (a) to U+007A (z);
-
U+002B (+), U+002F (/)
NOTE: These are the valid [base64] characters (excluding the padding character
), together with=
and,
.; then throw an error.
-
To decode a base64 VLQ from a string segment given a position variable position, run the following steps:
-
If position points to the end of segment, return null.
-
Let first be a byte whose the value is the number corresponding to segment’s positionth code unit, according to the [base64] encoding.
NOTE: The two most significant bits of first are 0.
-
Let sign be 1 if first & 0x01 is 0x00, and -1 otherwise.
-
Let value be (first >> 1) & 0x0F, as a number.
-
Let nextShift be 16.
-
Let currentByte be first.
-
While currentByte & 0x20 is 0x20:
-
Advance position by 1.
-
If position points to the end of segment, throw an error.
-
Set currentByte to the byte whose the value is the number corresponding to segment’s positionth code unit, according to the [base64] encoding.
-
Let chunk be currentByte & 0x1F, as a number.
-
Add chunk * nextShift to value.
-
If value is greater than or equal to 231, throw an error.
-
Multiply nextShift by 32.
-
-
Advance position by 1.
-
If value is 0 and sign is -1, return -2147483648.
NOTE: -2147483648 is the smallest 32-bit signed integer.
-
Return value * sign.
NOTE: In addition to returning the decoded value, this algorithm updates the position variable in the calling algorithm.
5.1.1. Mappings for generated JavaScript code
Generated code positions that may have mapping entries are defined in terms of input elements as per ECMAScript Lexical Grammar. Mapping entries must point to either:
-
the first code point of the source text matched by IdentifierName, PrivateIdentifier, Punctuator, DivPunctuator, RightBracePunctuator, NumericLiteral and RegularExpressionLiteral.
-
any code point of the source text matched by Comment, HashbangComment, StringLiteral, Template, TemplateSubstitutionTail, WhiteSpace and LineTerminator.
5.1.2. Names for generated JavaScript code
Source map generators should create a mapping entry with a name field for a JavaScript token, if
-
The original source language construct maps semantically to the generated JavaScript code.
-
The original source language construct has a name.
Then the name of the mapping entry should be the name of the original source language construct. A mapping with a non-null name is called a named mapping.
Example: A minifier renaming functions and variables or removing function names from immediately invoked function expressions.
The following enumeration lists productions of the ECMAScript Syntactic Grammar and the respective token or non-terminal (on the right-hand side of the production) for which source map generators should emit a named mapping. The mapping entry created for such tokens must follow § 5.1.1 Mappings for generated JavaScript code.
The enumeration should be understood as the "minimum". In general, source map generators are free to emit any additional named mappings.
Note: The enumeration also lists tokens where generators "may" emit named mappings in addition
to the tokens where they "should". These reflect the reality where existing tooling emits or
expects named mappings. The duplicated named mapping is comparably cheap: Indices into names are encoded relative to each other so subsequent mappings to the same name are
encoded as 0 (A
).
-
The BindingIdentifier(s) for LexicalDeclaration, VariableStatement and Parameter List.
-
The BindingIdentifier for FunctionDeclaration, FunctionExpression, AsyncFunctionDeclaration, AsyncFunctionExpression, GeneratorDeclaration, GeneratorExpression, AsyncGeneratorDeclaration, and AsyncGeneratorExpression if it exists, or the opening parenthesis
preceding the FormalParameters otherwise.( Source map generators may chose to emit a named mapping on the opening parenthesis regardless of the presence of the BindingIdentifier.
-
For an ArrowFunction or AsyncArrowFunction:
-
The
token where ArrowFunction is produced with a single BindingIdentifier for ArrowParameters. Or AsyncArrowFunction is produced with an AsyncArrowBindingIdentifier.=> Note: This describes the case of (async) arrow functions with a single parameter, where that single parameter is not wrapped in parenthesis.
-
The opening parenthesis
where ArrowFunction or AsyncArrowFunction is produced with ArrowFormalParameters.( Source map generators may chose to additionally emit a named mapping on the
token for consistency with (1).=>
-
-
The ClassElementName for MethodDefinition. This includes generators, async methods, async generators and accessors. For MethodDefinition where ClassElementName is "constructor", the name should be the original class name if applicable.
Source map generators may chose to additionally emit a named mapping on the opening parenthesis
.( -
Source map generators may emit named mappings for IdentifierReference in Expression.
5.2. Resolving Sources
If the sources are not absolute URLs after prepending the sourceRoot, the sources are
resolved relative to the SourceMap (like resolving the script src
attribute in an HTML document).
To decode source map sources given a URL baseURL, a string or null sourceRoot, a list of either strings or nulls sources, a list of either strings or nulls sourcesContent, and a list of numbers ignoredSources, run the following steps:
-
Let decodedSources be a new empty list.
-
Let sourceURLPrefix be "".
-
If sourceRoot is not null, then:
-
If sourceRoot contains the code point U+002F (/), then:
-
Let index be the index of the last occurrence of U+002F (/) in sourceRoot.
-
Set sourceURLPrefix to the substring of sourceRoot from 0 to index + 1.
-
-
Else, set sourceURLPrefix to the concatenation of sourceRoot and "/".
-
-
For each source of sources with index index:
-
Let decodedSource be a new decoded source whose URL is null, content is null, and ignored is false.
-
If source is not null:
-
Set source to the concatenation of sourceURLPrefix and source.
-
Let sourceURL be the result of URL parsing source with baseURL.
-
If sourceURL is failure, optionally report an error.
-
Else, set decodedSource’s URL to sourceURL.
-
-
If index is in ignoredSources, set decodedSource’s ignored to true.
-
If sourcesContent’s size is greater than or equal to index, set decodedSource’s content to sourcesContent[index].
-
Append decodedSource to decodedSources.
-
-
Return decodedSources.
NOTE: Implementations that support showing source contents but do not support showing multiple sources with the same URL and different content will arbitrarily choose one of the various contents corresponding to the given URL.
5.3. Extensions
Source map consumers must ignore any additional unrecognized properties, rather than causing the source map to be rejected, so that additional features can be added to this format without breaking existing users.
6. Index Map
To support concatenating generated code and other common post-processing, an alternate representation of a map is supported:
{ "version" : 3 , "file" : "app.js" , "sections" : [ { "offset" : { "line" : 0 , "column" : 0 }, "map" : { "version" : 3 , "file" : "section.js" , "sources" : [ "foo.js" , "bar.js" ], "names" : [ "src" , "maps" , "are" , "fun" ], "mappings" : "AAAA,E;;ABCDE" } }, { "offset" : { "line" : 100 , "column" : 10 }, "map" : { "version" : 3 , "file" : "another_section.js" , "sources" : [ "more.js" ], "names" : [ "more" , "is" , "better" ], "mappings" : "AAAA,E;AACA,C;ABCDE" } } ] }
The index map follows the form of the standard map. Like the regular source map, the file format is JSON with a top-level object. It shares the version and file field from the regular source map, but gains a new sections field.
sections
is an array of Section objects.
6.1. Section
Section objects have the following fields:
-
offset
is an object with two fields,line
andcolumn
, that represent the offset into generated code that the referenced source map represents. -
map
is an embedded complete source map object. An embedded map does not inherit any values from the containing index map.
The sections must be sorted by starting position and the represented sections must not overlap.
7. Retrieving Source Maps
7.1. Linking generated code to source maps
While the source map format is intended to be language and platform agnostic, it is useful to define how to reference to them for the expected use-case of web server-hosted JavaScript.
There are two possible ways to link source maps to the output. The first requires server support in order to add an HTTP header and the second requires an annotation in the source.
Source maps are linked through URLs as defined in [URL]; in particular, characters outside the set permitted to appear in URIs must be percent-encoded and it may be a data URI. Using a data URI along with sourcesContent allows for a completely self-contained source map.
The HTTP sourcemap
header has precedence over a source annotation, and if both are present,
the header URL should be used to resolve the source map file.
Regardless of the method used to retrieve the Source Mapping URL the same process is used to resolve it, which is as follows:
When the Source Mapping URL is not absolute, then it is relative to the generated code’s source origin. The source origin is determined by one of the following cases:
-
If the generated source is not associated with a script element that has a
src
attribute and there exists a
comment in the generated code, that comment should be used to determine the source origin.//# sourceURL Note: Previously, this was
, as with//@ sourceURL
, it is reasonable to accept both but//@ sourceMappingURL
is preferred.//# -
If the generated code is associated with a script element and the script element has a
src
attribute, thesrc
attribute of the script element will be the source origin. -
If the generated code is associated with a script element and the script element does not have a
src
attribute, then the source origin will be the page’s origin. -
If the generated code is being evaluated as a string with the
eval
function or via()
, then the source origin will be the page’s origin.new Function()
7.1.1. Linking through HTTP headers
If a file is served through HTTP(S) with a sourcemap
header, the value of the header is
the URL of the linked source map.
sourcemap: < url>
Note: Previous revisions of this document recommended a header name of x
. This
is now deprecated; sourcemap
is now expected.
7.1.2. Linking through inline annotations
The generated code should include a comment, or the equivalent construct depending on its
language or format, named sourceMappingURL
and that contains the URL of the source map. This
specification defines how the comment should look like for JavaScript, CSS, and WebAssembly.
Other languages should follow a similar convention.
For a given language there can be multiple ways of detecting the sourceMappingURL
comment,
to allow for different implementations to choose what is less complex for them. The generated
code unambiguously links to a source map if the result of all the extraction methods
is the same.
If a tool consumes one or more source files that unambiguously links to a source map and it produces an output file that links to a source map, it must do so unambiguously.
let a= ` //# sourceMappingURL=foo.js.map //` ;
Extracing a Source Map URL from it through parsing gives null, while without parsing gives foo
.
A fix to this problem is being worked on, and will likely involve early returning from the below algorithms whenever there is a comment (or comment-like) that contains the characters U+0060 (`), U+0022 ("), or U+0027 ('), or the the sequence U+002A U+002F (*/).
7.1.2.1. Extraction methods for JavaScript sources
To extract a Source Map URL from JavaScript through parsing a string source, run the following steps:
-
Let tokens be the list of tokens obtained by parsing source according to [ECMA-262].
-
For each token in tokens, in reverse order:
-
If token is not a single-line comment or a multi-line comment, return null.
-
Let comment be the content of token.
-
If matching a Source Map URL in comment returns a string, return it.
-
-
Return null.
To extract a Source Map URL from JavaScript without parsing a string source, run the following steps:
-
Let lines be the result of strictly splitting source on ECMAScript line terminator code points.
-
Let lastURL be null.
-
For each line in lines:
-
Let position be a position variable for line, initially pointing at the start of line.
-
While position doesn’t point past the end of line:
-
Collect a sequence of code points that are ECMAScript white space code points from line given position.
NOTE: The collected code points are not used, but position is still updated.
-
If position points past the end of line, break.
-
Let first be the code point of line at position.
-
Increment position by 1.
-
If first is U+002F (/) and position does not point past the end of line, then:
-
Let second be the code point of line at position.
-
Increment position by 1.
-
If second is U+002F (/), then:
-
Let comment be the code point substring from position to the end of line.
-
If matching a Source Map URL in comment returns a string, set lastURL to it.
-
-
Else if second is U+002A (*), then:
-
Let comment be the empty string.
-
While position + 1 doesn’t point past the end of line:
-
Let c1 be the code point of line at position.
-
Increment position by 1.
-
Let c2 be the code point of line at position.
-
If c1 is U+002A (*) and c2 is U+002F (/), then:
-
If matching a Source Map URL in comment returns a string, set lastURL to it.
-
Increment position by 1.
-
-
Append c1 to comment.
-
-
-
Else, set lastURL to null.
-
-
Else, set lastURL to null.
Note: We reset lastURL to null whenever we find a non-comment code character.
-
-
-
Return lastURL.
NOTE: The algorithm above has been designed so that the source lines can be iterated in reverse order,
returning early after scanning through a line that contains a sourceMappingURL
comment.
const JS_NEWLINE= /^/m ; // This RegExp will always match one of the following: // - single-line comments // - "single-line" multi-line comments // - unclosed multi-line comments // - just trailing whitespaces // - a code character // The loop below differentiates between all these cases. const JS_COMMENT= /\s*(?:\/\/(?<single>.*)|\/\*(?<multi>.*?)\*\/|\/\*.*|$|(?<code>[^\/]+))/uym ; const PATTERN= /^[@#]\s*sourceMappingURL=(\S*?)\s*$/ ; let lastURL= null ; for ( const lineof source. split( JS_NEWLINE)) { JS_COMMENT. lastIndex= 0 ; while ( JS_COMMENT. lastIndex< line. length) { let commentMatch= JS_COMMENT. exec( line). groups; let comment= commentMatch. single?? commentMatch. multi; if ( comment!= null ) { let match= PATTERN. exec( comment); if ( match!== null ) lastURL= match[ 1 ]; } else if ( commentMatch. code!= null ) { lastURL= null ; } else { // We found either trailing whitespaces or an unclosed comment. // Assert: JS_COMMENT.lastIndex === line.length } } } return lastURL;
To match a Source Map URL in a comment comment (a string), run the following steps:
-
Let pattern be the regular expression
./^[@#]\s*sourceMappingURL=(\S*?)\s*$/ -
Let match be ! RegExpBuiltInExec(pattern, comment).
-
If match is not null, return match[1].
-
Return null.
Note: The prefix for this annotation was initially
however this conflicts with Internet
Explorer’s Conditional Compilation and was changed to
.
Source map generators must only emit
while source map consumers must accept both
and
.
7.1.2.2. Extraction methods for CSS sources
Extracting source mapping URLs from CSS is similar to JavaScript, with the exception that CSS only
supports
-style comments.
7.1.2.3. Extraction methods for WebAssembly binaries
To extract a Source Map URL from a WebAssembly source given a byte sequence bytes, run the following steps:
-
Let module be module_decode(bytes).
-
If module is error, return null.
-
For each custom section customSection of module,
-
Let name be the
name
of customSection, decoded as UTF-8. -
If name is "sourceMappingURL", then:
-
Let value be the
bytes
of customSection, decoded as UTF-8. -
If value is failure, return null.
-
Return value.
-
-
-
Return null.
Since WebAssembly is not a textual format and it does not support comments, it supports a single unambiguous extraction method. The URL is encoded using [WasmNamesBinaryFormat], and it’s placed as the content of the custom section. It is invalid for tools that generate WebAssembly code to generate two or more custom sections with the "sourceMappingURL" name.
7.2. Fetching Source Maps
To fetch a source map given a URL url, run the following steps:
-
Let promise be a new promise.
-
Fetch request with processResponseConsumeBody set to the following steps given response response and null, failure, or a byte sequence bodyBytes:
-
If bodyBytes is null or failure, reject promise with a
TypeError
and abort these steps. -
If url’s scheme is an HTTP(S) scheme and bodyBytes starts with `
`, then:)]} '-
While bodyBytes’s length is not 0 and bodyBytes’s 0th byte is not an HTTP newline byte:
-
remove the 0th byte from bodyBytes.
Note: For historic reasons, when delivering source maps over HTTP(S), servers may prepend a line starting with the string
to the source map.)]} ')]} 'garbage here{ "version" : 3 , ...} is interpreted as
{ "version" : 3 , ...} -
-
-
Let sourceMap be the result of parsing JSON bytes to a JavaScript value given bodyBytes.
-
If the previous step threw an error, reject promise with that error.
-
Otherwise, resolve promise with sourceMap.
-
-
Return promise.
8. Conventions
The following conventions should be followed when working with source maps or when generating them.
8.1. Source Map Naming
Commonly, a source map will have the same name as the generated file but with a
extension. For example, for page
a source map named page
would be generated.
8.2. Linking eval’d code to named generated code
There is an existing convention that should be supported for the use of source maps with eval’d code, it has the following form:
//# sourceURL=foo.js
It is described in [EvalSourceURL].
9. Language Neutral Stack Mapping Notes
Stack tracing mapping without knowledge of the source language is not covered by this document.
10. Multi-level Mapping Notes
It is getting more common to have tools generate sources from some DSL (templates) or compile TypeScript -> JavaScript -> minified JavaScript, resulting in multiple translations before the final source map is created. This problem can be handled in one of two ways. The easy but lossy way is to ignore the intermediate steps in the process for the purposes of debugging, the source location information from the translation is either ignored (the intermediate translation is considered the “Original Source”) or the source location information is carried through (the intermediate translation hidden). The more complete way is to support multiple levels of mapping: if the Original Source also has a source map reference, the user is given the choice of using that as well.
However, It is unclear what a "source map reference" looks like in anything other than JavaScript. More specifically, what a source map reference looks like in a language that doesn’t support JavaScript-style single-line comments.
11. License
This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.