HTML Media Capture W3C Working Draft 20 July 2010 This version: http://www.w3.org/TR/2010/WD-html-media-capture-20100720/ Latest published version: http://www.w3.org/TR/html-media-capture/ Latest editor's draft: http://dev.w3.org/2009/dap/camera/ Previous version: http://www.w3.org/TR/2010/WD-capture-api-20100401/ Editors: Ilkka Oksanen, Nokia Dominique Hazaël-Massieux, W3C Copyright © 2010 W3C ® ( MIT , ERCIM , Keio), All Rights Reserved. W3C liability, trademark and document use rules apply.

Abstract This specification defines HTML form enhancements that provide access to the audio, image and video capture capabilities of the device.

Status of This Document This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/. This document is a the first part of the split of the previous version of this document, focused on the integration of media capture in HTML forms based on an extension to the FileAPI. The second part of the split focused on programmatic access to the capture devices will be published separately. The Working Group is looking for feedback on the general approach of this new version, and will coordinate with the HTML and Web Applications Working Group to ensure the proper progress of this document. Issues and editors notes in the document highlight some of the points on which the group is still working and would particularly like to get feedback. This document was published by the Device APIs and Policy Working Group as a Working Draft. This document is intended to become a W3C Recommendation. If you wish to make comments regarding this document, please send them to public-device-apis@w3.org (subscribe, archives). All feedback is welcome. Publication as a Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress. This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.

1. Introduction The HTML Form Based Media Capturing specification defines a new interface for media files, a new parameter for the accept attribute of the HTML input element in file upload state, and recommendations for providing optimized access to the microphone and camera of a hosting device. Providing streaming access to these capabilities is outside of the scope of this specification. The Working Group is investigating the opportunity to specify streaming access via the proposed <device> element.

2. Conformance As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative. The key words must, must not, required, should, should not, recommended, may, and optional in this specification are to be interpreted as described in [RFC2119]. This specification defines conformance criteria that apply to a single product: the user agent that implements the interfaces that it contains. Implementations that use ECMAScript to implement the APIs defined in this specification must implement them in a manner consistent with the ECMAScript Bindings defined in the Web IDL specification [WEBIDL], as this specification uses that specification and terminology.

3. Security and Privacy Considerations This specification builds upon the security and privacy protections provided by the [HTML5] <input type="file"> and the [FILE-API] specifications; in particular, it is expected that any offer to start capturing content from the user’s device would require a specific user interaction on an HTML element that is entirely controlled by the user agent. In addition to the requirements already highlighted in the [HTML5] and [FILE-API] specifications, implementors should take care of additional leakage of privacy-sensitive data from captured media. For instance, embedding the user’s location in a captured media metadata (e.g. EXIF) might transmit more private data than the user might be expecting.

4. Capture aware file-select control This section is normative. [HTML5] links <input type="file"> to the File interface. This specification defines a refined MediaFile interface to be used when the accept attribute take certain values — this will require coordination with the HTML5 Working Group. If input element in the File Upload state [HTML5] contains accept attribute with values image/* , sound/* , or video/* , the user agent can invoke a file picker that allows respectively the user to take a picture, record a sound file, or record a video in addition to selecting an existing file from the file system. See the User Interface Examples appendix for the illustration. In case the user chooses to capture video, audio, or image content, the user agent creates media files on the fly as specified in [HTML5]. If the user captures media files successfully, the files attribute [HTML5] must return a MediaList object that represents the current captured content instead of FileList [FILE-API] object as specified in [HTML5]. < input type = "file" accept = "image/*" >

5. The capture parameter This section is normative. The capture parameter may be specified on the media type values of the accept attribute to provide user agents with a hint of that by the default a file picker should be in media capturing mode. [HTML5] defines the accept attribute to take no parameters on MIME types. This specification proposes to use a MIME type parameter — this will require coordination with the HTML5 Working Group. The capture parameter can take one of the following values: camera , camcorder , microphone , filesystem . These values indicate which source the file picker interface should preferably present to the user by default. The values and their exact meaning are still very much in flux. For example, the following code indicates that the user is expected to upload an image from the device camera: < input type = "file" accept = "image/*;capture=camera" > A possible rendering of a file picker taking this parameter into account is offered in the User Interface Examples appendix.

6. WebIDL interfaces 6.1 MediaFile interface MediaFile captures a single photo, video or sound captured by the device. It inherits from File [FILE-API]. NoInterfaceObject ] interface MediaFile : File FormatData format readonly attribute ] interface}; 6.1.1 Attributes format of type FormatData The format attribute represents the format information of MediaFile object. The Working Group is considering whether the indirection provided by the format attribute is useful or if the MediaFile interface should inherit directly from FormatData . No exceptions. 6.2 FormatData interface FormatData captures format information of a media file captured by the device. The relationship between this FormatData interface and the properties made available through the API for Media Resource 1.0 [MEDIAONT-API] needs further investigation. NoInterfaceObject ] interface FormatData { DOMString codecs attribute unsigned long bitrate attribute unsigned long height attribute unsigned long width attribute double duration attribute ] interface}; 6.2.1 Attributes bitrate of type unsigned long The codecs attribute only specifies the profile and level of the encoded content which doesn't specify the actual bitrate. It only specifies the maximum encoded bitrate, thus this bitrate attribute is the average bitrate of the content. In the case of an image this attribute has value 0. No exceptions. codecs of type DOMString The type attribute of the Blob interface (inherited from the File interface) is not sufficient to determine the format of the content since it only specifies the container type. The codecs attribute represents the actual format that the audio and video of the content. The codecs attribute must conform to the [RFC4281]. For example, a valid value for H.263 video and AAC low complexity would be codecs="s263, mp4a.40.2". No exceptions. duration of type double The duration attribute represents length of the video or sound clip in milliseconds. In the case of an image this attribute has value 0. No exceptions. height of type unsigned long The height attribute represents height of the image or video in pixels. In the case of a sound clip this attribute has value 0. No exceptions. width of type unsigned long The width attribute represents width of the image or video in pixels. In the case of a sound clip this attribute has value 0. No exceptions. Some of the proposed attributes of the FormatData interface could possibly be integrated as parameters of the MIME type, or as MIME options object.

7. MediaList typedef The MediaList typedef represents a sequence of MediaFile objects. sequence< MediaFile MediaList ; typedef MediaList is used to refer to the sequence< MediaFile Throughout this specification, the identifieris used to refer to thetype.