{"id":9704,"date":"2024-05-07T12:20:08","date_gmt":"2024-05-07T20:20:08","guid":{"rendered":"https:\/\/devblogs.microsoft.com\/python\/?p=9704"},"modified":"2024-05-07T12:20:08","modified_gmt":"2024-05-07T20:20:08","slug":"announcing-data-wrangler-code-centric-viewing-and-cleaning-of-tabular-data-in-visual-studio-code","status":"publish","type":"post","link":"https:\/\/devblogs.microsoft.com\/python\/announcing-data-wrangler-code-centric-viewing-and-cleaning-of-tabular-data-in-visual-studio-code\/","title":{"rendered":"Announcing Data Wrangler: Code-centric viewing and cleaning of tabular data in Visual Studio Code"},"content":{"rendered":"<p>Today, we are excited to announce the general availability of the <a href=\"https:\/\/marketplace.visualstudio.com\/items?itemName=ms-toolsai.datawrangler\">Data Wrangler extension for Visual Studio Code<\/a>! Data Wrangler is a free extension that offers data viewing and cleaning that is directly integrated into VS Code and the Jupyter extension. It provides a rich user interface to view and analyze your data, show insightful column statistics and visualizations, and automatically generate Pandas code as you clean and transform the data. We want to thank all the early adopters who tried out the extension preview over the past year, as your valuable feedback has been crucial to this release.<\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/FUll-end-to-end-1.gif\"><img decoding=\"async\" class=\"aligncenter size-full wp-image-9722\" src=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/FUll-end-to-end-1.gif\" alt=\"Image FUll end to end\" width=\"1510\" height=\"836\" \/><\/a><\/p>\n<p>With this general availability, we are also announcing that the <a href=\"https:\/\/code.visualstudio.com\/docs\/datascience\/jupyter-notebooks#_data-viewer\">data viewer<\/a> feature in the Jupyter extension will be going away. In its place, you will be able to use the new and improved data viewing experience offered by Data Wrangler, which is also built by Microsoft. We understand that the data viewer was a beloved feature from our customers, and we see this as the next evolution to working with data in VS Code in an extensible manner and hope that you will love the Data Wrangler extension even more than the data viewer feature. Several of the improvements and features of Data Wrangler are highlighted below.<\/p>\n<p>&nbsp;<\/p>\n<h2>Previewing data<\/h2>\n<p>Once the Data Wrangler extension is installed, you can get to Data Wrangler in one of three ways from the Jupyter Notebook.<\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Screenshot-2024-05-03-at-11.49.49\u202fAM.png\"><img decoding=\"async\" class=\"aligncenter wp-image-9707 size-full\" src=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Screenshot-2024-05-03-at-11.49.49\u202fAM.png\" alt=\"The 3 entry points into Data Wrangler from the Notebook\" width=\"1938\" height=\"1352\" srcset=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Screenshot-2024-05-03-at-11.49.49\u202fAM.png 1938w, https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Screenshot-2024-05-03-at-11.49.49\u202fAM-300x209.png 300w, https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Screenshot-2024-05-03-at-11.49.49\u202fAM-1024x714.png 1024w, https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Screenshot-2024-05-03-at-11.49.49\u202fAM-768x536.png 768w, https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Screenshot-2024-05-03-at-11.49.49\u202fAM-1536x1072.png 1536w\" sizes=\"(max-width: 1938px) 100vw, 1938px\" \/><\/a><\/p>\n<ol>\n<li>In the\u00a0<strong>Jupyter<\/strong>\u00a0&gt;\u00a0<strong>Variables<\/strong>\u00a0panel, beside any supported data object, you can see a button to open it in Data Wrangler.<\/li>\n<li>If you have a supported data object in your notebook (such as a Pandas DataFrame), you can now see an\u00a0<strong>Open &#8216;df&#8217; in Data Wrangler<\/strong> button (where &#8216;df&#8217; is the variable name of your data frame) appear in bottom of the cell after running code that outputs the data frame. This includes df.head(), df.tail(), display(df), print(df), df.<\/li>\n<li>In the notebook toolbar, selecting\u00a0<strong>View data<\/strong>\u00a0brings up a list of every supported data object in your notebook. You can then choose which variable in that list you want to open in Data Wrangler.<\/li>\n<\/ol>\n<p>Alternatively, Data Wrangler can also be directly opened from a local file (such as CSV, Excel, or parquet files) by right clicking the file and selecting \u201cOpen in Data Wrangler\u201d.<\/p>\n<p>&nbsp;<\/p>\n<h2>Filtering and sorting<\/h2>\n<p>Data Wrangler can be used to quickly filter and sort through your rows of data.<\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Filter.gif\"><img decoding=\"async\" class=\"aligncenter wp-image-9713 size-full\" src=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Filter.gif\" alt=\"A gif showing the filter feature in Data Wrangler\" width=\"1510\" height=\"836\" \/><\/a><\/p>\n<h2><\/h2>\n<h2>Transforming data<\/h2>\n<p>Switch from Viewing to Editing mode to unlock additional functionality and built-in data cleaning operations in Data Wrangler. For a full list of supported operations, see the documentation <a href=\"https:\/\/code.visualstudio.com\/docs\/datascience\/data-wrangler#_data-wrangler-operations\">here<\/a>.<\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/View-to-edit-and-op.gif\"><img decoding=\"async\" class=\"aligncenter wp-image-9711 size-full\" src=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/View-to-edit-and-op.gif\" alt=\"A gif showing the switch from viewing to editing modes in Data Wrangler\" width=\"1510\" height=\"836\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<h2>Code generation<\/h2>\n<p>As you make changes to the data using the built-in operations, Data Wrangler automatically generates code using open-source Python libraries for the data transformation operations you perform.<\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/By-Example-operation.gif\"><img decoding=\"async\" class=\"aligncenter wp-image-9712 size-full\" src=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/By-Example-operation.gif\" alt=\"A gif showing a data transformation operation in Data Wrangler\" width=\"1510\" height=\"836\" \/><\/a><\/p>\n<p>When you are done wrangling your data, all the automatically generated code from your data cleaning session can then be exported either back into your Notebook, or into a new Python file.<\/p>\n<p>&nbsp;<\/p>\n<h2>Trying Data Wrangler today<\/h2>\n<p>To start using Data Wrangler today in\u00a0<a href=\"https:\/\/code.visualstudio.com\/\">Visual Studio Code<\/a>, just download the <a href=\"https:\/\/marketplace.visualstudio.com\/items?itemName=ms-toolsai.datawrangler\">Data Wrangler extension<\/a> from the VS Code marketplace to try it out! You can then launch Data Wrangler from any supported data object in a Jupyter Notebook or direct from a data file.<\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Picture1.png\"><img decoding=\"async\" class=\"aligncenter wp-image-9708 size-full\" src=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Picture1.png\" alt=\"A screenshot of Data Wrangler in the marketplace\" width=\"936\" height=\"516\" srcset=\"https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Picture1.png 936w, https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Picture1-300x165.png 300w, https:\/\/devblogs.microsoft.com\/python\/wp-content\/uploads\/sites\/12\/2024\/05\/Picture1-768x423.png 768w\" sizes=\"(max-width: 936px) 100vw, 936px\" \/><\/a><\/p>\n<p>This article only covered some of the high-level features of what Data Wrangler can do. To learn more about Data Wrangler in detail, please check out the\u00a0<a href=\"https:\/\/code.visualstudio.com\/docs\/datascience\/data-wrangler\">Data Wrangler documentation<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Today, we are excited to announce the general availability of the Data Wrangler extension for Visual Studio Code! Data Wrangler is a free extension that offers data viewing and cleaning that is directly integrated into VS Code and the Jupyter extension. It provides a rich user interface to view and analyze your data, show insightful [&hellip;]<\/p>\n","protected":false},"author":8347,"featured_media":9713,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[1,6],"tags":[1266,1263,1032,1264,1265,14,1267,267],"class_list":["post-9704","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-python","category-visual-studio-code","tag-csv","tag-data","tag-data-science","tag-data-wrangler","tag-excel","tag-jupyter","tag-notebooks","tag-vs-code"],"acf":[],"blog_post_summary":"<p>Today, we are excited to announce the general availability of the Data Wrangler extension for Visual Studio Code! Data Wrangler is a free extension that offers data viewing and cleaning that is directly integrated into VS Code and the Jupyter extension. It provides a rich user interface to view and analyze your data, show insightful [&hellip;]<\/p>\n","_links":{"self":[{"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/posts\/9704","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/users\/8347"}],"replies":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/comments?post=9704"}],"version-history":[{"count":0,"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/posts\/9704\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/media\/9713"}],"wp:attachment":[{"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/media?parent=9704"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/categories?post=9704"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/python\/wp-json\/wp\/v2\/tags?post=9704"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}