谷歌開源可視化工具Facets,將用於人+AI協做項目研究——無非就是一個用於特徵工程探索的繪圖工具集,pandas能夠作的

見:http://www.infoq.com/cn/news/2017/07/goole-sight-facets-aigit

https://github.com/PAIR-code/facets/blob/master/facets_dive/README.mdgithub

 

Introduction

The facets project contains two visualizations for understanding and analyzing machine learning datasets: Facets Overview and Facets Dive.web

The visualizations are implemented as Polymer web components, backed by Typescript code and can be easily embedded into Jupyter notebooks or webpages.typescript

Live demos of the visualizations can be found on the Facets project description page.rest

Facets Overview

Overview visualization of UCI census data

Overview gives a high-level view of one or more data sets. It produces a visual feature-by-feature statistical analysis, and can also be used to compare statistics across two or more data sets. The tool can process both numeric and string features, including multiple instances of a number or string per feature.code

Overview can help uncover issues with datasets, including the following:component

  • Unexpected feature values
  • Missing feature values for a large number of examples
  • Training/serving skew
  • Training/test/validation set skew

Key aspects of the visualization are outlier detection and distribution comparison across multiple datasets. Interesting values (such as a high proportion of missing data, or very different distributions of a feature across multiple datasets) are highlighted in red. Features can be sorted by values of interest such as the number of missing values or the skew between the different datasets.orm

Details about Overview usage can be found in its README.ip

Facets Dive

Dive visualization of UCI census data

相關文章
相關標籤/搜索