# Data Similarity Map

## Find it in `Dataset` -> `Overview`&#x20;

<figure><img src="https://2222059734-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FgZbaVXXtfTXMMcqdnKWV%2Fuploads%2Fwiz9gXRdrL6xMHia8uFt%2Fimage.png?alt=media&#x26;token=b3e7ed28-6157-4a2c-8ff6-de73c38eaec6" alt=""><figcaption></figcaption></figure>

## Select Data

Use b-box or polygon tool to select data on the map.

<figure><img src="https://2222059734-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FgZbaVXXtfTXMMcqdnKWV%2Fuploads%2FsBhowWGnHgOyjY4hNKg5%2Fselect.png?alt=media&#x26;token=9b6c6302-dc8b-4c21-bfc1-8716fafa24c3" alt=""><figcaption></figcaption></figure>

## View Selected Data

Selected data can be displayed and save as a new dataset.

<figure><img src="https://2222059734-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FgZbaVXXtfTXMMcqdnKWV%2Fuploads%2FUhesQarctSRvArwstesC%2Fimage.png?alt=media&#x26;token=63b35d35-87d5-48dd-b5c4-5061c3ff056f" alt=""><figcaption><p>The data close to each other in the map shows that they have a high degree of similarity.</p></figcaption></figure>

## Learn more tech details, see these two open source repos:

* [A PyTorch implementation of MobileNetV3](https://github.com/xiaolai-sqlai/mobilenetv3) is a convolutional neural network that is tuned to mobile phone CPUs through a combination of hardware-aware network architecture search (NAS) complemented by the NetAdapt algorithm, and then subsequently improved through novel architecture advances.
* [openTSNE](https://github.com/pavlin-policar/openTSNE) is a modular Python implementation of t-Distributed Stochasitc Neighbor Embedding (t-SNE), a popular dimensionality-reduction algorithm for visualizing high-dimensional data sets. openTSNE incorporates the latest improvements to the t-SNE algorithm, including the ability to add new data points to existing embeddings, massive speed improvements, enabling t-SNE to scale to millions of data points and various tricks to improve global alignment of the resulting visualizations.
