Overview
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 52100 |
| Missing cells | 100444 |
| Missing cells (%) | 17.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 25.2 MiB |
| Average record size in memory | 508.1 B |
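The missing-cell figures can be cross-checked against the per-column missing counts listed in this report. A minimal sketch in Python, using only numbers reported here (the variable names are illustrative):

```python
# Per-column missing counts as reported in this profile.
missing_by_column = {
    "data_perda": 3308,
    "data_venda": 45891,
    "utm_source": 3766,
    "sdr": 26070,
    "closer": 14812,
    "motivoPerda": 6597,
}
n_variables = 11
n_observations = 52100

missing_cells = sum(missing_by_column.values())
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)          # 100444
print(f"{missing_pct:.1f}%")  # 17.5%
```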
Variable types
| Numeric | 1 |
|---|---|
| DateTime | 3 |
| Categorical | 7 |
Alerts
| Alert | Type |
|---|---|
| indicado is highly overall correlated with utm_source | High correlation |
| motivoPerda is highly overall correlated with tipo | High correlation |
| tipo is highly overall correlated with motivoPerda and 1 other field | High correlation |
| utm_source is highly overall correlated with indicado and 1 other field | High correlation |
| utm_source is highly imbalanced (57.8%) | Imbalance |
| indicado is highly imbalanced (74.4%) | Imbalance |
| data_perda has 3308 (6.3%) missing values | Missing |
| data_venda has 45891 (88.1%) missing values | Missing |
| utm_source has 3766 (7.2%) missing values | Missing |
| sdr has 26070 (50.0%) missing values | Missing |
| closer has 14812 (28.4%) missing values | Missing |
| motivoPerda has 6597 (12.7%) missing values | Missing |
| id is uniformly distributed | Uniform |
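The imbalance percentage reported for indicado is consistent with one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition (the function name `imbalance_score` is illustrative, not taken from the report) and applies it to the two counts reported for indicado, 49863 and 2237.

```python
import math

def imbalance_score(counts):
    """1 - (Shannon entropy / maximum entropy); 0 = balanced, 1 = single class."""
    if len(counts) < 2:
        return 0.0  # a single class has no meaningful balance to measure
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))

# Class counts for "indicado" as reported in this profile.
score = imbalance_score([49863, 2237])
print(f"{score:.1%}")  # 74.4%
```

A perfectly balanced column scores 0 under this definition, so values this close to 1 indicate that one class dominates.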
Reproduction
| Analysis started | 2025-11-26 17:55:54.710158 |
|---|---|
| Analysis finished | 2025-11-26 17:56:02.540475 |
| Duration | 7.83 seconds |
| Software version | ydata-profiling v4.18.0 |
Variables
id
Real number (ℝ)
Uniform
| Distinct | 52098 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 106263.11 |
| Minimum | 79195 |
| Maximum | 133333 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 407.2 KiB |
Quantile statistics
| Minimum | 79195 |
|---|---|
| 5-th percentile | 81890.95 |
| Q1 | 92635.75 |
| median | 106267.5 |
| Q3 | 119894.25 |
| 95-th percentile | 130631.05 |
| Maximum | 133333 |
| Range | 54138 |
| Interquartile range (IQR) | 27258.5 |
Descriptive statistics
| Standard deviation | 15668.164 |
|---|---|
| Coefficient of variation (CV) | 0.14744688 |
| Kurtosis | -1.2089851 |
| Mean | 106263.11 |
| Median Absolute Deviation (MAD) | 13629.5 |
| Skewness | 0.00037357976 |
| Sum | 5.5363079 × 10⁹ |
| Variance | 2.4549137 × 10⁸ |
| Monotonicity | Not monotonic |
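The derived statistics for id can be reproduced from the reported quantiles and moments, and the Uniform flag can be sanity-checked against theory. For a continuous uniform distribution on [min, max], the standard deviation is (max − min)/√12 and the excess kurtosis is −1.2, close to the reported −1.209. The sketch below uses only values from this report; comparing against the continuous uniform is an assumption made for illustration, not necessarily the test the profiler applies.

```python
import math

# Values reported for "id" in this profile.
minimum, maximum = 79195, 133333
q1, q3 = 92635.75, 119894.25
mean, std = 106263.11, 15668.164

value_range = maximum - minimum  # 54138
iqr = q3 - q1                    # 27258.5
cv = std / mean                  # coefficient of variation

# Standard deviation of a continuous uniform distribution on [minimum, maximum].
uniform_std = value_range / math.sqrt(12)
relative_gap = abs(uniform_std - std) / std

print(value_range, iqr)
print(round(cv, 4))
print(f"uniform std differs from observed std by {relative_gap:.2%}")
```

The theoretical and observed standard deviations differ by well under 1%, which agrees with the near-zero skewness reported above.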
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 128971 | 2 | < 0.1% |
| 79806 | 2 | < 0.1% |
| 79342 | 1 | < 0.1% |
| 115302 | 1 | < 0.1% |
| 115340 | 1 | < 0.1% |
| 115338 | 1 | < 0.1% |
| 115337 | 1 | < 0.1% |
| 115330 | 1 | < 0.1% |
| 115326 | 1 | < 0.1% |
| 115322 | 1 | < 0.1% |
| Other values (52088) | 52088 | > 99.9% |
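Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 79195 | 1 | < 0.1% |
| 79196 | 1 | < 0.1% |
| 79197 | 1 | < 0.1% |
| 79198 | 1 | < 0.1% |
| 79199 | 1 | < 0.1% |
| 79200 | 1 | < 0.1% |
| 79201 | 1 | < 0.1% |
| 79202 | 1 | < 0.1% |
| 79203 | 1 | < 0.1% |
| 79204 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 133333 | 1 | < 0.1% |
| 133332 | 1 | < 0.1% |
| 133331 | 1 | < 0.1% |
| 133330 | 1 | < 0.1% |
| 133329 | 1 | < 0.1% |
| 133328 | 1 | < 0.1% |
| 133327 | 1 | < 0.1% |
| 133326 | 1 | < 0.1% |
| 133325 | 1 | < 0.1% |
| 133324 | 1 | < 0.1% |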
data_criacao
Date
| Distinct | 395 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 407.2 KiB |
| Minimum | 2024-04-01 00:00:00 |
| Maximum | 2025-04-30 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Histogram with fixed size bins (bins=50)
data_perda
Date
Missing
| Distinct | 565 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 3308 |
| Missing (%) | 6.3% |
| Memory size | 407.2 KiB |
| Minimum | 2024-04-01 00:00:00 |
| Maximum | 2025-11-18 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Histogram with fixed size bins (bins=50)
data_venda
Date
Missing
| Distinct | 492 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 45891 |
| Missing (%) | 88.1% |
| Memory size | 407.2 KiB |
| Minimum | 2021-09-01 00:00:00 |
| Maximum | 2025-11-18 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Histogram with fixed size bins (bins=50)
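The Distinct (%) figures for the date columns only match the reported counts when the denominator is the number of non-missing observations rather than all 52100 rows. This is inferred from the numbers in this report, not from the profiler's documentation. A short check (the helper name `distinct_pct` is illustrative):

```python
N_OBSERVATIONS = 52100

def distinct_pct(distinct, missing, n_observations=N_OBSERVATIONS):
    """Distinct values as a percentage of the non-missing observations."""
    return 100 * distinct / (n_observations - missing)

# (distinct, missing) pairs as reported in this profile.
print(f"{distinct_pct(395, 0):.1f}%")      # data_criacao -> 0.8%
print(f"{distinct_pct(565, 3308):.1f}%")   # data_perda   -> 1.2%
print(f"{distinct_pct(492, 45891):.1f}%")  # data_venda   -> 7.9%
```

Dividing the 492 distinct data_venda values by all 52100 rows would give about 0.9%, not the 7.9% shown above.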
utm_source
Categorical
High correlation Imbalance Missing
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3766 |
| Missing (%) | 7.2% |
| Memory size | 3.0 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 27 |
| Mean length | 9.2695618 |
| Min length | 4 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | typeform-Gentileza |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | typeform-BLIP |
| 5th row | typeform-Ex Membro |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| | 19622 | 37.7% |
| | 17538 | 33.7% |
| Whatsapp Oficial | 4535 | 8.7% |
| Não informado | 2066 | 4.0% |
| typeform-BLIP | 1187 | 2.3% |
| typeform-Ex Membro | 594 | 1.1% |
| typeform-Indicação Interna | 500 | 1.0% |
| typeform-Outros | 446 | 0.9% |
| typeform-Gentileza | 425 | 0.8% |
| indicacao-app | 386 | 0.7% |
| Other values (28) | 1035 | 2.0% |
| (Missing) | 3766 | 7.2% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
|---|---|---|
| | 19622 | 34.9% |
| | 17539 | 31.2% |
| | 4574 | 8.1% |
| oficial | 4574 | 8.1% |
| não | 2066 | 3.7% |
| informado | 2066 | 3.7% |
| typeform-blip | 1187 | 2.1% |
| typeform-ex | 594 | 1.1% |
| membro | 594 | 1.1% |
| typeform-indicação | 500 | 0.9% |
| Other values (35) | 2902 | 5.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| g | 57039 | 12.7% |
| a | 54430 | 12.1% |
| o | 51684 | 11.5% |
| i | 31837 | 7.1% |
| t | 27050 | 6.0% |
| e | 25628 | 5.7% |
| l | 25100 | 5.6% |
| r | 25049 | 5.6% |
| m | 23966 | 5.3% |
| s | 23080 | 5.2% |
| Other values (41) | 103172 | 23.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 448035 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 448035 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 448035 |
sdr
Categorical
Missing
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 26070 |
| Missing (%) | 50.0% |
| Memory size | 2.8 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 5.9180177 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Ingrid |
|---|---|
| 2nd row | Ingrid |
| 3rd row | Ana |
| 4th row | Ingrid |
| 5th row | Ana |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Ana | 2158 | 4.1% |
| Camila | 2085 | 4.0% |
| Ingrid | 2074 | 4.0% |
| Gabriel | 1841 | 3.5% |
| Eric | 1821 | 3.5% |
| Morggiany | 1665 | 3.2% |
| Luan | 1656 | 3.2% |
| Lucas | 1510 | 2.9% |
| Emillyn | 1308 | 2.5% |
| Bianca | 1140 | 2.2% |
| Other values (23) | 8772 | 16.8% |
| (Missing) | 26070 | 50.0% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
|---|---|---|
| ana | 2158 | 8.3% |
| camila | 2085 | 8.0% |
| ingrid | 2074 | 8.0% |
| gabriel | 1841 | 7.1% |
| eric | 1821 | 7.0% |
| morggiany | 1665 | 6.4% |
| luan | 1656 | 6.4% |
| lucas | 1510 | 5.8% |
| emillyn | 1308 | 5.0% |
| bianca | 1140 | 4.4% |
| Other values (23) | 8772 | 33.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 27258 | 17.7% |
| i | 19217 | 12.5% |
| n | 13051 | 8.5% |
| r | 10732 | 7.0% |
| l | 9854 | 6.4% |
| g | 6690 | 4.3% |
| c | 5581 | 3.6% |
| o | 5515 | 3.6% |
| u | 5079 | 3.3% |
| e | 4884 | 3.2% |
| Other values (25) | 46185 | 30.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 154046 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 154046 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 154046 |
closer
Categorical
Missing
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 14812 |
| Missing (%) | 28.4% |
| Memory size | 2.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.9185797 |
| Min length | 3 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Raquel |
|---|---|
| 2nd row | Raquel |
| 3rd row | Bianca |
| 4th row | Bianca |
| 5th row | Adriele |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Lana | 3387 | 6.5% |
| Barbara | 3240 | 6.2% |
| Niq | 3164 | 6.1% |
| Gabriella | 2503 | 4.8% |
| Raquel | 2466 | 4.7% |
| Bianca | 2438 | 4.7% |
| Debora | 2337 | 4.5% |
| Laura | 2175 | 4.2% |
| Luan | 2009 | 3.9% |
| Leonardo | 1916 | 3.7% |
| Other values (39) | 11653 | 22.4% |
| (Missing) | 14812 | 28.4% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
|---|---|---|
| lana | 3387 | 9.1% |
| barbara | 3240 | 8.7% |
| niq | 3164 | 8.5% |
| gabriella | 2503 | 6.7% |
| raquel | 2466 | 6.6% |
| bianca | 2438 | 6.5% |
| debora | 2337 | 6.3% |
| laura | 2175 | 5.8% |
| luan | 2009 | 5.4% |
| leonardo | 1916 | 5.1% |
| Other values (39) | 11653 | 31.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 51480 | 23.3% |
| r | 21899 | 9.9% |
| e | 17364 | 7.9% |
| i | 17116 | 7.8% |
| n | 12515 | 5.7% |
| l | 11341 | 5.1% |
| L | 10341 | 4.7% |
| b | 8253 | 3.7% |
| u | 8100 | 3.7% |
| o | 8062 | 3.7% |
| Other values (33) | 54221 | 24.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 220692 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 220692 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 220692 |
profissao
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 9.2061612 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Psicologia |
|---|---|
| 2nd row | Nutrição |
| 3rd row | Psicologia |
| 4th row | Odontologia |
| 5th row | Outros |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Medicina | 11843 | 22.7% |
| Outros | 9945 | 19.1% |
| Psicologia | 9066 | 17.4% |
| Nutrição | 3773 | 7.2% |
| Odontologia | 3478 | 6.7% |
| Biomedicina | 2898 | 5.6% |
| Fisioterapia | 2889 | 5.5% |
| Não Informado | 2466 | 4.7% |
| Psicanálise | 1841 | 3.5% |
| Enfermagem | 1669 | 3.2% |
| Other values (5) | 2232 | 4.3% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
|---|---|---|
| medicina | 11843 | 21.3% |
| outros | 9945 | 17.9% |
| psicologia | 9066 | 16.3% |
| nutrição | 3773 | 6.8% |
| odontologia | 3478 | 6.3% |
| biomedicina | 2898 | 5.2% |
| fisioterapia | 2889 | 5.2% |
| não | 2466 | 4.4% |
| informado | 2466 | 4.4% |
| psicanálise | 1841 | 3.3% |
| Other values (8) | 4841 | 8.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 73570 | 15.3% |
| o | 58919 | 12.3% |
| a | 44694 | 9.3% |
| c | 28191 | 5.9% |
| s | 25925 | 5.4% |
| n | 25418 | 5.3% |
| e | 23406 | 4.9% |
| r | 22002 | 4.6% |
| d | 21654 | 4.5% |
| t | 20085 | 4.2% |
| Other values (20) | 135777 | 28.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 479641 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 479641 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 479641 |
indicado
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 11.828253 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Indicado |
|---|---|
| 2nd row | Não Indicado |
| 3rd row | Não Indicado |
| 4th row | Indicado |
| 5th row | Não Indicado |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Não Indicado | 49863 | 95.7% |
| Indicado | 2237 | 4.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| indicado | 52100 | 51.1% |
| não | 49863 | 48.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| d | 104200 | 16.9% |
| o | 101963 | 16.5% |
| I | 52100 | 8.5% |
| n | 52100 | 8.5% |
| i | 52100 | 8.5% |
| c | 52100 | 8.5% |
| a | 52100 | 8.5% |
| N | 49863 | 8.1% |
| ã | 49863 | 8.1% |
| (space) | 49863 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 616252 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 616252 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 616252 |
tipo
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.88881 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Qualificado |
|---|---|
| 2nd row | Qualificado |
| 3rd row | Qualificado |
| 4th row | Qualificado |
| 5th row | Qualificado |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Qualificado | 36651 | 70.3% |
| Desqualificado | 15444 | 29.6% |
| Bucket | 5 | < 0.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| qualificado | 36651 | 70.3% |
| desqualificado | 15444 | 29.6% |
| bucket | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 104190 | 16.8% |
| i | 104190 | 16.8% |
| u | 52100 | 8.4% |
| c | 52100 | 8.4% |
| l | 52095 | 8.4% |
| f | 52095 | 8.4% |
| d | 52095 | 8.4% |
| o | 52095 | 8.4% |
| Q | 36651 | 5.9% |
| e | 15449 | 2.5% |
| Other values (6) | 46347 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 619407 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 619407 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 619407 |
motivoPerda
Categorical
High correlation Missing
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 6597 |
| Missing (%) | 12.7% |
| Memory size | 5.7 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 50 |
| Mean length | 31.52313 |
| Min length | 10 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NF - Concorrente |
|---|---|
| 2nd row | NF - Concorrente |
| 3rd row | NF - Consultório Próprio |
| 4th row | NF - Consultório Próprio |
| 5th row | NF - Curioso / Pesquisando |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| NF - Desistência após tentativas | 17873 | 34.3% |
| NF - Lead não é profissional da saúde | 12043 | 23.1% |
| NF - Dados incorretos ou inexistentes | 2369 | 4.5% |
| NF - Estrutura não atende | 2133 | 4.1% |
| NF - Outros | 1830 | 3.5% |
| NF - Desistiu de abrir consultório agora | 1610 | 3.1% |
| NF - Curioso / Pesquisando | 1148 | 2.2% |
| NF - Preço | 1069 | 2.1% |
| NF - Região / Localidade | 970 | 1.9% |
| NF - Quer Hora Avulsa | 918 | 1.8% |
| Other values (24) | 3540 | 6.8% |
| (Missing) | 6597 | 12.7% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
|---|---|---|
| | 47620 | 18.0% |
| nf | 45494 | 17.2% |
| desistência | 17873 | 6.8% |
| após | 17873 | 6.8% |
| tentativas | 17873 | 6.8% |
| não | 15001 | 5.7% |
| lead | 12451 | 4.7% |
| é | 12043 | 4.6% |
| profissional | 12043 | 4.6% |
| da | 12043 | 4.6% |
| Other values (84) | 54227 | 20.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| | 219038 | 15.3% |
| a | 144942 | 10.1% |
| s | 136023 | 9.5% |
| i | 100526 | 7.0% |
| t | 95673 | 6.7% |
| e | 89851 | 6.3% |
| n | 79019 | 5.5% |
| o | 70070 | 4.9% |
| d | 48546 | 3.4% |
| N | 45749 | 3.2% |
| Other values (41) | 404960 | 28.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1434397 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1434397 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1434397 |
Interactions
Correlations
| | closer | id | indicado | motivoPerda | profissao | sdr | tipo | utm_source |
|---|---|---|---|---|---|---|---|---|
| closer | 1.000 | 0.253 | 0.066 | 0.082 | 0.069 | 0.208 | 0.144 | 0.178 |
| id | 0.253 | 1.000 | 0.051 | 0.090 | 0.081 | 0.286 | 0.024 | 0.150 |
| indicado | 0.066 | 0.051 | 1.000 | 0.082 | 0.116 | 0.045 | 0.102 | 0.803 |
| motivoPerda | 0.082 | 0.090 | 0.082 | 1.000 | 0.202 | 0.142 | 0.707 | 0.076 |
| profissao | 0.069 | 0.081 | 0.116 | 0.202 | 1.000 | 0.051 | 0.372 | 0.140 |
| sdr | 0.208 | 0.286 | 0.045 | 0.142 | 0.051 | 1.000 | 0.078 | 0.052 |
| tipo | 0.144 | 0.024 | 0.102 | 0.707 | 0.372 | 0.078 | 1.000 | 0.720 |
| utm_source | 0.178 | 0.150 | 0.803 | 0.076 | 0.140 | 0.052 | 0.720 | 1.000 |
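Most pairs in this matrix involve categorical variables. ydata-profiling's default `auto` correlation is understood to score such pairs with an association measure based on Cramér's V, although the report itself does not state which measure was used. The following is a minimal sketch of the plain (not bias-corrected) statistic. The function name and the toy contingency tables are illustrative and are not taken from this dataset.

```python
import math

def cramers_v(table):
    """Plain Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0
    return math.sqrt(chi2 / (n * k))

# Hypothetical tables: perfect association, then complete independence.
print(cramers_v([[10, 0], [0, 10]]))  # 1.0
print(cramers_v([[5, 5], [5, 5]]))    # 0.0
```

Values near 1, such as the 0.803 between indicado and utm_source, mean that knowing one variable largely determines the other.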
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
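Nullity correlation can be read as the Pearson correlation between two columns' missing-value indicators. The sketch below assumes that definition. The function name and the toy columns are hypothetical, with `None` standing in for a missing value.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns.

    Returns None when either column is entirely missing or entirely present,
    because the correlation is undefined for a constant indicator.
    """
    a = [1.0 if value is None else 0.0 for value in col_a]
    b = [1.0 if value is None else 0.0 for value in col_b]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / math.sqrt(var_a * var_b)

# Hypothetical columns: missing together, then missing in opposite rows.
together = nullity_correlation([None, "x", None, "y"], [None, 1, None, 2])
opposite = nullity_correlation([None, "x", None, "y"], [1, None, 2, None])
print(together, opposite)  # 1.0 -1.0
```

A value near 1 means two columns tend to be missing in the same rows. A value near -1 means one tends to be filled exactly when the other is missing.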
Sample
First rows
| | id | data_criacao | data_perda | data_venda | utm_source | sdr | closer | profissao | indicado | tipo | motivoPerda |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 79342 | 2024-04-01 | 2024-07-24 | 2024-04-18 | typeform-Gentileza | NaN | Raquel | Psicologia | Indicado | Qualificado | NaN |
| 1 | 79305 | 2024-04-01 | 2024-09-11 | 2024-04-04 | NaN | NaN | Raquel | Nutrição | Não Indicado | Qualificado | NaN |
| 2 | 79271 | 2024-04-01 | 2024-07-26 | 2024-04-02 | | NaN | Bianca | Psicologia | Não Indicado | Qualificado | NaN |
| 3 | 79248 | 2024-04-01 | 2024-12-06 | 2024-04-01 | | NaN | Bianca | Odontologia | Indicado | Qualificado | NaN |
| 4 | 79226 | 2024-04-01 | 2024-11-25 | 2024-04-30 | typeform-BLIP | NaN | Adriele | Outros | Não Indicado | Qualificado | NaN |
| 5 | 79345 | 2024-04-01 | 2025-01-29 | 2024-04-05 | typeform-Ex Membro | NaN | Camila | Medicina | Não Indicado | Qualificado | NaN |
| 6 | 79301 | 2024-04-01 | 2024-12-17 | 2024-04-05 | typeform-Ex Membro | NaN | Barbara | Psicologia | Não Indicado | Qualificado | NaN |
| 7 | 79322 | 2024-04-01 | 2025-01-06 | 2024-04-04 | | NaN | Bianca | Odontologia | Não Indicado | Qualificado | NaN |
| 8 | 79285 | 2024-04-01 | 2024-05-03 | 2024-04-01 | typeform-Indicação Interna | NaN | Lana | Fisioterapia | Indicado | Qualificado | NaN |
| 9 | 79238 | 2024-04-01 | 2025-09-06 | 2024-04-09 | | NaN | Raquel | Medicina | Indicado | Qualificado | NaN |
Last rows
| | id | data_criacao | data_perda | data_venda | utm_source | sdr | closer | profissao | indicado | tipo | motivoPerda |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 52090 | 133288 | 2025-04-30 | NaT | 2025-05-19 | typeform-Outros | Lucas | Thiago | Medicina | Não Indicado | Qualificado | NaN |
| 52091 | 133287 | 2025-04-30 | NaT | 2025-05-16 | | Eric | Bianca | Medicina | Não Indicado | Qualificado | NaN |
| 52092 | 133275 | 2025-04-30 | NaT | NaT | | Lucas | NaN | Psicologia | Não Indicado | Qualificado | NaN |
| 52093 | 133271 | 2025-04-30 | NaT | 2025-04-30 | typeform-Ex Membro | NaN | Luan | Medicina | Não Indicado | Qualificado | NaN |
| 52094 | 133265 | 2025-04-30 | NaT | 2025-05-15 | | Leticia | Thiago | Psicologia | Não Indicado | Qualificado | NaN |
| 52095 | 133244 | 2025-04-30 | NaT | 2025-09-09 | Não informado | Leticia | Karen | Medicina | Não Indicado | Qualificado | NaN |
| 52096 | 133242 | 2025-04-30 | NaT | 2025-04-30 | typeform-Outros | Ana | Debora | Medicina | Não Indicado | Qualificado | NaN |
| 52097 | 133225 | 2025-04-30 | NaT | 2025-05-09 | | Ana | Caio | Enfermagem | Não Indicado | Qualificado | NaN |
| 52098 | 133219 | 2025-04-30 | NaT | 2025-05-12 | | Milena | Caio | Enfermagem | Não Indicado | Qualificado | NaN |
| 52099 | 133214 | 2025-04-30 | NaT | 2025-05-06 | Whatsapp Oficial | NaN | Bianca | Outros | Não Indicado | Qualificado | NaN |