Dataset statistics
Number of variables | 18 |
---|---|
Number of observations | 61 |
Missing cells | 337 |
Missing cells (%) | 30.7% |
Duplicate rows | 1 |
Duplicate rows (%) | 1.6% |
Total size in memory | 9.7 KiB |
Average record size in memory | 163.2 B |
Variable types
Text | 1 |
---|---|
Numeric | 7 |
Unsupported | 1 |
Categorical | 9 |
Dataset
Description | 부산광역시 동래구 가로수 현황에 대한 데이터로 노선명, 은행나무, 왕벚나무, 플라타너스, 히말라야시다, 느티나무, 메타세콰이어, 가시나무, 곰솔, 후박나무, 가중나무, 단풍나무, 향나무, 먼나무, 이팝나무, 회화나무, 튤립나무, 가로수연장(킬로미터)에 대한 항목을 제공합니다. |
---|---|
Author | 부산광역시 동래구 |
URL | https://www.data.go.kr/data/3079676/fileData.do |
Dataset has 1 (1.6%) duplicate rows | Duplicates |
메타세콰이어 is highly overall correlated with 느티나무 and 2 other fields | High correlation |
후박나무 is highly overall correlated with 은행나무 and 4 other fields | High correlation |
히말라야시다 is highly overall correlated with 은행나무 and 7 other fields | High correlation |
은행나무 is highly overall correlated with 왕벚나무 and 5 other fields | High correlation |
왕벚나무 is highly overall correlated with 은행나무 and 7 other fields | High correlation |
느티나무 is highly overall correlated with 왕벚나무 and 6 other fields | High correlation |
가시나무 is highly overall correlated with 은행나무 and 7 other fields | High correlation |
먼나무 is highly overall correlated with 은행나무 and 6 other fields | High correlation |
이팝나무 is highly overall correlated with 왕벚나무 and 4 other fields | High correlation |
가로수연장(km) is highly overall correlated with 왕벚나무 and 6 other fields | High correlation |
단풍나무 is highly overall correlated with 느티나무 and 1 other fields | High correlation |
향나무 is highly overall correlated with 은행나무 and 3 other fields | High correlation |
히말라야시다 is highly imbalanced (73.9%) | Imbalance |
메타세콰이어 is highly imbalanced (82.0%) | Imbalance |
곰솔 is highly imbalanced (87.9%) | Imbalance |
후박나무 is highly imbalanced (82.0%) | Imbalance |
가중나무 is highly imbalanced (87.9%) | Imbalance |
단풍나무 is highly imbalanced (84.8%) | Imbalance |
향나무 is highly imbalanced (84.8%) | Imbalance |
회화나무 is highly imbalanced (87.9%) | Imbalance |
튤립나무 is highly imbalanced (87.9%) | Imbalance |
은행나무 has 42 (68.9%) missing values | Missing |
왕벚나무 has 51 (83.6%) missing values | Missing |
플라타너스 has 61 (100.0%) missing values | Missing |
느티나무 has 45 (73.8%) missing values | Missing |
가시나무 has 54 (88.5%) missing values | Missing |
먼나무 has 51 (83.6%) missing values | Missing |
이팝나무 has 33 (54.1%) missing values | Missing |
플라타너스 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 23:35:43.599535 |
---|---|
Analysis finished | 2023-12-12 23:35:50.324107 |
Duration | 6.72 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
노선명
Text
Distinct | 60 |
---|---|
Distinct (%) | 98.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
Value | Count | Frequency (%) |
명장로67번길 | 2 | 3.2% |
명륜로129번길 | 1 | 1.6% |
체육공원로 | 1 | 1.6% |
동래로 | 1 | 1.6% |
수안로 | 1 | 1.6% |
낙민로 | 1 | 1.6% |
안락로 | 1 | 1.6% |
연안로 | 1 | 1.6% |
안연로 | 1 | 1.6% |
안남로 | 1 | 1.6% |
Other values (51) | 51 |
Most occurring characters
Value | Count | Frequency (%) |
로 | 67 | 16.2% |
번 | 29 | 7.0% |
길 | 29 | 7.0% |
1 | 19 | 4.6% |
대 | 15 | 3.6% |
2 | 14 | 3.4% |
장 | 11 | 2.7% |
3 | 11 | 2.7% |
충 | 9 | 2.2% |
5 | 9 | 2.2% |
Other values (68) | 201 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 301 | |
Decimal Number | 95 | 22.9% |
Dash Punctuation | 6 | 1.4% |
Close Punctuation | 5 | 1.2% |
Open Punctuation | 5 | 1.2% |
Other Punctuation | 1 | 0.2% |
Space Separator | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 67 | |
번 | 29 | 9.6% |
길 | 29 | 9.6% |
대 | 15 | 5.0% |
장 | 11 | 3.7% |
충 | 9 | 3.0% |
렬 | 9 | 3.0% |
아 | 8 | 2.7% |
천 | 8 | 2.7% |
중 | 8 | 2.7% |
Other values (53) | 108 |
Decimal Number
Value | Count | Frequency (%) |
1 | 19 | |
2 | 14 | |
3 | 11 | |
5 | 9 | |
4 | 9 | |
0 | 8 | |
7 | 7 | 7.4% |
8 | 6 | 6.3% |
9 | 6 | 6.3% |
6 | 6 | 6.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6 |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 301 | |
Common | 113 | 27.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 67 | |
번 | 29 | 9.6% |
길 | 29 | 9.6% |
대 | 15 | 5.0% |
장 | 11 | 3.7% |
충 | 9 | 3.0% |
렬 | 9 | 3.0% |
아 | 8 | 2.7% |
천 | 8 | 2.7% |
중 | 8 | 2.7% |
Other values (53) | 108 |
Common
Value | Count | Frequency (%) |
1 | 19 | |
2 | 14 | |
3 | 11 | |
5 | 9 | |
4 | 9 | |
0 | 8 | |
7 | 7 | 6.2% |
8 | 6 | 5.3% |
- | 6 | 5.3% |
9 | 6 | 5.3% |
Other values (5) | 18 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 301 | |
ASCII | 113 | 27.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
로 | 67 | |
번 | 29 | 9.6% |
길 | 29 | 9.6% |
대 | 15 | 5.0% |
장 | 11 | 3.7% |
충 | 9 | 3.0% |
렬 | 9 | 3.0% |
아 | 8 | 2.7% |
천 | 8 | 2.7% |
중 | 8 | 2.7% |
Other values (53) | 108 |
ASCII
Value | Count | Frequency (%) |
1 | 19 | |
2 | 14 | |
3 | 11 | |
5 | 9 | |
4 | 9 | |
0 | 8 | |
7 | 7 | 6.2% |
8 | 6 | 5.3% |
- | 6 | 5.3% |
9 | 6 | 5.3% |
Other values (5) | 18 |
은행나무
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 17 |
---|---|
Distinct (%) | 89.5% |
Missing | 42 |
Missing (%) | 68.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 103.63158 |
Minimum | 1 |
---|---|
Maximum | 432 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 681.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 9.1 |
Q1 | 21.5 |
median | 45 |
Q3 | 117 |
95-th percentile | 323.1 |
Maximum | 432 |
Range | 431 |
Interquartile range (IQR) | 95.5 |
Descriptive statistics
Standard deviation | 122.78763 |
---|---|
Coefficient of variation (CV) | 1.1848476 |
Kurtosis | 1.7127287 |
Mean | 103.63158 |
Median Absolute Deviation (MAD) | 35 |
Skewness | 1.5813895 |
Sum | 1969 |
Variance | 15076.801 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
23 | 2 | 3.3% |
40 | 2 | 3.3% |
225 | 1 | 1.6% |
1 | 1 | 1.6% |
14 | 1 | 1.6% |
98 | 1 | 1.6% |
87 | 1 | 1.6% |
10 | 1 | 1.6% |
293 | 1 | 1.6% |
432 | 1 | 1.6% |
Other values (7) | 7 | 11.5% |
(Missing) | 42 |
Value | Count | Frequency (%) |
1 | 1 | |
10 | 1 | |
12 | 1 | |
14 | 1 | |
20 | 1 | |
23 | 2 | |
40 | 2 | |
45 | 1 | |
70 | 1 | |
87 | 1 |
Value | Count | Frequency (%) |
432 | 1 | |
311 | 1 | |
293 | 1 | |
225 | 1 | |
136 | 1 | |
98 | 1 | |
89 | 1 | |
87 | 1 | |
70 | 1 | |
45 | 1 |
왕벚나무
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 51 |
Missing (%) | 83.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 104.2 |
Minimum | 2 |
---|---|
Maximum | 300 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 681.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 3.8 |
Q1 | 29.5 |
median | 67.5 |
Q3 | 166.75 |
95-th percentile | 259.5 |
Maximum | 300 |
Range | 298 |
Interquartile range (IQR) | 137.25 |
Descriptive statistics
Standard deviation | 99.258137 |
---|---|
Coefficient of variation (CV) | 0.95257329 |
Kurtosis | -0.11173638 |
Mean | 104.2 |
Median Absolute Deviation (MAD) | 63.5 |
Skewness | 0.89085365 |
Sum | 1042 |
Variance | 9852.1778 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
145 | 1 | 1.6% |
210 | 1 | 1.6% |
174 | 1 | 1.6% |
2 | 1 | 1.6% |
300 | 1 | 1.6% |
68 | 1 | 1.6% |
46 | 1 | 1.6% |
67 | 1 | 1.6% |
6 | 1 | 1.6% |
24 | 1 | 1.6% |
(Missing) | 51 |
Value | Count | Frequency (%) |
2 | 1 | |
6 | 1 | |
24 | 1 | |
46 | 1 | |
67 | 1 | |
68 | 1 | |
145 | 1 | |
174 | 1 | |
210 | 1 | |
300 | 1 |
Value | Count | Frequency (%) |
300 | 1 | |
210 | 1 | |
174 | 1 | |
145 | 1 | |
68 | 1 | |
67 | 1 | |
46 | 1 | |
24 | 1 | |
6 | 1 | |
2 | 1 |
플라타너스
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 61 |
---|---|
Missing (%) | 100.0% |
Memory size | 681.0 B |
히말라야시다
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 6.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
1 | 3 |
4 | 1 |
34 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.7704918 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 56 | |
1 | 3 | 4.9% |
4 | 1 | 1.6% |
34 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 56 | |
1 | 3 | 4.9% |
4 | 1 | 1.6% |
34 | 1 | 1.6% |
느티나무
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 13 |
---|---|
Distinct (%) | 81.2% |
Missing | 45 |
Missing (%) | 73.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31.6875 |
Minimum | 1 |
---|---|
Maximum | 179 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 681.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 9.5 |
Q3 | 43 |
95-th percentile | 104 |
Maximum | 179 |
Range | 178 |
Interquartile range (IQR) | 39.25 |
Descriptive statistics
Standard deviation | 46.899494 |
---|---|
Coefficient of variation (CV) | 1.4800629 |
Kurtosis | 6.2478729 |
Mean | 31.6875 |
Median Absolute Deviation (MAD) | 8.5 |
Skewness | 2.3410129 |
Sum | 507 |
Variance | 2199.5625 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3 | 4.9% |
6 | 2 | 3.3% |
5 | 1 | 1.6% |
3 | 1 | 1.6% |
58 | 1 | 1.6% |
15 | 1 | 1.6% |
179 | 1 | 1.6% |
67 | 1 | 1.6% |
79 | 1 | 1.6% |
13 | 1 | 1.6% |
Other values (3) | 3 | 4.9% |
(Missing) | 45 |
Value | Count | Frequency (%) |
1 | 3 | |
3 | 1 | 1.6% |
4 | 1 | 1.6% |
5 | 1 | 1.6% |
6 | 2 | |
13 | 1 | 1.6% |
15 | 1 | 1.6% |
31 | 1 | 1.6% |
38 | 1 | 1.6% |
58 | 1 | 1.6% |
Value | Count | Frequency (%) |
179 | 1 | |
79 | 1 | |
67 | 1 | |
58 | 1 | |
38 | 1 | |
31 | 1 | |
15 | 1 | |
13 | 1 | |
6 | 2 | |
5 | 1 |
메타세콰이어
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 6.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
2 | 1 |
54 | 1 |
7 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8688525 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 4.9% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 58 | |
2 | 1 | 1.6% |
54 | 1 | 1.6% |
7 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 58 | |
2 | 1 | 1.6% |
54 | 1 | 1.6% |
7 | 1 | 1.6% |
가시나무
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 85.7% |
Missing | 54 |
Missing (%) | 88.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26.571429 |
Minimum | 6 |
---|---|
Maximum | 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 681.0 B |
Quantile statistics
Minimum | 6 |
---|---|
5-th percentile | 7.2 |
Q1 | 10 |
median | 12 |
Q3 | 19.5 |
95-th percentile | 82.9 |
Maximum | 109 |
Range | 103 |
Interquartile range (IQR) | 9.5 |
Descriptive statistics
Standard deviation | 36.723549 |
---|---|
Coefficient of variation (CV) | 1.382069 |
Kurtosis | 6.5435972 |
Mean | 26.571429 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 2.5355641 |
Sum | 186 |
Variance | 1348.619 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 2 | 3.3% |
22 | 1 | 1.6% |
17 | 1 | 1.6% |
6 | 1 | 1.6% |
109 | 1 | 1.6% |
12 | 1 | 1.6% |
(Missing) | 54 |
Value | Count | Frequency (%) |
6 | 1 | |
10 | 2 | |
12 | 1 | |
17 | 1 | |
22 | 1 | |
109 | 1 |
Value | Count | Frequency (%) |
109 | 1 | |
22 | 1 | |
17 | 1 | |
12 | 1 | |
10 | 2 | |
6 | 1 |
곰솔
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
67 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9672131 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.6% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 60 | |
67 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 60 | |
67 | 1 | 1.6% |
후박나무
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 6.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
222 | 1 |
1 | 1 |
5 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8852459 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 4.9% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 58 | |
222 | 1 | 1.6% |
1 | 1 | 1.6% |
5 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 58 | |
222 | 1 | 1.6% |
1 | 1 | 1.6% |
5 | 1 | 1.6% |
가중나무
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
7 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9508197 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.6% |
Sample
1st row | 7 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 60 | |
7 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 60 | |
7 | 1 | 1.6% |
단풍나무
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 4.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
17 | 1 |
48 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9344262 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | <NA> |
---|---|
2nd row | 17 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 59 | |
17 | 1 | 1.6% |
48 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 59 | |
17 | 1 | 1.6% |
48 | 1 | 1.6% |
향나무
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 4.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
10 | 1 |
6 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9180328 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | 10 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 59 | |
10 | 1 | 1.6% |
6 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 59 | |
10 | 1 | 1.6% |
6 | 1 | 1.6% |
먼나무
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 90.0% |
Missing | 51 |
Missing (%) | 83.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20.5 |
Minimum | 5 |
---|---|
Maximum | 93 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 681.0 B |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 5.45 |
Q1 | 8.25 |
median | 9.5 |
Q3 | 12.5 |
95-th percentile | 69.6 |
Maximum | 93 |
Range | 88 |
Interquartile range (IQR) | 4.25 |
Descriptive statistics
Standard deviation | 27.496464 |
---|---|
Coefficient of variation (CV) | 1.3412909 |
Kurtosis | 6.3475902 |
Mean | 20.5 |
Median Absolute Deviation (MAD) | 2.5 |
Skewness | 2.5086158 |
Sum | 205 |
Variance | 756.05556 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 2 | 3.3% |
6 | 1 | 1.6% |
8 | 1 | 1.6% |
11 | 1 | 1.6% |
93 | 1 | 1.6% |
41 | 1 | 1.6% |
13 | 1 | 1.6% |
5 | 1 | 1.6% |
10 | 1 | 1.6% |
(Missing) | 51 |
Value | Count | Frequency (%) |
5 | 1 | |
6 | 1 | |
8 | 1 | |
9 | 2 | |
10 | 1 | |
11 | 1 | |
13 | 1 | |
41 | 1 | |
93 | 1 |
Value | Count | Frequency (%) |
93 | 1 | |
41 | 1 | |
13 | 1 | |
11 | 1 | |
10 | 1 | |
9 | 2 | |
8 | 1 | |
6 | 1 | |
5 | 1 |
이팝나무
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 23 |
---|---|
Distinct (%) | 82.1% |
Missing | 33 |
Missing (%) | 54.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52.535714 |
Minimum | 4 |
---|---|
Maximum | 223 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 681.0 B |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 6.7 |
Q1 | 12.25 |
median | 31.5 |
Q3 | 55 |
95-th percentile | 180.95 |
Maximum | 223 |
Range | 219 |
Interquartile range (IQR) | 42.75 |
Descriptive statistics
Standard deviation | 59.67969 |
---|---|
Coefficient of variation (CV) | 1.1359832 |
Kurtosis | 2.1178852 |
Mean | 52.535714 |
Median Absolute Deviation (MAD) | 22 |
Skewness | 1.7024653 |
Sum | 1471 |
Variance | 3561.6653 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8 | 4 | 6.6% |
48 | 2 | 3.3% |
13 | 2 | 3.3% |
223 | 1 | 1.6% |
18 | 1 | 1.6% |
42 | 1 | 1.6% |
40 | 1 | 1.6% |
27 | 1 | 1.6% |
82 | 1 | 1.6% |
113 | 1 | 1.6% |
Other values (13) | 13 | 21.3% |
(Missing) | 33 |
Value | Count | Frequency (%) |
4 | 1 | 1.6% |
6 | 1 | 1.6% |
8 | 4 | |
10 | 1 | 1.6% |
13 | 2 | |
15 | 1 | 1.6% |
18 | 1 | 1.6% |
20 | 1 | 1.6% |
27 | 1 | 1.6% |
29 | 1 | 1.6% |
Value | Count | Frequency (%) |
223 | 1 | |
196 | 1 | |
153 | 1 | |
147 | 1 | |
113 | 1 | |
82 | 1 | |
58 | 1 | |
54 | 1 | |
48 | 2 | |
46 | 1 |
회화나무
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
47 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9672131 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.6% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 60 | |
47 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 60 | |
47 | 1 | 1.6% |
튤립나무
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 620.0 B |
<NA> | |
---|---|
62 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9672131 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.6% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 60 | |
62 | 1 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 60 | |
62 | 1 | 1.6% |
가로수연장(km)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 40 |
---|---|
Distinct (%) | 65.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.81278689 |
Minimum | 0.05 |
---|---|
Maximum | 5.5 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 681.0 B |
Quantile statistics
Minimum | 0.05 |
---|---|
5-th percentile | 0.07 |
Q1 | 0.16 |
median | 0.3 |
Q3 | 0.8 |
95-th percentile | 2.95 |
Maximum | 5.5 |
Range | 5.45 |
Interquartile range (IQR) | 0.64 |
Descriptive statistics
Standard deviation | 1.2028745 |
---|---|
Coefficient of variation (CV) | 1.4799384 |
Kurtosis | 6.459338 |
Mean | 0.81278689 |
Median Absolute Deviation (MAD) | 0.2 |
Skewness | 2.5334808 |
Sum | 49.58 |
Variance | 1.4469071 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.2 | 4 | 6.6% |
0.15 | 4 | 6.6% |
0.35 | 3 | 4.9% |
0.3 | 3 | 4.9% |
0.1 | 3 | 4.9% |
0.6 | 3 | 4.9% |
0.09 | 2 | 3.3% |
0.25 | 2 | 3.3% |
1.5 | 2 | 3.3% |
0.05 | 2 | 3.3% |
Other values (30) | 33 |
Value | Count | Frequency (%) |
0.05 | 2 | |
0.06 | 1 | 1.6% |
0.07 | 1 | 1.6% |
0.08 | 1 | 1.6% |
0.09 | 2 | |
0.1 | 3 | |
0.12 | 1 | 1.6% |
0.15 | 4 | |
0.16 | 1 | 1.6% |
0.18 | 1 | 1.6% |
Value | Count | Frequency (%) |
5.5 | 1 | |
5.0 | 1 | |
4.8 | 1 | |
2.95 | 1 | |
2.5 | 1 | |
2.4 | 1 | |
2.3 | 1 | |
2.0 | 1 | |
1.8 | 1 | |
1.5 | 2 |
노선명 | 은행나무 | 왕벚나무 | 히말라야시다 | 느티나무 | 메타세콰이어 | 가시나무 | 후박나무 | 단풍나무 | 향나무 | 먼나무 | 이팝나무 | 가로수연장(km) | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
노선명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 |
은행나무 | 1.000 | 1.000 | 1.000 | 1.000 | 0.323 | NaN | 1.000 | 0.000 | NaN | 0.000 | 0.000 | 0.926 | 0.612 |
왕벚나무 | 1.000 | 1.000 | 1.000 | 0.000 | NaN | NaN | 0.000 | NaN | NaN | 0.000 | 1.000 | NaN | 0.869 |
히말라야시다 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | NaN | NaN | 0.000 | NaN | NaN | NaN | 0.000 | 0.416 |
느티나무 | 1.000 | 0.323 | NaN | 1.000 | 1.000 | 0.000 | 1.000 | NaN | NaN | NaN | 0.000 | NaN | 0.000 |
메타세콰이어 | 1.000 | NaN | NaN | NaN | 0.000 | 1.000 | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 |
가시나무 | 1.000 | 1.000 | 0.000 | NaN | 1.000 | NaN | 1.000 | NaN | NaN | NaN | NaN | 0.000 | 0.672 |
후박나무 | 1.000 | 0.000 | NaN | 0.000 | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN | NaN | 1.000 |
단풍나무 | 0.000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN | 0.000 |
향나무 | 0.000 | 0.000 | 0.000 | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN |
먼나무 | 1.000 | 0.000 | 1.000 | NaN | 0.000 | NaN | NaN | NaN | NaN | NaN | 1.000 | NaN | 0.729 |
이팝나무 | 1.000 | 0.926 | NaN | 0.000 | NaN | NaN | 0.000 | NaN | NaN | NaN | NaN | 1.000 | 0.831 |
가로수연장(km) | 1.000 | 0.612 | 0.869 | 0.416 | 0.000 | 1.000 | 0.672 | 1.000 | 0.000 | NaN | 0.729 | 0.831 | 1.000 |
메타세콰이어 | 후박나무 | 곰솔 | 히말라야시다 | 가중나무 | 회화나무 | 튤립나무 | 단풍나무 | 향나무 | |
---|---|---|---|---|---|---|---|---|---|
메타세콰이어 | 1.000 | NaN | NaN | 1.000 | NaN | NaN | NaN | NaN | NaN |
후박나무 | NaN | 1.000 | NaN | 1.000 | NaN | NaN | NaN | NaN | NaN |
곰솔 | NaN | NaN | 1.000 | NaN | NaN | NaN | NaN | NaN | NaN |
히말라야시다 | 1.000 | 1.000 | NaN | 1.000 | NaN | NaN | NaN | NaN | NaN |
가중나무 | NaN | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN | NaN |
회화나무 | NaN | NaN | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN |
튤립나무 | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 | NaN | NaN |
단풍나무 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 | NaN |
향나무 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 |
은행나무 | 왕벚나무 | 느티나무 | 가시나무 | 먼나무 | 이팝나무 | 가로수연장(km) | 히말라야시다 | 메타세콰이어 | 곰솔 | 후박나무 | 가중나무 | 단풍나무 | 향나무 | 회화나무 | 튤립나무 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
은행나무 | 1.000 | -1.000 | -0.288 | 0.500 | 0.500 | 0.462 | 0.377 | 1.000 | NaN | NaN | 1.000 | NaN | NaN | 1.000 | NaN | NaN |
왕벚나무 | -1.000 | 1.000 | 0.800 | -1.000 | -0.500 | 1.000 | 0.794 | 1.000 | NaN | NaN | NaN | NaN | NaN | 1.000 | 0.000 | 0.000 |
느티나무 | -0.288 | 0.800 | 1.000 | -0.500 | -0.738 | 1.000 | 0.024 | 1.000 | 1.000 | 0.000 | NaN | NaN | 1.000 | NaN | 0.000 | NaN |
가시나무 | 0.500 | -1.000 | -0.500 | 1.000 | -1.000 | 1.000 | 0.649 | 1.000 | NaN | NaN | 1.000 | 0.000 | NaN | NaN | 0.000 | 0.000 |
먼나무 | 0.500 | -0.500 | -0.738 | -1.000 | 1.000 | NaN | -0.223 | 1.000 | NaN | NaN | 1.000 | NaN | 0.000 | 1.000 | 0.000 | 0.000 |
이팝나무 | 0.462 | 1.000 | 1.000 | 1.000 | NaN | 1.000 | 0.695 | 1.000 | 0.000 | NaN | NaN | 0.000 | NaN | NaN | NaN | 0.000 |
가로수연장(km) | 0.377 | 0.794 | 0.024 | 0.649 | -0.223 | 0.695 | 1.000 | 0.000 | 1.000 | NaN | 1.000 | NaN | 1.000 | 1.000 | NaN | NaN |
히말라야시다 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | NaN | 1.000 | 0.000 | NaN | NaN | 0.000 | NaN |
메타세콰이어 | NaN | NaN | 1.000 | NaN | NaN | 0.000 | 1.000 | 1.000 | 1.000 | 0.000 | NaN | 0.000 | NaN | 0.000 | 0.000 | 0.000 |
곰솔 | NaN | NaN | 0.000 | NaN | NaN | NaN | NaN | NaN | 0.000 | 1.000 | NaN | 0.000 | 0.000 | NaN | 0.000 | 0.000 |
후박나무 | 1.000 | NaN | NaN | 1.000 | 1.000 | NaN | 1.000 | 1.000 | NaN | NaN | 1.000 | 0.000 | 0.000 | NaN | 0.000 | 0.000 |
가중나무 | NaN | NaN | NaN | 0.000 | NaN | 0.000 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | NaN | 0.000 | 0.000 |
단풍나무 | NaN | NaN | 1.000 | NaN | 0.000 | NaN | 1.000 | NaN | NaN | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
향나무 | 1.000 | 1.000 | NaN | NaN | 1.000 | NaN | 1.000 | NaN | 0.000 | NaN | NaN | NaN | 0.000 | 1.000 | 0.000 | 0.000 |
회화나무 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | NaN | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
튤립나무 | NaN | 0.000 | NaN | 0.000 | 0.000 | 0.000 | NaN | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
노선명 | 은행나무 | 왕벚나무 | 플라타너스 | 히말라야시다 | 느티나무 | 메타세콰이어 | 가시나무 | 곰솔 | 후박나무 | 가중나무 | 단풍나무 | 향나무 | 먼나무 | 이팝나무 | 회화나무 | 튤립나무 | 가로수연장(km) | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 중앙대로 | 293 | 145 | <NA> | <NA> | 5 | <NA> | <NA> | <NA> | <NA> | 7 | <NA> | 10 | 6 | <NA> | <NA> | <NA> | 5.5 |
1 | 충렬대로 | 432 | <NA> | <NA> | <NA> | 3 | <NA> | 22 | <NA> | <NA> | <NA> | 17 | <NA> | <NA> | 58 | <NA> | <NA> | 4.8 |
2 | 충렬대로107번길 | 70 | <NA> | <NA> | <NA> | 6 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 0.3 |
3 | 충렬대로238번길 | 12 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 0.2 |
4 | 충렬대로350번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 17 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 0.15 |
5 | 충렬대로410번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 8 | <NA> | <NA> | <NA> | 0.1 |
6 | 아시아드대로 | <NA> | <NA> | <NA> | 1 | 58 | 2 | 10 | <NA> | 222 | <NA> | <NA> | <NA> | 9 | <NA> | <NA> | <NA> | 1.8 |
7 | 아시아드대로146번길 | <NA> | <NA> | <NA> | <NA> | 1 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 11 | <NA> | <NA> | <NA> | 0.06 |
8 | 아시아드대로228번길 | 20 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 0.2 |
9 | 아시아드대로208번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 6 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 0.05 |
노선명 | 은행나무 | 왕벚나무 | 플라타너스 | 히말라야시다 | 느티나무 | 메타세콰이어 | 가시나무 | 곰솔 | 후박나무 | 가중나무 | 단풍나무 | 향나무 | 먼나무 | 이팝나무 | 회화나무 | 튤립나무 | 가로수연장(km) | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
51 | 명장로67번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 8 | <NA> | <NA> | 0.09 |
52 | 우장춘로9번길, 충렬대로75번길(중로3-224) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 113 | <NA> | <NA> | 0.6 |
53 | 금정마을로(중로1-142) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 82 | <NA> | <NA> | 0.42 |
54 | 우장춘로59번길(중로3-223) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 27 | <NA> | <NA> | 0.26 |
55 | 중앙대로1381번길(중로1-141) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 40 | <NA> | <NA> | 0.19 |
56 | 우장춘로18번길(중로2-206) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 42 | <NA> | <NA> | 0.24 |
57 | 차밭골로 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 9 | <NA> | <NA> | <NA> | 0.1 |
58 | 온천천로531번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 10 | <NA> | <NA> | <NA> | 0.15 |
59 | 충렬대로75번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 18 | <NA> | <NA> | 0.24 |
60 | 명장로67번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 8 | <NA> | <NA> | 0.09 |
Most frequently occurring
노선명 | 은행나무 | 왕벚나무 | 히말라야시다 | 느티나무 | 메타세콰이어 | 가시나무 | 곰솔 | 후박나무 | 가중나무 | 단풍나무 | 향나무 | 먼나무 | 이팝나무 | 회화나무 | 튤립나무 | 가로수연장(km) | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 명장로67번길 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 8 | <NA> | <NA> | 0.09 | 2 |