Overview

Dataset statistics

Number of variables13
Number of observations199
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.3 KiB
Average record size in memory109.7 B

Variable types

Text7
Numeric4
Categorical2

Alerts

1 has constant value ""Constant
4131010400003540002 is highly overall correlated with 323227High correlation
323227 is highly overall correlated with 4131010400003540002High correlation
시외 is highly imbalanced (75.7%)Imbalance
E02639 has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:46:03.115818
Analysis finished2023-12-10 06:46:06.519754
Duration3.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

E02639
Text

UNIQUE 

Distinct199
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:46:06.849798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters1194
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique199 ?
Unique (%)100.0%

Sample

1st rowE02338
2nd rowE02344
3rd rowE02541
4th rowE02275
5th rowE02490
ValueCountFrequency (%)
e02338 1
 
0.5%
e02666 1
 
0.5%
e02346 1
 
0.5%
e02540 1
 
0.5%
e02679 1
 
0.5%
e02568 1
 
0.5%
e02694 1
 
0.5%
e02452 1
 
0.5%
e02445 1
 
0.5%
e02489 1
 
0.5%
Other values (189) 189
95.0%
2023-12-10T15:46:07.422676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 257
21.5%
2 243
20.4%
E 199
16.7%
5 86
 
7.2%
4 82
 
6.9%
3 74
 
6.2%
6 72
 
6.0%
9 51
 
4.3%
7 50
 
4.2%
1 47
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 995
83.3%
Uppercase Letter 199
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 257
25.8%
2 243
24.4%
5 86
 
8.6%
4 82
 
8.2%
3 74
 
7.4%
6 72
 
7.2%
9 51
 
5.1%
7 50
 
5.0%
1 47
 
4.7%
8 33
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
E 199
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 995
83.3%
Latin 199
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 257
25.8%
2 243
24.4%
5 86
 
8.6%
4 82
 
8.2%
3 74
 
7.4%
6 72
 
7.2%
9 51
 
5.1%
7 50
 
5.0%
1 47
 
4.7%
8 33
 
3.3%
Latin
ValueCountFrequency (%)
E 199
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1194
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 257
21.5%
2 243
20.4%
E 199
16.7%
5 86
 
7.2%
4 82
 
6.9%
3 74
 
6.2%
6 72
 
6.0%
9 51
 
4.3%
7 50
 
4.2%
1 47
 
3.9%
Distinct193
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:46:07.811215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length4.5778894
Min length2

Characters and Unicode

Total characters911
Distinct characters205
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique187 ?
Unique (%)94.0%

Sample

1st row전곡
2nd row장풍리(시내)
3rd row북삼송
4th row풍림
5th row송내한전
ValueCountFrequency (%)
여주 2
 
1.0%
삼성 2
 
1.0%
안산 2
 
1.0%
검찰청입구 2
 
1.0%
수원 2
 
1.0%
이천 2
 
1.0%
송우리 1
 
0.5%
신갈(용인 1
 
0.5%
운심2리(시내 1
 
0.5%
전곡 1
 
0.5%
Other values (184) 184
92.0%
2023-12-10T15:46:08.383344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 49
 
5.4%
) 49
 
5.4%
46
 
5.0%
37
 
4.1%
34
 
3.7%
23
 
2.5%
19
 
2.1%
16
 
1.8%
14
 
1.5%
12
 
1.3%
Other values (195) 612
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 784
86.1%
Open Punctuation 49
 
5.4%
Close Punctuation 49
 
5.4%
Decimal Number 20
 
2.2%
Uppercase Letter 6
 
0.7%
Other Punctuation 2
 
0.2%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
5.9%
37
 
4.7%
34
 
4.3%
23
 
2.9%
19
 
2.4%
16
 
2.0%
14
 
1.8%
12
 
1.5%
11
 
1.4%
11
 
1.4%
Other values (181) 561
71.6%
Decimal Number
ValueCountFrequency (%)
2 7
35.0%
1 5
25.0%
3 5
25.0%
4 2
 
10.0%
6 1
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 2
33.3%
T 1
16.7%
P 1
16.7%
S 1
16.7%
G 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 784
86.1%
Common 121
 
13.3%
Latin 6
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
5.9%
37
 
4.7%
34
 
4.3%
23
 
2.9%
19
 
2.4%
16
 
2.0%
14
 
1.8%
12
 
1.5%
11
 
1.4%
11
 
1.4%
Other values (181) 561
71.6%
Common
ValueCountFrequency (%)
( 49
40.5%
) 49
40.5%
2 7
 
5.8%
1 5
 
4.1%
3 5
 
4.1%
. 2
 
1.7%
4 2
 
1.7%
6 1
 
0.8%
1
 
0.8%
Latin
ValueCountFrequency (%)
A 2
33.3%
T 1
16.7%
P 1
16.7%
S 1
16.7%
G 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 784
86.1%
ASCII 127
 
13.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 49
38.6%
) 49
38.6%
2 7
 
5.5%
1 5
 
3.9%
3 5
 
3.9%
. 2
 
1.6%
4 2
 
1.6%
A 2
 
1.6%
T 1
 
0.8%
P 1
 
0.8%
Other values (4) 4
 
3.1%
Hangul
ValueCountFrequency (%)
46
 
5.9%
37
 
4.7%
34
 
4.3%
23
 
2.9%
19
 
2.4%
16
 
2.0%
14
 
1.8%
12
 
1.5%
11
 
1.4%
11
 
1.4%
Other values (181) 561
71.6%
Distinct188
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:46:08.758635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length21.839196
Min length17

Characters and Unicode

Total characters4346
Distinct characters165
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique179 ?
Unique (%)89.9%

Sample

1st row경기도 연천군 전곡읍 전곡리 474-2번지
2nd row경기도 여주시 대신면 장풍리 165-5번지
3rd row경기도 고양시 덕양구 신원동 613번지
4th row경기도 성남시 분당구 서현동 255-1번지
5th row경기도 부천시 도당동 185-148번지
ValueCountFrequency (%)
경기도 199
 
21.0%
양평군 24
 
2.5%
용인시 19
 
2.0%
수원시 16
 
1.7%
고양시 16
 
1.7%
여주시 15
 
1.6%
안산시 11
 
1.2%
이천시 10
 
1.1%
처인구 9
 
0.9%
덕양구 9
 
0.9%
Other values (419) 620
65.4%
2023-12-10T15:46:09.324914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
749
 
17.2%
212
 
4.9%
207
 
4.8%
204
 
4.7%
200
 
4.6%
199
 
4.6%
175
 
4.0%
1 144
 
3.3%
- 140
 
3.2%
139
 
3.2%
Other values (155) 1977
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2678
61.6%
Decimal Number 779
 
17.9%
Space Separator 749
 
17.2%
Dash Punctuation 140
 
3.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
212
 
7.9%
207
 
7.7%
204
 
7.6%
200
 
7.5%
199
 
7.4%
175
 
6.5%
139
 
5.2%
82
 
3.1%
81
 
3.0%
78
 
2.9%
Other values (143) 1101
41.1%
Decimal Number
ValueCountFrequency (%)
1 144
18.5%
2 108
13.9%
3 89
11.4%
4 79
10.1%
5 78
10.0%
8 62
8.0%
7 59
7.6%
6 59
7.6%
0 51
 
6.5%
9 50
 
6.4%
Space Separator
ValueCountFrequency (%)
749
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 140
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2678
61.6%
Common 1668
38.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
212
 
7.9%
207
 
7.7%
204
 
7.6%
200
 
7.5%
199
 
7.4%
175
 
6.5%
139
 
5.2%
82
 
3.1%
81
 
3.0%
78
 
2.9%
Other values (143) 1101
41.1%
Common
ValueCountFrequency (%)
749
44.9%
1 144
 
8.6%
- 140
 
8.4%
2 108
 
6.5%
3 89
 
5.3%
4 79
 
4.7%
5 78
 
4.7%
8 62
 
3.7%
7 59
 
3.5%
6 59
 
3.5%
Other values (2) 101
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2678
61.6%
ASCII 1668
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
749
44.9%
1 144
 
8.6%
- 140
 
8.4%
2 108
 
6.5%
3 89
 
5.3%
4 79
 
4.7%
5 78
 
4.7%
8 62
 
3.7%
7 59
 
3.5%
6 59
 
3.5%
Other values (2) 101
 
6.1%
Hangul
ValueCountFrequency (%)
212
 
7.9%
207
 
7.7%
204
 
7.6%
200
 
7.5%
199
 
7.4%
175
 
6.5%
139
 
5.2%
82
 
3.1%
81
 
3.0%
78
 
2.9%
Other values (143) 1101
41.1%
Distinct119
Distinct (%)59.8%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:46:09.701962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length24
Mean length12.613065
Min length1

Characters and Unicode

Total characters2510
Distinct characters164
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)55.8%

Sample

1st row경기도 연천군 전곡읍 온골로 49
2nd rowX
3rd row경기도 고양시 덕양구 오금로 7
4th row경기도 성남시 분당구 서현로 170
5th row경기도 부천시 부천로264번길 4
ValueCountFrequency (%)
경기도 127
 
18.8%
x 72
 
10.7%
수원시 12
 
1.8%
용인시 11
 
1.6%
안산시 9
 
1.3%
이천시 8
 
1.2%
고양시 8
 
1.2%
안성시 8
 
1.2%
여주시 8
 
1.2%
상록구 7
 
1.0%
Other values (278) 404
59.9%
2023-12-10T15:46:10.260718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
475
18.9%
137
 
5.5%
133
 
5.3%
129
 
5.1%
126
 
5.0%
121
 
4.8%
1 97
 
3.9%
X 72
 
2.9%
2 56
 
2.2%
53
 
2.1%
Other values (154) 1111
44.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1544
61.5%
Space Separator 475
 
18.9%
Decimal Number 406
 
16.2%
Uppercase Letter 72
 
2.9%
Dash Punctuation 13
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
137
 
8.9%
133
 
8.6%
129
 
8.4%
126
 
8.2%
121
 
7.8%
53
 
3.4%
38
 
2.5%
38
 
2.5%
36
 
2.3%
27
 
1.7%
Other values (141) 706
45.7%
Decimal Number
ValueCountFrequency (%)
1 97
23.9%
2 56
13.8%
7 38
 
9.4%
6 36
 
8.9%
4 34
 
8.4%
3 34
 
8.4%
5 31
 
7.6%
0 29
 
7.1%
9 29
 
7.1%
8 22
 
5.4%
Space Separator
ValueCountFrequency (%)
475
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 72
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1544
61.5%
Common 894
35.6%
Latin 72
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
137
 
8.9%
133
 
8.6%
129
 
8.4%
126
 
8.2%
121
 
7.8%
53
 
3.4%
38
 
2.5%
38
 
2.5%
36
 
2.3%
27
 
1.7%
Other values (141) 706
45.7%
Common
ValueCountFrequency (%)
475
53.1%
1 97
 
10.9%
2 56
 
6.3%
7 38
 
4.3%
6 36
 
4.0%
4 34
 
3.8%
3 34
 
3.8%
5 31
 
3.5%
0 29
 
3.2%
9 29
 
3.2%
Other values (2) 35
 
3.9%
Latin
ValueCountFrequency (%)
X 72
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1544
61.5%
ASCII 966
38.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
475
49.2%
1 97
 
10.0%
X 72
 
7.5%
2 56
 
5.8%
7 38
 
3.9%
6 36
 
3.7%
4 34
 
3.5%
3 34
 
3.5%
5 31
 
3.2%
0 29
 
3.0%
Other values (3) 64
 
6.6%
Hangul
ValueCountFrequency (%)
137
 
8.9%
133
 
8.6%
129
 
8.4%
126
 
8.2%
121
 
7.8%
53
 
3.4%
38
 
2.5%
38
 
2.5%
36
 
2.3%
27
 
1.7%
Other values (141) 706
45.7%

520118
Real number (ℝ)

Distinct187
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean298461.4
Minimum1254
Maximum519853
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:46:10.417871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1254
5-th percentile52697.3
Q1175900.5
median302261
Q3424327
95-th percentile509868.4
Maximum519853
Range518599
Interquartile range (IQR)248426.5

Descriptive statistics

Standard deviation133882.87
Coefficient of variation (CV)0.44857685
Kurtosis-0.63110263
Mean298461.4
Median Absolute Deviation (MAD)125547
Skewness-0.28268441
Sum59393818
Variance1.7924624 × 1010
MonotonicityNot monotonic
2023-12-10T15:46:10.565645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
176714 3
 
1.5%
433053 3
 
1.5%
510456 2
 
1.0%
148297 2
 
1.0%
510736 2
 
1.0%
303764 2
 
1.0%
300021 2
 
1.0%
301912 2
 
1.0%
424327 2
 
1.0%
170943 2
 
1.0%
Other values (177) 177
88.9%
ValueCountFrequency (%)
1254 1
0.5%
1742 1
0.5%
1781 1
0.5%
3265 1
0.5%
3267 1
0.5%
4741 1
0.5%
5626 1
0.5%
7525 1
0.5%
7582 1
0.5%
32477 1
0.5%
ValueCountFrequency (%)
519853 1
0.5%
519834 1
0.5%
518577 1
0.5%
518539 1
0.5%
511318 1
0.5%
510736 2
1.0%
510456 2
1.0%
510106 1
0.5%
509842 1
0.5%
508564 1
0.5%

시외
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
시외
191 
고속
 
8

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시외
2nd row시외
3rd row시외
4th row시외
5th row시외

Common Values

ValueCountFrequency (%)
시외 191
96.0%
고속 8
 
4.0%

Length

2023-12-10T15:46:10.727242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:46:10.827681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시외 191
96.0%
고속 8
 
4.0%
Distinct196
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:46:11.185916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length33
Mean length19.613065
Min length11

Characters and Unicode

Total characters3903
Distinct characters215
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique194 ?
Unique (%)97.5%

Sample

1st row경기 연천군 전곡읍 온골로 49
2nd row경기 여주시 대신면 장풍리 165-5
3rd row경기 고양시 덕양구 신원동 613
4th row경기 성남시 분당구 서현동 255-1
5th row경기 부천시 원미구 도당동 185-148
ValueCountFrequency (%)
경기 173
 
17.6%
양평군 24
 
2.4%
용인시 19
 
1.9%
경기도 18
 
1.8%
수원시 16
 
1.6%
고양시 16
 
1.6%
여주시 15
 
1.5%
안산시 11
 
1.1%
이천시 10
 
1.0%
덕양구 9
 
0.9%
Other values (462) 674
68.4%
2023-12-10T15:46:11.725616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
787
20.2%
200
 
5.1%
194
 
5.0%
179
 
4.6%
1 160
 
4.1%
105
 
2.7%
- 99
 
2.5%
2 98
 
2.5%
85
 
2.2%
3 83
 
2.1%
Other values (205) 1913
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2257
57.8%
Space Separator 787
 
20.2%
Decimal Number 743
 
19.0%
Dash Punctuation 99
 
2.5%
Close Punctuation 6
 
0.2%
Open Punctuation 6
 
0.2%
Uppercase Letter 3
 
0.1%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
200
 
8.9%
194
 
8.6%
179
 
7.9%
105
 
4.7%
85
 
3.8%
78
 
3.5%
62
 
2.7%
60
 
2.7%
58
 
2.6%
54
 
2.4%
Other values (186) 1182
52.4%
Decimal Number
ValueCountFrequency (%)
1 160
21.5%
2 98
13.2%
3 83
11.2%
5 68
9.2%
4 65
8.7%
9 62
 
8.3%
6 62
 
8.3%
7 55
 
7.4%
8 49
 
6.6%
0 41
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
M 1
33.3%
U 1
33.3%
C 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
k 1
50.0%
Space Separator
ValueCountFrequency (%)
787
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 99
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2257
57.8%
Common 1641
42.0%
Latin 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
200
 
8.9%
194
 
8.6%
179
 
7.9%
105
 
4.7%
85
 
3.8%
78
 
3.5%
62
 
2.7%
60
 
2.7%
58
 
2.6%
54
 
2.4%
Other values (186) 1182
52.4%
Common
ValueCountFrequency (%)
787
48.0%
1 160
 
9.8%
- 99
 
6.0%
2 98
 
6.0%
3 83
 
5.1%
5 68
 
4.1%
4 65
 
4.0%
9 62
 
3.8%
6 62
 
3.8%
7 55
 
3.4%
Other values (4) 102
 
6.2%
Latin
ValueCountFrequency (%)
s 1
20.0%
k 1
20.0%
M 1
20.0%
U 1
20.0%
C 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2257
57.8%
ASCII 1646
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
787
47.8%
1 160
 
9.7%
- 99
 
6.0%
2 98
 
6.0%
3 83
 
5.0%
5 68
 
4.1%
4 65
 
3.9%
9 62
 
3.8%
6 62
 
3.8%
7 55
 
3.3%
Other values (9) 107
 
6.5%
Hangul
ValueCountFrequency (%)
200
 
8.9%
194
 
8.6%
179
 
7.9%
105
 
4.7%
85
 
3.8%
78
 
3.5%
62
 
2.7%
60
 
2.7%
58
 
2.6%
54
 
2.4%
Other values (186) 1182
52.4%

4131010400003540002
Real number (ℝ)

HIGH CORRELATION 

Distinct188
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1449601 × 1018
Minimum4.1111133 × 1018
Maximum4.183041 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:46:11.890166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.1111133 × 1018
5-th percentile4.1113137 × 1018
Q14.1271108 × 1018
median4.146125 × 1018
Q34.1650177 × 1018
95-th percentile4.1830331 × 1018
Maximum4.183041 × 1018
Range7.1927732 × 1016
Interquartile range (IQR)3.790691 × 1016

Descriptive statistics

Standard deviation2.3238886 × 1016
Coefficient of variation (CV)0.0056065403
Kurtosis-1.1117856
Mean4.1449601 × 1018
Median Absolute Deviation (MAD)1.8914999 × 1016
Skewness0.23010281
Sum-5.2564328 × 1018
Variance5.4004581 × 1032
MonotonicityNot monotonic
2023-12-10T15:46:12.086745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4155035031100830010 3
 
1.5%
4111313700011890000 3
 
1.5%
4127110800005900000 2
 
1.0%
4143010100002830005 2
 
1.0%
4150010300002190001 2
 
1.0%
4150031029003630010 2
 
1.0%
4167010200002740001 2
 
1.0%
4146310200003700008 2
 
1.0%
4121010600001710001 2
 
1.0%
4139013200018860005 1
 
0.5%
Other values (178) 178
89.4%
ValueCountFrequency (%)
4111113300003000000 1
 
0.5%
4111113400001650003 1
 
0.5%
4111113400002840005 1
 
0.5%
4111113600004950000 1
 
0.5%
4111113600008810000 1
 
0.5%
4111113600100010024 1
 
0.5%
4111312900004340003 1
 
0.5%
4111313100002430006 1
 
0.5%
4111313700011890000 3
1.5%
4111513400000080000 1
 
0.5%
ValueCountFrequency (%)
4183041032007330036 1
0.5%
4183040023002090024 1
0.5%
4183039525009280001 1
0.5%
4183038028102550010 1
0.5%
4183037023003650001 1
0.5%
4183034024101510001 1
0.5%
4183034023011960000 1
0.5%
4183034023008070001 1
0.5%
4183034023003350004 1
0.5%
4183034023000890001 1
0.5%

1
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
1
199 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 199
100.0%

Length

2023-12-10T15:46:12.291685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:46:12.404731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 199
100.0%

323227
Real number (ℝ)

HIGH CORRELATION 

Distinct189
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean325314.05
Minimum286614
Maximum377197
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:46:12.579495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum286614
5-th percentile290774.3
Q1304681.5
median322495
Q3346638
95-th percentile367483.1
Maximum377197
Range90583
Interquartile range (IQR)41956.5

Descriptive statistics

Standard deviation24421.911
Coefficient of variation (CV)0.075071801
Kurtosis-1.0664681
Mean325314.05
Median Absolute Deviation (MAD)20845
Skewness0.25320276
Sum64737495
Variance5.9642975 × 108
MonotonicityNot monotonic
2023-12-10T15:46:12.879924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
331393 3
 
1.5%
313265 3
 
1.5%
351181 2
 
1.0%
345033 2
 
1.0%
309305 2
 
1.0%
321796 2
 
1.0%
367885 2
 
1.0%
301584 2
 
1.0%
298149 1
 
0.5%
352718 1
 
0.5%
Other values (179) 179
89.9%
ValueCountFrequency (%)
286614 1
0.5%
287389 1
0.5%
287889 1
0.5%
288678 1
0.5%
289330 1
0.5%
289415 1
0.5%
289720 1
0.5%
290128 1
0.5%
290217 1
0.5%
290678 1
0.5%
ValueCountFrequency (%)
377197 1
0.5%
375011 1
0.5%
372539 1
0.5%
372299 1
0.5%
371055 1
0.5%
367972 1
0.5%
367885 2
1.0%
367876 1
0.5%
367871 1
0.5%
367440 1
0.5%

555559
Real number (ℝ)

Distinct188
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean535280.85
Minimum485246
Maximum614433
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:46:13.081236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum485246
5-th percentile493458.8
Q1519926
median531676
Q3549600
95-th percentile573441.4
Maximum614433
Range129187
Interquartile range (IQR)29674

Descriptive statistics

Standard deviation24683.394
Coefficient of variation (CV)0.046112978
Kurtosis0.3167101
Mean535280.85
Median Absolute Deviation (MAD)13989
Skewness0.49566529
Sum1.0652089 × 108
Variance6.0926996 × 108
MonotonicityNot monotonic
2023-12-10T15:46:13.264693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
516921 3
 
1.5%
490486 3
 
1.5%
519926 2
 
1.0%
520999 2
 
1.0%
535753 2
 
1.0%
527751 2
 
1.0%
519680 2
 
1.0%
520361 2
 
1.0%
545680 2
 
1.0%
530248 1
 
0.5%
Other values (178) 178
89.4%
ValueCountFrequency (%)
485246 1
 
0.5%
488179 1
 
0.5%
488279 1
 
0.5%
489042 1
 
0.5%
489451 1
 
0.5%
490309 1
 
0.5%
490486 3
1.5%
491342 1
 
0.5%
493694 1
 
0.5%
495560 1
 
0.5%
ValueCountFrequency (%)
614433 1
0.5%
609538 1
0.5%
603911 1
0.5%
602909 1
0.5%
586906 1
0.5%
583870 1
0.5%
580958 1
0.5%
580335 1
0.5%
580097 1
0.5%
574381 1
0.5%
Distinct190
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:46:13.697943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length3.5477387
Min length2

Characters and Unicode

Total characters706
Distinct characters196
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique181 ?
Unique (%)91.0%

Sample

1st row전곡
2nd row장풍리
3rd row북삼송
4th row풍림
5th row송내한전
ValueCountFrequency (%)
검찰청입구 2
 
1.0%
범계 2
 
1.0%
이천 2
 
1.0%
삼성 2
 
1.0%
안산 2
 
1.0%
여주 2
 
1.0%
명지대 2
 
1.0%
수원 2
 
1.0%
중앙대 2
 
1.0%
고양시장 1
 
0.5%
Other values (181) 181
90.5%
2023-12-10T15:46:14.310420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
5.0%
23
 
3.3%
19
 
2.7%
16
 
2.3%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
Other values (186) 547
77.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 677
95.9%
Decimal Number 20
 
2.8%
Uppercase Letter 6
 
0.8%
Other Punctuation 2
 
0.3%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
5.2%
23
 
3.4%
19
 
2.8%
16
 
2.4%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
Other values (174) 518
76.5%
Decimal Number
ValueCountFrequency (%)
2 7
35.0%
3 5
25.0%
1 5
25.0%
4 2
 
10.0%
6 1
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 2
33.3%
T 1
16.7%
P 1
16.7%
S 1
16.7%
G 1
16.7%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 677
95.9%
Common 23
 
3.3%
Latin 6
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
5.2%
23
 
3.4%
19
 
2.8%
16
 
2.4%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
Other values (174) 518
76.5%
Common
ValueCountFrequency (%)
2 7
30.4%
3 5
21.7%
1 5
21.7%
4 2
 
8.7%
. 2
 
8.7%
1
 
4.3%
6 1
 
4.3%
Latin
ValueCountFrequency (%)
A 2
33.3%
T 1
16.7%
P 1
16.7%
S 1
16.7%
G 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 677
95.9%
ASCII 29
 
4.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
5.2%
23
 
3.4%
19
 
2.8%
16
 
2.4%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
11
 
1.6%
Other values (174) 518
76.5%
ASCII
ValueCountFrequency (%)
2 7
24.1%
3 5
17.2%
1 5
17.2%
A 2
 
6.9%
4 2
 
6.9%
. 2
 
6.9%
T 1
 
3.4%
P 1
 
3.4%
S 1
 
3.4%
G 1
 
3.4%
Other values (2) 2
 
6.9%
Distinct120
Distinct (%)60.3%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:46:14.575994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length25
Mean length16.437186
Min length1

Characters and Unicode

Total characters3271
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)56.3%

Sample

1st row4180025321104740002002187
2nd rowX
3rd row4128110300100470015000001
4th row4113510500102550001040021
5th row4119510400101850148000001
ValueCountFrequency (%)
x 71
35.7%
4155035031200830010000001 3
 
1.5%
4111313700111890000027851 3
 
1.5%
4150031029103630005044904 2
 
1.0%
4143010100102830005007260 2
 
1.0%
4173025022102740003027154 2
 
1.0%
4150010300102190001009725 2
 
1.0%
4121010600101710001000001 2
 
1.0%
4165025021101660004013939 1
 
0.5%
4161025633102740011254118 1
 
0.5%
Other values (110) 110
55.3%
2023-12-10T15:46:15.016472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1166
35.6%
1 641
19.6%
4 293
 
9.0%
2 271
 
8.3%
3 218
 
6.7%
5 166
 
5.1%
7 122
 
3.7%
8 118
 
3.6%
6 113
 
3.5%
9 92
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3200
97.8%
Uppercase Letter 71
 
2.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1166
36.4%
1 641
20.0%
4 293
 
9.2%
2 271
 
8.5%
3 218
 
6.8%
5 166
 
5.2%
7 122
 
3.8%
8 118
 
3.7%
6 113
 
3.5%
9 92
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
X 71
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3200
97.8%
Latin 71
 
2.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1166
36.4%
1 641
20.0%
4 293
 
9.2%
2 271
 
8.5%
3 218
 
6.8%
5 166
 
5.2%
7 122
 
3.8%
8 118
 
3.7%
6 113
 
3.5%
9 92
 
2.9%
Latin
ValueCountFrequency (%)
X 71
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3271
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1166
35.6%
1 641
19.6%
4 293
 
9.0%
2 271
 
8.3%
3 218
 
6.7%
5 166
 
5.1%
7 122
 
3.7%
8 118
 
3.6%
6 113
 
3.5%
9 92
 
2.8%

Interactions

2023-12-10T15:46:05.599225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:03.997905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:04.513941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:05.106642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:05.726348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:04.119260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:04.637781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:05.230672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:05.858624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:04.250798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:04.779386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:05.380240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:05.977202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:04.363804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:04.924823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:46:05.497300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:46:15.127517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
520118시외4131010400003540002323227555559
5201181.0000.0000.5880.6040.581
시외0.0001.0000.0000.0000.000
41310104000035400020.5880.0001.0000.7740.817
3232270.6040.0000.7741.0000.455
5555590.5810.0000.8170.4551.000
2023-12-10T15:46:15.259074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
5201184131010400003540002323227555559시외
5201181.0000.074-0.0220.0890.000
41310104000035400020.0741.0000.7010.1570.000
323227-0.0220.7011.000-0.1110.011
5555590.0890.157-0.1111.0000.000
시외0.0000.0000.0110.0001.000

Missing values

2023-12-10T15:46:06.218686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:46:06.423710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

E02639금강고속경기도 구리시 교문동 354-2번지경기도 구리시 아차산로487번길 34520118시외경기 구리시 교문동 354-241310104000035400021323227555559금강고속.14131010400102850023012223
0E02338전곡경기도 연천군 전곡읍 전곡리 474-2번지경기도 연천군 전곡읍 온골로 49302503시외경기 연천군 전곡읍 온골로 4941800253210047400021318341602909전곡4180025321104740002002187
1E02344장풍리(시내)경기도 여주시 대신면 장풍리 165-5번지X7525시외경기 여주시 대신면 장풍리 165-541670350270016500051367192531888장풍리X
2E02541북삼송경기도 고양시 덕양구 신원동 613번지경기도 고양시 덕양구 오금로 7407618시외경기 고양시 덕양구 신원동 61341281103000061300001302555562968북삼송4128110300100470015000001
3E02275풍림경기도 성남시 분당구 서현동 255-1번지경기도 성남시 분당구 서현로 170350517시외경기 성남시 분당구 서현동 255-141135105000025500011322495532244풍림4113510500102550001040021
4E02490송내한전경기도 부천시 도당동 185-148번지경기도 부천시 부천로264번길 4137443시외경기 부천시 원미구 도당동 185-14841190104000018501481292897545680송내한전4119510400101850148000001
5E027132동경기도 안양시 동안구 관양동 1562번지X134971시외경기 안양시 동안구 관양동 1562411731020001562000013090915339992동X
6E02423에스케이하이닉스경기도 이천시 부발읍 아미리 산136-1번지경기도 이천시 부발읍 경충대로 2091-1433060시외경기 이천시 부발읍 아미리 산136-141500253301013600011354882516494에스케이하이닉스4150025330107020000028848
7E02313지산한빛마을경기도 파주시 와동동 1364번지X406319시외경기 파주시 와동동 136441480122000136400001290128570064지산한빛마을X
8E02699객사리경기도 평택시 팽성읍 객사리 93번지경기도 평택시 팽성읍 동서촌로 123440645시외경기 평택시 팽성읍 동서촌로 12341220250210009300001316956485246객사리4122025021100920007228404
9E02590동아대경기도 안성시 삼죽면 진촌리 632-18번지경기도 안성시 삼죽면 동아예대길 47511318시외경기 안성시 삼죽면 진촌리 632-1841550410270063200181342681495830동아대4155041026105630003011174
E02639금강고속경기도 구리시 교문동 354-2번지경기도 구리시 아차산로487번길 34520118시외경기 구리시 교문동 354-241310104000035400021323227555559금강고속.14131010400102850023012223
189E02409오정동경기도 부천시 오정동 110-2번지X416373시외경기 부천시 오정구 오정동 110-241190117000011000021293136548379오정동X
190E00109고양(화정)경기도 고양시 덕양구 화정동 974번지경기도 고양시 덕양구 화신로260번길 74282594고속고양시 덕양구 화정동 97441281123000097400001297255559741고양4128112300109740000012887
191E02591동백지구경기도 부천시 내동 222-43번지경기도 부천시 신흥로 475417131시외경기 부천시 오정구 내동 222-4341190124000022200431292155547750동백지구4119910800102220053044478
192E02295킨텍스경기도 고양시 일산서구 대화동 2607번지X508300시외경기도 고양시 일산서구 대화동 260741287104000260700001289720563388킨텍스X
193E02256행정타운.용인대경기도 용인시 처인구 역북동 365-75번지X169153시외경기 용인시 처인구 역북동 365-7541461102000036500751327420515469행정타운.용인대X
194E02468시흥관광호텔경기도 시흥시 정왕동 1622-6번지경기도 시흥시 평안상가4길 21299314시외경기 시흥시 평안상가4길 2141390132000162200061289415526932시흥관광호텔4139013200116220006036699
195E02426어현(현곡)경기도 평택시 청북읍 현곡리 293-4번지경기도 평택시 청북읍 청원로 2142198시외경기 평택시 청북면 청원로 241220259210029300041305417493694어현4122035021102930003232953
196E02361이동경기도 포천시 이동면 장암리 527번지X32477시외경기 포천시 이동면 장암리 52741650380210052700001345118603911이동X
197E02704가평경기도 가평군 가평읍 대곡리 168-9번지경기도 가평군 가평읍 가화로 51403433시외경기 가평군 가평읍 가화로 5141820250220016800091357539580335가평4182025022101680009005294
198E02516서화경기도 양평군 양동면 단석리 산255-10번지X7582시외경기 양평군 양동면 단석리 산 255-1041830380281025500101375011533108서화X