Overview

Dataset statistics

Number of variables2
Number of observations63
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory19.1 B

Variable types

Numeric1
Text1

Alerts

시설iD has unique valuesUnique

Reproduction

Analysis started2023-12-10 21:14:15.659638
Analysis finished2023-12-10 21:14:15.971706
Duration0.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설iD
Real number (ℝ)

UNIQUE 

Distinct63
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10266.397
Minimum10010
Maximum10717
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size699.0 B
2023-12-11T06:14:16.058974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10010
5-th percentile10016.1
Q110083.5
median10141
Q310520.5
95-th percentile10703.9
Maximum10717
Range707
Interquartile range (IQR)437

Descriptive statistics

Standard deviation243.99296
Coefficient of variation (CV)0.023766173
Kurtosis-0.9056681
Mean10266.397
Median Absolute Deviation (MAD)108
Skewness0.79873841
Sum646783
Variance59532.566
MonotonicityNot monotonic
2023-12-11T06:14:16.203410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10141 1
 
1.6%
10140 1
 
1.6%
10081 1
 
1.6%
10706 1
 
1.6%
10705 1
 
1.6%
10703 1
 
1.6%
10704 1
 
1.6%
10523 1
 
1.6%
10522 1
 
1.6%
10521 1
 
1.6%
Other values (53) 53
84.1%
ValueCountFrequency (%)
10010 1
1.6%
10013 1
1.6%
10015 1
1.6%
10016 1
1.6%
10017 1
1.6%
10019 1
1.6%
10020 1
1.6%
10021 1
1.6%
10033 1
1.6%
10036 1
1.6%
ValueCountFrequency (%)
10717 1
1.6%
10706 1
1.6%
10705 1
1.6%
10704 1
1.6%
10703 1
1.6%
10685 1
1.6%
10682 1
1.6%
10681 1
1.6%
10680 1
1.6%
10649 1
1.6%
Distinct46
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Memory size636.0 B
2023-12-11T06:14:16.432409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length5.8730159
Min length2

Characters and Unicode

Total characters370
Distinct characters86
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)57.1%

Sample

1st row(본사)교육장
2nd row(본사)세차장
3rd row(본사)숙소
4th row(본사)사무실
5th row(본사)주차장
ValueCountFrequency (%)
공영차고지 6
 
7.3%
차고 5
 
6.1%
휴게실 5
 
6.1%
자동차관련시설 4
 
4.9%
사무실 4
 
4.9%
차고지 4
 
4.9%
주차장 3
 
3.7%
교양실 2
 
2.4%
샤워실 2
 
2.4%
평촌영업소 2
 
2.4%
Other values (41) 45
54.9%
2023-12-11T06:14:16.792725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
9.2%
24
 
6.5%
19
 
5.1%
18
 
4.9%
16
 
4.3%
15
 
4.1%
14
 
3.8%
13
 
3.5%
) 11
 
3.0%
( 11
 
3.0%
Other values (76) 195
52.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 316
85.4%
Space Separator 19
 
5.1%
Close Punctuation 11
 
3.0%
Open Punctuation 11
 
3.0%
Decimal Number 9
 
2.4%
Uppercase Letter 2
 
0.5%
Other Punctuation 1
 
0.3%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
10.8%
24
 
7.6%
18
 
5.7%
16
 
5.1%
15
 
4.7%
14
 
4.4%
13
 
4.1%
11
 
3.5%
10
 
3.2%
9
 
2.8%
Other values (63) 152
48.1%
Decimal Number
ValueCountFrequency (%)
2 3
33.3%
4 2
22.2%
8 1
 
11.1%
1 1
 
11.1%
7 1
 
11.1%
6 1
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
H 1
50.0%
Space Separator
ValueCountFrequency (%)
19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 316
85.4%
Common 52
 
14.1%
Latin 2
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
10.8%
24
 
7.6%
18
 
5.7%
16
 
5.1%
15
 
4.7%
14
 
4.4%
13
 
4.1%
11
 
3.5%
10
 
3.2%
9
 
2.8%
Other values (63) 152
48.1%
Common
ValueCountFrequency (%)
19
36.5%
) 11
21.2%
( 11
21.2%
2 3
 
5.8%
4 2
 
3.8%
8 1
 
1.9%
1 1
 
1.9%
7 1
 
1.9%
6 1
 
1.9%
. 1
 
1.9%
Latin
ValueCountFrequency (%)
L 1
50.0%
H 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 316
85.4%
ASCII 54
 
14.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
10.8%
24
 
7.6%
18
 
5.7%
16
 
5.1%
15
 
4.7%
14
 
4.4%
13
 
4.1%
11
 
3.5%
10
 
3.2%
9
 
2.8%
Other values (63) 152
48.1%
ASCII
ValueCountFrequency (%)
19
35.2%
) 11
20.4%
( 11
20.4%
2 3
 
5.6%
4 2
 
3.7%
8 1
 
1.9%
1 1
 
1.9%
7 1
 
1.9%
6 1
 
1.9%
. 1
 
1.9%
Other values (3) 3
 
5.6%

Interactions

2023-12-11T06:14:15.754972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:14:16.908538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설iD시설명
시설iD1.0000.972
시설명0.9721.000

Missing values

2023-12-11T06:14:15.868608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:14:15.940886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설iD시설명
010141(본사)교육장
110140(본사)세차장
210139(본사)숙소
310138(본사)사무실
410127(본사)주차장
510649선진버스주식회사
610136차고및식당외
710135차고
810134차고및세차시설
910133차고
시설iD시설명
5310042공영차고지 휴게실
5410041공영차고지 휴게실
5510040공영차고지 휴게실
5610036공영차고지 사무실
5710033하남시 공영차고지
5810685사송동 공영차고지 (사송로 41)
5910682상대원(상) 차고지 (순환로 242)
6010681산성동 차고지 (산성동 7)
6110717주사무소
6210680동원동 차고지 (동원동86-2)