Overview

Dataset statistics

Number of variables6
Number of observations43
Missing cells13
Missing cells (%)5.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory54.1 B

Variable types

Numeric3
Text3

Dataset

Description인천광역시 미추홀구의 고시원에 대한 데이터입니다. 상호명, 도로명주소, 전화번호, 좌표값 등의 항목을 제공하고 있습니다.
Author인천광역시 미추홀구
URLhttps://www.data.go.kr/data/15086999/fileData.do

Alerts

전화번호 has 13 (30.2%) missing valuesMissing
연번 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:39:37.052609
Analysis finished2023-12-12 06:39:38.549826
Duration1.5 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22
Minimum1
Maximum43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T15:39:38.646190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.1
Q111.5
median22
Q332.5
95-th percentile40.9
Maximum43
Range42
Interquartile range (IQR)21

Descriptive statistics

Standard deviation12.556539
Coefficient of variation (CV)0.57075176
Kurtosis-1.2
Mean22
Median Absolute Deviation (MAD)11
Skewness0
Sum946
Variance157.66667
MonotonicityStrictly increasing
2023-12-12T15:39:38.824483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
1 1
 
2.3%
2 1
 
2.3%
25 1
 
2.3%
26 1
 
2.3%
27 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
Other values (33) 33
76.7%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
6 1
2.3%
7 1
2.3%
8 1
2.3%
9 1
2.3%
10 1
2.3%
ValueCountFrequency (%)
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%
37 1
2.3%
36 1
2.3%
35 1
2.3%
34 1
2.3%
Distinct42
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T15:39:39.114493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length5
Mean length6.1395349
Min length3

Characters and Unicode

Total characters264
Distinct characters96
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)95.3%

Sample

1st row레브스테이
2nd row풀하우스고시텔
3rd row오픈하우스 인하대점
4th row싱글하우스 고시텔 주안점
5th row스카이블루원룸텔
ValueCountFrequency (%)
주영웰빙텔 2
 
3.8%
원룸텔 2
 
3.8%
싱글하우스 2
 
3.8%
주안점 2
 
3.8%
맥나인미니하우스고시텔 1
 
1.9%
인하레지던스 1
 
1.9%
명지고시원 1
 
1.9%
마운틴리빙텔 1
 
1.9%
월드고시원 1
 
1.9%
주안고시텔 1
 
1.9%
Other values (39) 39
73.6%
2023-12-12T15:39:39.568758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
9.1%
20
 
7.6%
20
 
7.6%
19
 
7.2%
13
 
4.9%
11
 
4.2%
10
 
3.8%
10
 
3.8%
6
 
2.3%
5
 
1.9%
Other values (86) 126
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 250
94.7%
Space Separator 10
 
3.8%
Decimal Number 4
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
9.6%
20
 
8.0%
20
 
8.0%
19
 
7.6%
13
 
5.2%
11
 
4.4%
10
 
4.0%
6
 
2.4%
5
 
2.0%
5
 
2.0%
Other values (81) 117
46.8%
Decimal Number
ValueCountFrequency (%)
8 1
25.0%
1 1
25.0%
9 1
25.0%
6 1
25.0%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 250
94.7%
Common 14
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
9.6%
20
 
8.0%
20
 
8.0%
19
 
7.6%
13
 
5.2%
11
 
4.4%
10
 
4.0%
6
 
2.4%
5
 
2.0%
5
 
2.0%
Other values (81) 117
46.8%
Common
ValueCountFrequency (%)
10
71.4%
8 1
 
7.1%
1 1
 
7.1%
9 1
 
7.1%
6 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 250
94.7%
ASCII 14
 
5.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
9.6%
20
 
8.0%
20
 
8.0%
19
 
7.6%
13
 
5.2%
11
 
4.4%
10
 
4.0%
6
 
2.4%
5
 
2.0%
5
 
2.0%
Other values (81) 117
46.8%
ASCII
ValueCountFrequency (%)
10
71.4%
8 1
 
7.1%
1 1
 
7.1%
9 1
 
7.1%
6 1
 
7.1%

도로명주소
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T15:39:39.889926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length30
Mean length25.906977
Min length17

Characters and Unicode

Total characters1114
Distinct characters97
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row인천광역시 미추홀구 인하로 107 조은빌딩 레브스테이 3~5층
2nd row인천광역시 미추홀구 주안로 108 경향프라자 15층 1501호
3rd row인천광역시 미추홀구 경인남길30번길 65 대동빌딩 2층
4th row인천광역시 미추홀구 석바위로 53
5th row인천광역시 미추홀구 경원대로 740-1 3층
ValueCountFrequency (%)
인천광역시 43
20.8%
미추홀구 43
20.8%
미추홀대로719번길 7
 
3.4%
미추홀대로734번길 4
 
1.9%
2층 3
 
1.4%
16 3
 
1.4%
석바위로53번길 3
 
1.4%
3층 3
 
1.4%
경인로 2
 
1.0%
25-15 2
 
1.0%
Other values (88) 94
45.4%
2023-12-12T15:39:40.398956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164
 
14.7%
57
 
5.1%
57
 
5.1%
57
 
5.1%
54
 
4.8%
1 47
 
4.2%
44
 
3.9%
44
 
3.9%
43
 
3.9%
43
 
3.9%
Other values (87) 504
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 727
65.3%
Decimal Number 209
 
18.8%
Space Separator 164
 
14.7%
Dash Punctuation 9
 
0.8%
Math Symbol 2
 
0.2%
Other Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
7.8%
57
 
7.8%
57
 
7.8%
54
 
7.4%
44
 
6.1%
44
 
6.1%
43
 
5.9%
43
 
5.9%
43
 
5.9%
38
 
5.2%
Other values (71) 247
34.0%
Decimal Number
ValueCountFrequency (%)
1 47
22.5%
3 30
14.4%
7 24
11.5%
2 20
9.6%
5 19
9.1%
0 16
 
7.7%
9 16
 
7.7%
4 15
 
7.2%
6 14
 
6.7%
8 8
 
3.8%
Space Separator
ValueCountFrequency (%)
164
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 727
65.3%
Common 387
34.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
7.8%
57
 
7.8%
57
 
7.8%
54
 
7.4%
44
 
6.1%
44
 
6.1%
43
 
5.9%
43
 
5.9%
43
 
5.9%
38
 
5.2%
Other values (71) 247
34.0%
Common
ValueCountFrequency (%)
164
42.4%
1 47
 
12.1%
3 30
 
7.8%
7 24
 
6.2%
2 20
 
5.2%
5 19
 
4.9%
0 16
 
4.1%
9 16
 
4.1%
4 15
 
3.9%
6 14
 
3.6%
Other values (6) 22
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 727
65.3%
ASCII 387
34.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164
42.4%
1 47
 
12.1%
3 30
 
7.8%
7 24
 
6.2%
2 20
 
5.2%
5 19
 
4.9%
0 16
 
4.1%
9 16
 
4.1%
4 15
 
3.9%
6 14
 
3.6%
Other values (6) 22
 
5.7%
Hangul
ValueCountFrequency (%)
57
 
7.8%
57
 
7.8%
57
 
7.8%
54
 
7.4%
44
 
6.1%
44
 
6.1%
43
 
5.9%
43
 
5.9%
43
 
5.9%
38
 
5.2%
Other values (71) 247
34.0%

전화번호
Text

MISSING 

Distinct29
Distinct (%)96.7%
Missing13
Missing (%)30.2%
Memory size476.0 B
2023-12-12T15:39:40.664911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.4
Min length12

Characters and Unicode

Total characters372
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)93.3%

Sample

1st row0507-1349-1738
2nd row032-872-1141
3rd row032-865-0010
4th row0507-1412-5713
5th row032-422-0902
ValueCountFrequency (%)
032-875-9998 2
 
6.7%
032-863-1155 1
 
3.3%
0507-1349-1738 1
 
3.3%
032-424-8189 1
 
3.3%
032-866-0711 1
 
3.3%
032-887-6760 1
 
3.3%
032-862-9991 1
 
3.3%
070-8132-6492 1
 
3.3%
032-889-7008 1
 
3.3%
032-862-8952 1
 
3.3%
Other values (19) 19
63.3%
2023-12-12T15:39:41.089241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 60
16.1%
0 58
15.6%
2 43
11.6%
3 41
11.0%
8 37
9.9%
1 26
7.0%
5 25
6.7%
9 24
 
6.5%
7 21
 
5.6%
6 20
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 312
83.9%
Dash Punctuation 60
 
16.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 58
18.6%
2 43
13.8%
3 41
13.1%
8 37
11.9%
1 26
8.3%
5 25
8.0%
9 24
7.7%
7 21
 
6.7%
6 20
 
6.4%
4 17
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 372
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 60
16.1%
0 58
15.6%
2 43
11.6%
3 41
11.0%
8 37
9.9%
1 26
7.0%
5 25
6.7%
9 24
 
6.5%
7 21
 
5.6%
6 20
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 372
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 60
16.1%
0 58
15.6%
2 43
11.6%
3 41
11.0%
8 37
9.9%
1 26
7.0%
5 25
6.7%
9 24
 
6.5%
7 21
 
5.6%
6 20
 
5.4%

위도
Real number (ℝ)

Distinct39
Distinct (%)90.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.459813
Minimum37.44171
Maximum37.468105
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T15:39:41.289136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.44171
5-th percentile37.448921
Q137.457942
median37.461714
Q337.463264
95-th percentile37.467441
Maximum37.468105
Range0.02639498
Interquartile range (IQR)0.00532211

Descriptive statistics

Standard deviation0.0059995011
Coefficient of variation (CV)0.00016015833
Kurtosis1.0673966
Mean37.459813
Median Absolute Deviation (MAD)0.00169663
Skewness-1.22757
Sum1610.772
Variance3.5994013 × 10-5
MonotonicityNot monotonic
2023-12-12T15:39:41.453333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
37.46366077 2
 
4.7%
37.4621995 2
 
4.7%
37.46186403 2
 
4.7%
37.46341102 2
 
4.7%
37.46402962 1
 
2.3%
37.46294321 1
 
2.3%
37.45323371 1
 
2.3%
37.46171073 1
 
2.3%
37.46777189 1
 
2.3%
37.44596058 1
 
2.3%
Other values (29) 29
67.4%
ValueCountFrequency (%)
37.44171042 1
2.3%
37.44596058 1
2.3%
37.44874789 1
2.3%
37.45048081 1
2.3%
37.4513708 1
2.3%
37.45172306 1
2.3%
37.4522475 1
2.3%
37.45316135 1
2.3%
37.45319033 1
2.3%
37.45323371 1
2.3%
ValueCountFrequency (%)
37.4681054 1
2.3%
37.46777189 1
2.3%
37.46754729 1
2.3%
37.46648884 1
2.3%
37.46562508 1
2.3%
37.46402962 1
2.3%
37.46366077 2
4.7%
37.46360285 1
2.3%
37.46341102 2
4.7%
37.4631173 1
2.3%

경도
Real number (ℝ)

Distinct39
Distinct (%)90.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.67326
Minimum126.65181
Maximum126.69021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T15:39:41.636252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.65181
5-th percentile126.6561
Q1126.6619
median126.67858
Q3126.6808
95-th percentile126.68281
Maximum126.69021
Range0.0383955
Interquartile range (IQR)0.01889425

Descriptive statistics

Standard deviation0.010537858
Coefficient of variation (CV)8.3189281 × 10-5
Kurtosis-1.1026354
Mean126.67326
Median Absolute Deviation (MAD)0.0031494
Skewness-0.67709446
Sum5446.9503
Variance0.00011104645
MonotonicityNot monotonic
2023-12-12T15:39:41.829169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
126.6817327 2
 
4.7%
126.679626 2
 
4.7%
126.6792175 2
 
4.7%
126.6817343 2
 
4.7%
126.6633522 1
 
2.3%
126.6791331 1
 
2.3%
126.6599748 1
 
2.3%
126.6782782 1
 
2.3%
126.6593691 1
 
2.3%
126.6584137 1
 
2.3%
Other values (29) 29
67.4%
ValueCountFrequency (%)
126.6518133 1
2.3%
126.6543518 1
2.3%
126.655973 1
2.3%
126.6572802 1
2.3%
126.658128 1
2.3%
126.6584137 1
2.3%
126.6593691 1
2.3%
126.659941 1
2.3%
126.6599748 1
2.3%
126.6601181 1
2.3%
ValueCountFrequency (%)
126.6902088 1
2.3%
126.6856998 1
2.3%
126.6829126 1
2.3%
126.681924 1
2.3%
126.6817744 1
2.3%
126.6817343 2
4.7%
126.6817327 2
4.7%
126.6810363 1
2.3%
126.6809991 1
2.3%
126.680592 1
2.3%

Interactions

2023-12-12T15:39:37.985331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:37.363178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:37.684013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:38.082041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:37.451028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:37.774585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:38.198919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:37.559710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:39:37.874494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:39:41.972599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호명도로명주소전화번호위도경도
연번1.0000.9191.0000.8770.2190.000
상호명0.9191.0001.0001.0001.0001.000
도로명주소1.0001.0001.0001.0001.0001.000
전화번호0.8771.0001.0001.0001.0001.000
위도0.2191.0001.0001.0001.0000.855
경도0.0001.0001.0001.0000.8551.000
2023-12-12T15:39:42.079579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도
연번1.0000.116-0.039
위도0.1161.0000.237
경도-0.0390.2371.000

Missing values

2023-12-12T15:39:38.371950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:39:38.497229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호명도로명주소전화번호위도경도
01레브스테이인천광역시 미추홀구 인하로 107 조은빌딩 레브스테이 3~5층0507-1349-173837.450481126.660118
12풀하우스고시텔인천광역시 미추홀구 주안로 108 경향프라자 15층 1501호<NA>37.463661126.681733
23오픈하우스 인하대점인천광역시 미추홀구 경인남길30번길 65 대동빌딩 2층032-872-114137.451723126.658128
34싱글하우스 고시텔 주안점인천광역시 미추홀구 석바위로 53032-865-001037.460979126.679114
45스카이블루원룸텔인천광역시 미추홀구 경원대로 740-1 3층0507-1412-571337.448748126.690209
56노블레지던스 주안점인천광역시 미추홀구 미추홀대로734번길 23 주안빌딩032-422-090237.463117126.681774
67나 홀로 집에 1986 원룸텔인천광역시 미추홀구 학익소로 67<NA>37.44171126.670765
78렉스리빙텔인천광역시 미추홀구 주안동로 21<NA>37.460214126.6857
89경향고시텔인천광역시 미추홀구 주안로 108 1502호 경향고시텔0507-1326-792637.463661126.681733
910사이버고시원인천광역시 미추홀구 석정로150번길 43 새한빌딩032-885-500837.467547126.654352
연번상호명도로명주소전화번호위도경도
3334디에스빌인천광역시 미추홀구 주안중로 42-2 디에스빌<NA>37.462215126.682913
3435주영웰빙텔인천광역시 미추홀구 미추홀대로719번길 35032-875-999837.461714126.678133
3536설악리빙텔인천광역시 미추홀구 미추홀대로719번길 7-27<NA>37.461094126.679519
3637행복고시원인천광역시 미추홀구 주안로90번길 39 행복빌딩<NA>37.462199126.679626
3738파크빌인천광역시 미추홀구 미추홀대로719번길 13032-866-071137.461642126.679352
3839경도고시원인천광역시 미추홀구 미추홀대로734번길 26032-424-818937.462873126.681924
3940탑고시텔인천광역시 미추홀구 미추홀대로719번길 29<NA>37.461697126.678418
4041해피하우스인천광역시 미추홀구 경인남길118번길 16-10<NA>37.453161126.659941
4142명지고시원인천광역시 미추홀구 주안로90번길 39 행복빌딩4~5층<NA>37.462199126.679626
4243정문고시텔인천광역시 미추홀구 미추홀대로719번길 16032-863-123637.461864126.679217