Overview

Dataset statistics

Number of variables5
Number of observations470
Missing cells212
Missing cells (%)9.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.9 KiB
Average record size in memory41.3 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description연수구 내 체육시설업 현황의 데이터에서 업종, 상호, 시설주소, 시설 전화번호의 항목- 업종, 상호, 시설주소, 시설 전화번호로 구분
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15065468&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
시설전화번호 has 212 (45.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 15:08:46.448917
Analysis finished2024-01-28 15:08:47.050669
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct470
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean235.5
Minimum1
Maximum470
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2024-01-29T00:08:47.110458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.45
Q1118.25
median235.5
Q3352.75
95-th percentile446.55
Maximum470
Range469
Interquartile range (IQR)234.5

Descriptive statistics

Standard deviation135.82157
Coefficient of variation (CV)0.57673705
Kurtosis-1.2
Mean235.5
Median Absolute Deviation (MAD)117.5
Skewness0
Sum110685
Variance18447.5
MonotonicityStrictly increasing
2024-01-29T00:08:47.220490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
311 1
 
0.2%
323 1
 
0.2%
322 1
 
0.2%
321 1
 
0.2%
320 1
 
0.2%
319 1
 
0.2%
318 1
 
0.2%
317 1
 
0.2%
316 1
 
0.2%
Other values (460) 460
97.9%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
470 1
0.2%
469 1
0.2%
468 1
0.2%
467 1
0.2%
466 1
0.2%
465 1
0.2%
464 1
0.2%
463 1
0.2%
462 1
0.2%
461 1
0.2%

업종
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
체력단련장업
130 
체육도장업
118 
당구장업
83 
골프연습장업
52 
가상체험 체육시설업
35 
Other values (7)
52 

Length

Max length10
Median length7
Mean length5.5702128
Min length4

Unique

Unique3 ?
Unique (%)0.6%

Sample

1st row수영장업
2nd row수영장업
3rd row수영장업
4th row수영장업
5th row수영장업

Common Values

ValueCountFrequency (%)
체력단련장업 130
27.7%
체육도장업 118
25.1%
당구장업 83
17.7%
골프연습장업 52
 
11.1%
가상체험 체육시설업 35
 
7.4%
체육교습업 30
 
6.4%
수영장업 13
 
2.8%
종합체육시설업 4
 
0.9%
인공암벽장업 2
 
0.4%
썰매장업 1
 
0.2%
Other values (2) 2
 
0.4%

Length

2024-01-29T00:08:47.326279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
체력단련장업 130
25.7%
체육도장업 118
23.4%
당구장업 83
16.4%
골프연습장업 52
 
10.3%
가상체험 35
 
6.9%
체육시설업 35
 
6.9%
체육교습업 30
 
5.9%
수영장업 13
 
2.6%
종합체육시설업 4
 
0.8%
인공암벽장업 2
 
0.4%
Other values (3) 3
 
0.6%

상호
Text

Distinct453
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2024-01-29T00:08:47.540370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length18
Mean length8.6468085
Min length2

Characters and Unicode

Total characters4064
Distinct characters383
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique438 ?
Unique (%)93.2%

Sample

1st row미라클
2nd row블루라군
3rd row미라클 스포츠(주)
4th row주식회사 블루라군수영장
5th rowIGC글로벌캠퍼스 수영장
ValueCountFrequency (%)
태권도장 25
 
2.9%
송도 20
 
2.3%
당구클럽 20
 
2.3%
아카데미 15
 
1.7%
경희대 15
 
1.7%
골프 12
 
1.4%
휘트니스 10
 
1.1%
피트니스 10
 
1.1%
송도점 10
 
1.1%
당구장 9
 
1.0%
Other values (553) 728
83.3%
2024-01-29T00:08:47.869915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
404
 
9.9%
171
 
4.2%
148
 
3.6%
93
 
2.3%
80
 
2.0%
78
 
1.9%
72
 
1.8%
71
 
1.7%
69
 
1.7%
66
 
1.6%
Other values (373) 2812
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3184
78.3%
Space Separator 404
 
9.9%
Uppercase Letter 323
 
7.9%
Lowercase Letter 61
 
1.5%
Decimal Number 31
 
0.8%
Close Punctuation 23
 
0.6%
Open Punctuation 22
 
0.5%
Other Punctuation 12
 
0.3%
Dash Punctuation 3
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
 
5.4%
148
 
4.6%
93
 
2.9%
80
 
2.5%
78
 
2.4%
72
 
2.3%
71
 
2.2%
69
 
2.2%
66
 
2.1%
64
 
2.0%
Other values (310) 2272
71.4%
Uppercase Letter
ValueCountFrequency (%)
S 40
 
12.4%
C 23
 
7.1%
T 23
 
7.1%
G 22
 
6.8%
P 19
 
5.9%
B 18
 
5.6%
Y 18
 
5.6%
E 17
 
5.3%
M 17
 
5.3%
R 15
 
4.6%
Other values (15) 111
34.4%
Lowercase Letter
ValueCountFrequency (%)
i 7
11.5%
s 7
11.5%
t 5
 
8.2%
n 4
 
6.6%
o 4
 
6.6%
y 4
 
6.6%
r 4
 
6.6%
a 4
 
6.6%
h 3
 
4.9%
l 2
 
3.3%
Other values (10) 17
27.9%
Decimal Number
ValueCountFrequency (%)
1 8
25.8%
2 8
25.8%
5 4
12.9%
3 3
 
9.7%
9 2
 
6.5%
6 2
 
6.5%
0 1
 
3.2%
7 1
 
3.2%
8 1
 
3.2%
4 1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 6
50.0%
& 5
41.7%
! 1
 
8.3%
Space Separator
ValueCountFrequency (%)
404
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3184
78.3%
Common 496
 
12.2%
Latin 384
 
9.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
 
5.4%
148
 
4.6%
93
 
2.9%
80
 
2.5%
78
 
2.4%
72
 
2.3%
71
 
2.2%
69
 
2.2%
66
 
2.1%
64
 
2.0%
Other values (310) 2272
71.4%
Latin
ValueCountFrequency (%)
S 40
 
10.4%
C 23
 
6.0%
T 23
 
6.0%
G 22
 
5.7%
P 19
 
4.9%
B 18
 
4.7%
Y 18
 
4.7%
E 17
 
4.4%
M 17
 
4.4%
R 15
 
3.9%
Other values (35) 172
44.8%
Common
ValueCountFrequency (%)
404
81.5%
) 23
 
4.6%
( 22
 
4.4%
1 8
 
1.6%
2 8
 
1.6%
. 6
 
1.2%
& 5
 
1.0%
5 4
 
0.8%
- 3
 
0.6%
3 3
 
0.6%
Other values (8) 10
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3184
78.3%
ASCII 880
 
21.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
404
45.9%
S 40
 
4.5%
C 23
 
2.6%
T 23
 
2.6%
) 23
 
2.6%
( 22
 
2.5%
G 22
 
2.5%
P 19
 
2.2%
B 18
 
2.0%
Y 18
 
2.0%
Other values (53) 268
30.5%
Hangul
ValueCountFrequency (%)
171
 
5.4%
148
 
4.6%
93
 
2.9%
80
 
2.5%
78
 
2.4%
72
 
2.3%
71
 
2.2%
69
 
2.2%
66
 
2.1%
64
 
2.0%
Other values (310) 2272
71.4%
Distinct466
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2024-01-29T00:08:48.068754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length54.5
Mean length39.014894
Min length22

Characters and Unicode

Total characters18337
Distinct characters300
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique462 ?
Unique (%)98.3%

Sample

1st row인천광역시 연수구 동곡재로 16 (동춘동)
2nd row인천광역시 연수구 새말로 111, 지1층 (연수동, 영남아파트)
3rd row인천광역시 연수구 선학로 101, 상가동 지하1,2층 (선학동, 뉴서울1차아파트)
4th row인천광역시 연수구 갯벌로 12 (송도동, 미추홀타워 본관)
5th row인천광역시 연수구 송도문화로 119 (송도동, 인천글로벌캠퍼스)
ValueCountFrequency (%)
인천광역시 470
 
14.0%
연수구 470
 
14.0%
송도동 204
 
6.1%
동춘동 57
 
1.7%
2층 54
 
1.6%
3층 53
 
1.6%
옥련동 52
 
1.5%
연수동 49
 
1.5%
송도 32
 
1.0%
청학동 32
 
1.0%
Other values (801) 1889
56.2%
2024-01-29T00:08:48.380955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2899
 
15.8%
1 746
 
4.1%
647
 
3.5%
, 629
 
3.4%
556
 
3.0%
553
 
3.0%
511
 
2.8%
509
 
2.8%
2 496
 
2.7%
490
 
2.7%
Other values (290) 10301
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10348
56.4%
Decimal Number 3230
 
17.6%
Space Separator 2899
 
15.8%
Other Punctuation 631
 
3.4%
Close Punctuation 477
 
2.6%
Open Punctuation 477
 
2.6%
Uppercase Letter 147
 
0.8%
Math Symbol 58
 
0.3%
Dash Punctuation 52
 
0.3%
Lowercase Letter 16
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
647
 
6.3%
556
 
5.4%
553
 
5.3%
511
 
4.9%
509
 
4.9%
490
 
4.7%
479
 
4.6%
473
 
4.6%
473
 
4.6%
471
 
4.6%
Other values (241) 5186
50.1%
Uppercase Letter
ValueCountFrequency (%)
B 40
27.2%
D 14
 
9.5%
E 10
 
6.8%
A 9
 
6.1%
C 8
 
5.4%
I 8
 
5.4%
H 6
 
4.1%
R 6
 
4.1%
W 6
 
4.1%
N 6
 
4.1%
Other values (12) 34
23.1%
Decimal Number
ValueCountFrequency (%)
1 746
23.1%
2 496
15.4%
0 470
14.6%
3 358
11.1%
4 255
 
7.9%
5 244
 
7.6%
6 238
 
7.4%
8 173
 
5.4%
7 128
 
4.0%
9 122
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
s 3
18.8%
a 3
18.8%
b 2
12.5%
i 2
12.5%
t 2
12.5%
e 2
12.5%
c 1
 
6.2%
m 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 629
99.7%
. 2
 
0.3%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
2899
100.0%
Close Punctuation
ValueCountFrequency (%)
) 477
100.0%
Open Punctuation
ValueCountFrequency (%)
( 477
100.0%
Math Symbol
ValueCountFrequency (%)
~ 58
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10348
56.4%
Common 7824
42.7%
Latin 165
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
647
 
6.3%
556
 
5.4%
553
 
5.3%
511
 
4.9%
509
 
4.9%
490
 
4.7%
479
 
4.6%
473
 
4.6%
473
 
4.6%
471
 
4.6%
Other values (241) 5186
50.1%
Latin
ValueCountFrequency (%)
B 40
24.2%
D 14
 
8.5%
E 10
 
6.1%
A 9
 
5.5%
C 8
 
4.8%
I 8
 
4.8%
H 6
 
3.6%
R 6
 
3.6%
W 6
 
3.6%
N 6
 
3.6%
Other values (22) 52
31.5%
Common
ValueCountFrequency (%)
2899
37.1%
1 746
 
9.5%
, 629
 
8.0%
2 496
 
6.3%
) 477
 
6.1%
( 477
 
6.1%
0 470
 
6.0%
3 358
 
4.6%
4 255
 
3.3%
5 244
 
3.1%
Other values (7) 773
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10348
56.4%
ASCII 7987
43.6%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2899
36.3%
1 746
 
9.3%
, 629
 
7.9%
2 496
 
6.2%
) 477
 
6.0%
( 477
 
6.0%
0 470
 
5.9%
3 358
 
4.5%
4 255
 
3.2%
5 244
 
3.1%
Other values (37) 936
 
11.7%
Hangul
ValueCountFrequency (%)
647
 
6.3%
556
 
5.4%
553
 
5.3%
511
 
4.9%
509
 
4.9%
490
 
4.7%
479
 
4.6%
473
 
4.6%
473
 
4.6%
471
 
4.6%
Other values (241) 5186
50.1%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

시설전화번호
Text

MISSING 

Distinct247
Distinct (%)95.7%
Missing212
Missing (%)45.1%
Memory size3.8 KiB
2024-01-29T00:08:48.587738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length10.062016
Min length8

Characters and Unicode

Total characters2596
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique239 ?
Unique (%)92.6%

Sample

1st row814-3322
2nd row1644-7926
3rd row834-2244
4th row819-0022
5th row715-7795
ValueCountFrequency (%)
032-832-4708 4
 
1.6%
032-811-6604 3
 
1.2%
811-1144 2
 
0.8%
835-7000 2
 
0.8%
851-4500 2
 
0.8%
032-812-1872 2
 
0.8%
835-4121 2
 
0.8%
032-811-8338 2
 
0.8%
032-831-7363 1
 
0.4%
032-831-5258 1
 
0.4%
Other values (237) 237
91.9%
2024-01-29T00:08:48.894091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 389
15.0%
3 360
13.9%
8 354
13.6%
0 314
12.1%
2 293
11.3%
1 250
9.6%
5 155
 
6.0%
7 146
 
5.6%
4 125
 
4.8%
9 118
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2207
85.0%
Dash Punctuation 389
 
15.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 360
16.3%
8 354
16.0%
0 314
14.2%
2 293
13.3%
1 250
11.3%
5 155
7.0%
7 146
6.6%
4 125
 
5.7%
9 118
 
5.3%
6 92
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 389
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2596
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 389
15.0%
3 360
13.9%
8 354
13.6%
0 314
12.1%
2 293
11.3%
1 250
9.6%
5 155
 
6.0%
7 146
 
5.6%
4 125
 
4.8%
9 118
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2596
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 389
15.0%
3 360
13.9%
8 354
13.6%
0 314
12.1%
2 293
11.3%
1 250
9.6%
5 155
 
6.0%
7 146
 
5.6%
4 125
 
4.8%
9 118
 
4.5%

Interactions

2024-01-29T00:08:46.855388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-29T00:08:48.968798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.880
업종0.8801.000
2024-01-29T00:08:49.029171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.631
업종0.6311.000

Missing values

2024-01-29T00:08:46.938734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-29T00:08:47.014167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종상호시설주소(도로명)시설전화번호
01수영장업미라클인천광역시 연수구 동곡재로 16 (동춘동)814-3322
12수영장업블루라군인천광역시 연수구 새말로 111, 지1층 (연수동, 영남아파트)1644-7926
23수영장업미라클 스포츠(주)인천광역시 연수구 선학로 101, 상가동 지하1,2층 (선학동, 뉴서울1차아파트)834-2244
34수영장업주식회사 블루라군수영장인천광역시 연수구 갯벌로 12 (송도동, 미추홀타워 본관)819-0022
45수영장업IGC글로벌캠퍼스 수영장인천광역시 연수구 송도문화로 119 (송도동, 인천글로벌캠퍼스)715-7795
56수영장업주식회사 블루라군인천광역시 연수구 센트럴로 123, 송도컨벤시아 13~17호 (송도동)710-3311
67수영장업키즈웨일 아카데미(주)인천광역시 연수구 센트럴로 232, 301,302,303호 (송도동, 더샵센트럴파크1)032-710-3666
78수영장업(주)박태환수영장인천광역시 연수구 하모니로138번길 11, 송도캐슬센트럴파크 지하1층 (송도동)032-710-2015
89수영장업웨스턴파크 스파앤풀인천광역시 연수구 아트센터대로168번길 100, 한라 웨스턴파크 송도 4층 (송도동)<NA>
910수영장업하비엘럭스인천광역시 연수구 해돋이로 160-19, 7층 701~706호 (송도동)816-6404
연번업종상호시설주소(도로명)시설전화번호
460461체육교습업드림베이스볼인천광역시 연수구 해돋이로 160-4, 8층 801~802호 (송도동)<NA>
461462체육교습업센트럴 파워점핑 줄넘기 전문클럽인천광역시 연수구 컨벤시아대로230번길 54, A동 3층 331호 (송도동)<NA>
462463체육교습업에스제이(SJ) 리듬체조 센트럴점인천광역시 연수구 해돋이로 160-19, 5층 505호 (송도동)<NA>
463464체육교습업에스제이(SJ) 리듬체조인천광역시 연수구 해돋이로 107, H동 5층 96호 (송도동, 송도 더샵 퍼스트월드)<NA>
464465체육교습업포레스트힐 캠퍼스인천광역시 연수구 아암대로 825-29, 지하1층 (동춘동)<NA>
465466체육교습업APT스포츠인천광역시 연수구 송도문화로28번길 81 (송도동, 송도더샵그린스퀘어)<NA>
466467체육교습업(주)에스디 일레븐인천광역시 연수구 인천신항대로 916, 송도LNG종합스포츠타운 (송도동)<NA>
467468체육교습업어루만짐 인천송도센터인천광역시 연수구 하모니로 158, 송도타임스페이스 B동 410호 (송도동)<NA>
468469인공암벽장업디스커버리씨에스(주) 송도지점인천광역시 연수구 송도과학로16번길 33-4, 송도 트리플스트리트 D동 211a~c호 (송도동)<NA>
469470인공암벽장업비블럭클라이밍인천광역시 연수구 아트센터대로 149, 커낼워크D1 SPRING 101동 101,201호 (송도동)<NA>