Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 24 |
Missing cells | 7 |
Missing cells (%) | 2.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.5 KiB |
Average record size in memory | 105.3 B |
Variable types
Categorical | 7 |
---|---|
Numeric | 4 |
Boolean | 1 |
Dataset
Description | 공중이용시설(공연장) 현황 |
---|---|
Author | 행정안전부 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=GCJHPFN1ABDLV192J7KJ13727403&infSeq=1 |
다중이용업소여부 has constant value "False" | Constant |
위생업종명 has constant value "공중이용시설" | Constant |
위생업태명 has constant value "공연장" | Constant |
다중이용업소여부 is highly correlated with 소재지도로명주소 and 6 other fields | High correlation |
소재지도로명주소 is highly correlated with 다중이용업소여부 and 6 other fields | High correlation |
시군명 is highly correlated with 다중이용업소여부 and 5 other fields | High correlation |
위생업태명 is highly correlated with 다중이용업소여부 and 6 other fields | High correlation |
영업상태명 is highly correlated with 다중이용업소여부 and 5 other fields | High correlation |
사업장명 is highly correlated with 다중이용업소여부 and 6 other fields | High correlation |
소재지지번주소 is highly correlated with 다중이용업소여부 and 6 other fields | High correlation |
위생업종명 is highly correlated with 다중이용업소여부 and 6 other fields | High correlation |
다중이용업소여부 has 1 (4.2%) missing values | Missing |
위생업종명 has 1 (4.2%) missing values | Missing |
위생업태명 has 1 (4.2%) missing values | Missing |
소재지도로명주소 has 2 (8.3%) missing values | Missing |
WGS84위도 has 1 (4.2%) missing values | Missing |
WGS84경도 has 1 (4.2%) missing values | Missing |
사업장명 has unique values | Unique |
소재지지번주소 has unique values | Unique |
Reproduction
Analysis started | 2023-03-18 03:27:15.519796 |
---|---|
Analysis finished | 2023-03-18 03:27:17.399954 |
Duration | 1.88 second |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
Distinct | 13 |
---|---|
Distinct (%) | 54.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 320.0 B |
의정부시 | |
---|---|
안산시 | |
성남시 | |
수원시 | |
과천시 | |
Other values (8) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.333333333 |
Min length | 3 |
Unique
Unique | 9 ? |
---|---|
Unique (%) | 37.5% |
Sample
1st row | 과천시 |
---|---|
2nd row | 광주시 |
3rd row | 구리시 |
4th row | 군포시 |
5th row | 부천시 |
Common Values
Value | Count | Frequency (%) |
의정부시 | 8 | |
안산시 | 3 | 12.5% |
성남시 | 2 | 8.3% |
수원시 | 2 | 8.3% |
과천시 | 1 | 4.2% |
광주시 | 1 | 4.2% |
구리시 | 1 | 4.2% |
군포시 | 1 | 4.2% |
부천시 | 1 | 4.2% |
안양시 | 1 | 4.2% |
Other values (3) | 3 | 12.5% |
Length
Histogram of lengths of the category
Value | Count | Frequency (%) |
의정부시 | 8 | |
안산시 | 3 | 12.5% |
성남시 | 2 | 8.3% |
수원시 | 2 | 8.3% |
과천시 | 1 | 4.2% |
광주시 | 1 | 4.2% |
구리시 | 1 | 4.2% |
군포시 | 1 | 4.2% |
부천시 | 1 | 4.2% |
안양시 | 1 | 4.2% |
Other values (3) | 3 | 12.5% |
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 320.0 B |
과천시민회관 | 1 |
---|---|
광주시 문화스포츠센터 | 1 |
롯데씨네(구리시네마) | 1 |
군포문화예술회관 | 1 |
부천시민회관 | 1 |
Other values (19) |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 7.25 |
Min length | 4 |
Unique
Unique | 24 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 과천시민회관 |
---|---|
2nd row | 광주시 문화스포츠센터 |
3rd row | 롯데씨네(구리시네마) |
4th row | 군포문화예술회관 |
5th row | 부천시민회관 |
Common Values
Value | Count | Frequency (%) |
과천시민회관 | 1 | 4.2% |
광주시 문화스포츠센터 | 1 | 4.2% |
롯데씨네(구리시네마) | 1 | 4.2% |
군포문화예술회관 | 1 | 4.2% |
부천시민회관 | 1 | 4.2% |
성남시민회관 | 1 | 4.2% |
성남아트센터 대극장 | 1 | 4.2% |
영통키넥스5 | 1 | 4.2% |
경기도문화의전당 | 1 | 4.2% |
안산문화예술의전당 | 1 | 4.2% |
Other values (14) | 14 |
Length
Histogram of lengths of the category
Value | Count | Frequency (%) |
과천시민회관 | 1 | 3.8% |
광주시 | 1 | 3.8% |
숭문상가 | 1 | 3.8% |
성혼예식장 | 1 | 3.8% |
삼천리탄업(주 | 1 | 3.8% |
의장부기독교청년회 | 1 | 3.8% |
행복예식장 | 1 | 3.8% |
허니문예식장 | 1 | 3.8% |
동원웨딩홀 | 1 | 3.8% |
경기도북부여성회관3 | 1 | 3.8% |
Other values (16) | 16 |
인허가일자
Real number (ℝ≥0)
Distinct | 21 |
---|---|
Distinct (%) | 87.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20011757.58 |
Minimum | 19910408 |
---|---|
Maximum | 20130911 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 344.0 B |
Quantile statistics
Minimum | 19910408 |
---|---|
5-th percentile | 19950331 |
Q1 | 19957707 |
median | 19975460 |
Q3 | 20070335.5 |
95-th percentile | 20109488.8 |
Maximum | 20130911 |
Range | 220503 |
Interquartile range (IQR) | 112628.5 |
Descriptive statistics
Standard deviation | 66295.94672 |
---|---|
Coefficient of variation (CV) | 0.003312849781 |
Kurtosis | -1.507490205 |
Mean | 20011757.58 |
Median Absolute Deviation (MAD) | 45090.5 |
Skewness | 0.2948417378 |
Sum | 480282182 |
Variance | 4395152551 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%) |
19950331 | 3 | 12.5% |
19960703 | 2 | 8.3% |
19910408 | 1 | 4.2% |
20100302 | 1 | 4.2% |
20070216 | 1 | 4.2% |
19980219 | 1 | 4.2% |
19960415 | 1 | 4.2% |
20050920 | 1 | 4.2% |
20071029 | 1 | 4.2% |
20070306 | 1 | 4.2% |
Other values (11) | 11 |
Value | Count | Frequency (%) |
19910408 | 1 | 4.2% |
19950331 | 3 | |
19950408 | 1 | 4.2% |
19950504 | 1 | 4.2% |
19960108 | 1 | 4.2% |
19960415 | 1 | 4.2% |
19960703 | 2 | |
19960704 | 1 | 4.2% |
19970701 | 1 | 4.2% |
19980219 | 1 | 4.2% |
Value | Count | Frequency (%) |
20130911 | 1 | |
20111110 | 1 | |
20100302 | 1 | |
20080102 | 1 | |
20071029 | 1 | |
20070406 | 1 | |
20070312 | 1 | |
20070306 | 1 | |
20070216 | 1 | |
20050920 | 1 |
Distinct | 2 |
---|---|
Distinct (%) | 8.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 320.0 B |
운영중 | |
---|---|
폐업 등 | 1 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.041666667 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 4.2% |
Sample
1st row | 운영중 |
---|---|
2nd row | 운영중 |
3rd row | 운영중 |
4th row | 운영중 |
5th row | 운영중 |
Common Values
Value | Count | Frequency (%) |
운영중 | 23 | |
폐업 등 | 1 | 4.2% |
Length
Histogram of lengths of the category
Category Frequency Plot
Value | Count | Frequency (%) |
운영중 | 23 | |
폐업 | 1 | 4.0% |
등 | 1 | 4.0% |
Distinct | 1 |
---|---|
Distinct (%) | 4.3% |
Missing | 1 |
Missing (%) | 4.2% |
Memory size | 176.0 B |
False | |
---|---|
(Missing) | 1 |
Value | Count | Frequency (%) |
False | 23 | |
(Missing) | 1 | 4.2% |
Distinct | 1 |
---|---|
Distinct (%) | 4.3% |
Missing | 1 |
Missing (%) | 4.2% |
Memory size | 320.0 B |
공중이용시설 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 공중이용시설 |
---|---|
2nd row | 공중이용시설 |
3rd row | 공중이용시설 |
4th row | 공중이용시설 |
5th row | 공중이용시설 |
Common Values
Value | Count | Frequency (%) |
공중이용시설 | 23 | |
(Missing) | 1 | 4.2% |
Length
Histogram of lengths of the category
Category Frequency Plot
Value | Count | Frequency (%) |
공중이용시설 | 23 |
Distinct | 1 |
---|---|
Distinct (%) | 4.3% |
Missing | 1 |
Missing (%) | 4.2% |
Memory size | 320.0 B |
공연장 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 공연장 |
---|---|
2nd row | 공연장 |
3rd row | 공연장 |
4th row | 공연장 |
5th row | 공연장 |
Common Values
Value | Count | Frequency (%) |
공연장 | 23 | |
(Missing) | 1 | 4.2% |
Length
Histogram of lengths of the category
Category Frequency Plot
Value | Count | Frequency (%) |
공연장 | 23 |
Distinct | 22 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 8.3% |
Memory size | 320.0 B |
경기도 안산시 상록구 광덕1로 385 | 1 |
---|---|
경기도 구리시 경춘로 243 (인창동) | 1 |
경기도 군포시 고산로 599 (산본동, 군포문화예술회관) | 1 |
경기도 부천시 부일로 365 (중동) | 1 |
경기도 성남시 수정구 수정로153번길 3 (태평동) | 1 |
Other values (17) |
Length
Max length | 32 |
---|---|
Median length | 28.5 |
Mean length | 24.45454545 |
Min length | 19 |
Unique
Unique | 22 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 경기도 과천시 통영로 5 (중앙동) |
---|---|
2nd row | 경기도 광주시 회안대로 891 (송정동) |
3rd row | 경기도 구리시 경춘로 243 (인창동) |
4th row | 경기도 군포시 고산로 599 (산본동, 군포문화예술회관) |
5th row | 경기도 부천시 부일로 365 (중동) |
Common Values
Value | Count | Frequency (%) |
경기도 안산시 상록구 광덕1로 385 | 1 | 4.2% |
경기도 구리시 경춘로 243 (인창동) | 1 | 4.2% |
경기도 군포시 고산로 599 (산본동, 군포문화예술회관) | 1 | 4.2% |
경기도 부천시 부일로 365 (중동) | 1 | 4.2% |
경기도 성남시 수정구 수정로153번길 3 (태평동) | 1 | 4.2% |
경기도 성남시 분당구 성남대로 808 (야탑동) | 1 | 4.2% |
경기도 수원시 영통구 신원로 231 (매탄동) | 1 | 4.2% |
경기도 안산시 단원구 화랑로 312 (고잔동, 817) | 1 | 4.2% |
경기도 안산시 상록구 석호로 226 (사동) | 1 | 4.2% |
경기도 광주시 회안대로 891 (송정동) | 1 | 4.2% |
Other values (12) | 12 | |
(Missing) | 2 | 8.3% |
Length
Histogram of lengths of the category
Value | Count | Frequency (%) |
경기도 | 22 | 18.5% |
의정부시 | 8 | 6.7% |
의정부동 | 4 | 3.4% |
안산시 | 3 | 2.5% |
16 | 2 | 1.7% |
가능동 | 2 | 1.7% |
성남시 | 2 | 1.7% |
태평로 | 2 | 1.7% |
상록구 | 2 | 1.7% |
죽전동 | 1 | 0.8% |
Other values (71) | 71 |
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 320.0 B |
경기도 과천시 중앙동 6-2번지 | 1 |
---|---|
경기도 광주시 송정동 340-1번지 52필지 | 1 |
경기도 구리시 인창동 676-2번지 | 1 |
경기도 군포시 산본동 1101번지 군포문화예술회관 | 1 |
경기도 부천시 중동 788번지 | 1 |
Other values (19) |
Length
Max length | 29 |
---|---|
Median length | 24 |
Mean length | 21.33333333 |
Min length | 16 |
Unique
Unique | 24 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 경기도 과천시 중앙동 6-2번지 |
---|---|
2nd row | 경기도 광주시 송정동 340-1번지 52필지 |
3rd row | 경기도 구리시 인창동 676-2번지 |
4th row | 경기도 군포시 산본동 1101번지 군포문화예술회관 |
5th row | 경기도 부천시 중동 788번지 |
Common Values
Value | Count | Frequency (%) |
경기도 과천시 중앙동 6-2번지 | 1 | 4.2% |
경기도 광주시 송정동 340-1번지 52필지 | 1 | 4.2% |
경기도 구리시 인창동 676-2번지 | 1 | 4.2% |
경기도 군포시 산본동 1101번지 군포문화예술회관 | 1 | 4.2% |
경기도 부천시 중동 788번지 | 1 | 4.2% |
경기도 성남시 수정구 태평동 3493-1번지 | 1 | 4.2% |
경기도 성남시 분당구 야탑동 757번지 | 1 | 4.2% |
경기도 수원시 영통구 매탄동 491-10번지 | 1 | 4.2% |
경기도 수원시 팔달구 인계동 1117번지 | 1 | 4.2% |
경기도 안산시 단원구 고잔동 817번지 | 1 | 4.2% |
Other values (14) | 14 |
Length
Histogram of lengths of the category
Value | Count | Frequency (%) |
경기도 | 24 | |
의정부시 | 8 | 7.3% |
의정부동 | 6 | 5.5% |
안산시 | 3 | 2.8% |
수원시 | 2 | 1.8% |
성남시 | 2 | 1.8% |
상록구 | 2 | 1.8% |
가능동 | 2 | 1.8% |
안양동 | 1 | 0.9% |
550번지 | 1 | 0.9% |
Other values (58) | 58 |
소재지우편번호
Real number (ℝ≥0)
Distinct | 22 |
---|---|
Distinct (%) | 91.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 438652.0833 |
Minimum | 14613 |
---|---|
Maximum | 480849 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 344.0 B |
Quantile statistics
Minimum | 14613 |
---|---|
5-th percentile | 426043.7 |
Q1 | 434556.75 |
median | 460817.5 |
Q3 | 480209.5 |
95-th percentile | 480848.85 |
Maximum | 480849 |
Range | 466236 |
Interquartile range (IQR) | 45652.75 |
Descriptive statistics
Standard deviation | 92788.95435 |
---|---|
Coefficient of variation (CV) | 0.2115320042 |
Kurtosis | 21.21328238 |
Mean | 438652.0833 |
Median Absolute Deviation (MAD) | 20010 |
Skewness | -4.486669215 |
Sum | 10527650 |
Variance | 8609790050 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=22)
Value | Count | Frequency (%) |
480010 | 2 | 8.3% |
480849 | 2 | 8.3% |
426824 | 1 | 4.2% |
459813 | 1 | 4.2% |
471010 | 1 | 4.2% |
435802 | 1 | 4.2% |
14613 | 1 | 4.2% |
461822 | 1 | 4.2% |
463839 | 1 | 4.2% |
443803 | 1 | 4.2% |
Other values (12) | 12 |
Value | Count | Frequency (%) |
14613 | 1 | |
425906 | 1 | |
426824 | 1 | |
426863 | 1 | |
427805 | 1 | |
430821 | 1 | |
435802 | 1 | |
437802 | 1 | |
442835 | 1 | |
443803 | 1 |
Value | Count | Frequency (%) |
480849 | 2 | |
480848 | 1 | |
480842 | 1 | |
480813 | 1 | |
480808 | 1 | |
480010 | 2 | |
471010 | 1 | |
464903 | 1 | |
463839 | 1 | |
461822 | 1 |
Distinct | 23 |
---|---|
Distinct (%) | 100.0% |
Missing | 1 |
Missing (%) | 4.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.49138836 |
Minimum | 37.06722319 |
---|---|
Maximum | 37.75288059 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 344.0 B |
Quantile statistics
Minimum | 37.06722319 |
---|---|
5-th percentile | 37.25703997 |
Q1 | 37.32216528 |
median | 37.42820745 |
Q3 | 37.73647778 |
95-th percentile | 37.7493418 |
Maximum | 37.75288059 |
Range | 0.6856574034 |
Interquartile range (IQR) | 0.4143125 |
Descriptive statistics
Standard deviation | 0.2101527799 |
---|---|
Coefficient of variation (CV) | 0.005605361367 |
Kurtosis | -1.190226342 |
Mean | 37.49138836 |
Median Absolute Deviation (MAD) | 0.1635382892 |
Skewness | -0.01876040937 |
Sum | 862.3019322 |
Variance | 0.0441641909 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=23)
Value | Count | Frequency (%) |
37.42820745 | 1 | 4.2% |
37.38460579 | 1 | 4.2% |
37.06722319 | 1 | 4.2% |
37.60179457 | 1 | 4.2% |
37.36578447 | 1 | 4.2% |
37.48863293 | 1 | 4.2% |
37.44278795 | 1 | 4.2% |
37.40295481 | 1 | 4.2% |
37.25619228 | 1 | 4.2% |
37.26466916 | 1 | 4.2% |
Other values (13) | 13 |
Value | Count | Frequency (%) |
37.06722319 | 1 | |
37.25619228 | 1 | |
37.26466916 | 1 | |
37.29335732 | 1 | |
37.30904906 | 1 | |
37.31846036 | 1 | |
37.32587019 | 1 | |
37.36578447 | 1 | |
37.38460579 | 1 | |
37.40295481 | 1 |
Value | Count | Frequency (%) |
37.75288059 | 1 | |
37.75046674 | 1 | |
37.7392173 | 1 | |
37.73844489 | 1 | |
37.73702655 | 1 | |
37.73669097 | 1 | |
37.73626458 | 1 | |
37.73443929 | 1 | |
37.60179457 | 1 | |
37.48863293 | 1 |
Distinct | 23 |
---|---|
Distinct (%) | 100.0% |
Missing | 1 |
Missing (%) | 4.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.0184678 |
Minimum | 126.770656 |
---|---|
Maximum | 127.2547251 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 344.0 B |
Quantile statistics
Minimum | 126.770656 |
---|---|
5-th percentile | 126.8258818 |
Q1 | 126.9602212 |
median | 127.0417033 |
Q3 | 127.0625978 |
95-th percentile | 127.1415412 |
Maximum | 127.2547251 |
Range | 0.4840691034 |
Interquartile range (IQR) | 0.1023765342 |
Descriptive statistics
Standard deviation | 0.1139008465 |
---|---|
Coefficient of variation (CV) | 0.0008967266605 |
Kurtosis | 0.3090387353 |
Mean | 127.0184678 |
Median Absolute Deviation (MAD) | 0.0525937324 |
Skewness | -0.4704493445 |
Sum | 2921.424759 |
Variance | 0.01297340282 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=23)
Value | Count | Frequency (%) |
126.9891096 | 1 | 4.2% |
126.9313329 | 1 | 4.2% |
127.0654116 | 1 | 4.2% |
127.1420645 | 1 | 4.2% |
126.9274659 | 1 | 4.2% |
126.770656 | 1 | 4.2% |
127.1368318 | 1 | 4.2% |
127.1305499 | 1 | 4.2% |
127.0597839 | 1 | 4.2% |
127.0394453 | 1 | 4.2% |
Other values (13) | 13 |
Value | Count | Frequency (%) |
126.770656 | 1 | |
126.8230003 | 1 | |
126.8518153 | 1 | |
126.8553572 | 1 | |
126.9274659 | 1 | |
126.9313329 | 1 | |
126.9891096 | 1 | |
127.0325876 | 1 | |
127.0352931 | 1 | |
127.0394453 | 1 |
Value | Count | Frequency (%) |
127.2547251 | 1 | |
127.1420645 | 1 | |
127.1368318 | 1 | |
127.1305499 | 1 | |
127.1059038 | 1 | |
127.0654116 | 1 | |
127.0597839 | 1 | |
127.0520522 | 1 | |
127.0513325 | 1 | |
127.0452217 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.