Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.6 KiB |
Average record size in memory | 78.3 B |
Variable types
Categorical | 6 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 한국문화정보원 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=b496a15e-81f5-44d4-b516-7a594f4abfd3 |
sccnt_ym has constant value "" | Constant |
file_name has constant value "" | Constant |
base_ymd has constant value "" | Constant |
ctprvn_nm is highly overall correlated with adstrd_cd and 1 other fields | High correlation |
mlsfc is highly overall correlated with ctprvn_nm and 1 other fields | High correlation |
adstrd_cd is highly overall correlated with ctprvn_nm | High correlation |
sccnt is highly overall correlated with mlsfc | High correlation |
mlsfc is highly imbalanced (80.6%) | Imbalance |
sccnt is highly imbalanced (50.8%) | Imbalance |
sgnr_nm has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 10:09:33.131896 |
---|---|
Analysis finished | 2023-12-10 10:09:35.275012 |
Duration | 2.14 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
ctprvn_nm
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
경기도 | |
---|---|
경상남도 | |
강원도 | |
경상북도 | |
충청북도 | 3 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.4 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강원도 |
---|---|
2nd row | 충청북도 |
3rd row | 강원도 |
4th row | 강원도 |
5th row | 강원도 |
Common Values
Value | Count | Frequency (%) |
경기도 | 44 | |
경상남도 | 22 | |
강원도 | 16 | 16.0% |
경상북도 | 15 | 15.0% |
충청북도 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
경기도 | 44 | |
경상남도 | 22 | |
강원도 | 16 | 16.0% |
경상북도 | 15 | 15.0% |
충청북도 | 3 | 3.0% |
sgnr_nm
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
창원시 | 5 | 4.0% |
수원시 | 4 | 3.2% |
용인시 | 3 | 2.4% |
부천시 | 3 | 2.4% |
성남시 | 3 | 2.4% |
고양시 | 3 | 2.4% |
안양시 | 2 | 1.6% |
안산시 | 2 | 1.6% |
양산시 | 1 | 0.8% |
산청군 | 1 | 0.8% |
Other values (97) | 97 |
Most occurring characters
Value | Count | Frequency (%) |
시 | 72 | |
군 | 31 | 7.7% |
구 | 27 | 6.7% |
24 | 5.9% | |
양 | 16 | 4.0% |
천 | 16 | 4.0% |
원 | 15 | 3.7% |
주 | 13 | 3.2% |
산 | 11 | 2.7% |
안 | 10 | 2.5% |
Other values (81) | 170 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 381 | |
Space Separator | 24 | 5.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 72 | |
군 | 31 | 8.1% |
구 | 27 | 7.1% |
양 | 16 | 4.2% |
천 | 16 | 4.2% |
원 | 15 | 3.9% |
주 | 13 | 3.4% |
산 | 11 | 2.9% |
안 | 10 | 2.6% |
창 | 9 | 2.4% |
Other values (80) | 161 |
Space Separator
Value | Count | Frequency (%) |
24 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 381 | |
Common | 24 | 5.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 72 | |
군 | 31 | 8.1% |
구 | 27 | 7.1% |
양 | 16 | 4.2% |
천 | 16 | 4.2% |
원 | 15 | 3.9% |
주 | 13 | 3.4% |
산 | 11 | 2.9% |
안 | 10 | 2.6% |
창 | 9 | 2.4% |
Other values (80) | 161 |
Common
Value | Count | Frequency (%) |
24 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 381 | |
ASCII | 24 | 5.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 72 | |
군 | 31 | 8.1% |
구 | 27 | 7.1% |
양 | 16 | 4.2% |
천 | 16 | 4.2% |
원 | 15 | 3.9% |
주 | 13 | 3.4% |
산 | 11 | 2.9% |
안 | 10 | 2.6% |
창 | 9 | 2.4% |
Other values (80) | 161 |
ASCII
Value | Count | Frequency (%) |
24 |
adstrd_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 97 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.3858224 × 109 |
Minimum | 1.138051 × 109 |
---|---|
Maximum | 4.889025 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1.138051 × 109 |
---|---|
5-th percentile | 4.113082 × 109 |
Q1 | 4.1405535 × 109 |
median | 4.247541 × 109 |
Q3 | 4.7832752 × 109 |
95-th percentile | 4.884075 × 109 |
Maximum | 4.889025 × 109 |
Range | 3.750974 × 109 |
Interquartile range (IQR) | 6.427217 × 108 |
Descriptive statistics
Standard deviation | 4.534638 × 108 |
---|---|
Coefficient of variation (CV) | 0.10339311 |
Kurtosis | 25.529683 |
Mean | 4.3858224 × 109 |
Median Absolute Deviation (MAD) | 1.284707 × 108 |
Skewness | -3.5325477 |
Sum | 4.3858224 × 1011 |
Variance | 2.0562942 × 1017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4812760000 | 3 | 3.0% |
4833051000 | 2 | 2.0% |
4159025900 | 1 | 1.0% |
4872025000 | 1 | 1.0% |
4886025000 | 1 | 1.0% |
4824052000 | 1 | 1.0% |
4827025000 | 1 | 1.0% |
4884025000 | 1 | 1.0% |
4825052000 | 1 | 1.0% |
4882025000 | 1 | 1.0% |
Other values (87) | 87 |
Value | Count | Frequency (%) |
1138051000 | 1 | |
4111159700 | 1 | |
4111368000 | 1 | |
4111574000 | 1 | |
4111752000 | 1 | |
4113152000 | 1 | |
4113351000 | 1 | |
4113566500 | 1 | |
4115062000 | 1 | |
4117154000 | 1 |
Value | Count | Frequency (%) |
4889025000 | 1 | |
4888025000 | 1 | |
4887025000 | 1 | |
4886025000 | 1 | |
4885025000 | 1 | |
4884025000 | 1 | |
4882025000 | 1 | |
4874025300 | 1 | |
4873025000 | 1 | |
4872025000 | 1 |
sccnt_ym
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
202001 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202001 |
---|---|
2nd row | 202001 |
3rd row | 202001 |
4th row | 202001 |
5th row | 202001 |
Common Values
Value | Count | Frequency (%) |
202001 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
202001 | 100 |
mlsfc
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
공공도서관 | |
---|---|
문예회관 | 3 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.97 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 공공도서관 |
---|---|
2nd row | 문예회관 |
3rd row | 공공도서관 |
4th row | 공공도서관 |
5th row | 공공도서관 |
Common Values
Value | Count | Frequency (%) |
공공도서관 | 97 | |
문예회관 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
공공도서관 | 97 | |
문예회관 | 3 | 3.0% |
fclt_cnt
Real number (ℝ)
Distinct | 14 |
---|---|
Distinct (%) | 14.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.45 |
Minimum | 1 |
---|---|
Maximum | 17 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 4 |
Q3 | 6 |
95-th percentile | 10.05 |
Maximum | 17 |
Range | 16 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 3.0989571 |
---|---|
Coefficient of variation (CV) | 0.69639486 |
Kurtosis | 2.8011485 |
Mean | 4.45 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.4297864 |
Sum | 445 |
Variance | 9.6035354 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 18 | |
1 | 15 | |
2 | 15 | |
5 | 12 | |
4 | 10 | |
6 | 9 | |
7 | 8 | |
8 | 4 | 4.0% |
10 | 2 | 2.0% |
9 | 2 | 2.0% |
Other values (4) | 5 | 5.0% |
Value | Count | Frequency (%) |
1 | 15 | |
2 | 15 | |
3 | 18 | |
4 | 10 | |
5 | 12 | |
6 | 9 | |
7 | 8 | |
8 | 4 | 4.0% |
9 | 2 | 2.0% |
10 | 2 | 2.0% |
Value | Count | Frequency (%) |
17 | 1 | 1.0% |
15 | 1 | 1.0% |
12 | 1 | 1.0% |
11 | 2 | 2.0% |
10 | 2 | 2.0% |
9 | 2 | 2.0% |
8 | 4 | 4.0% |
7 | 8 | |
6 | 9 | |
5 | 12 |
sccnt
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
0 | |
896 | 1 |
582 | 1 |
575 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 2.7 |
Min length | 1 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | 4.0% |
Sample
1st row | 0 |
---|---|
2nd row | 896 |
3rd row | <NA> |
4th row | 0 |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 54 | |
0 | 42 | |
896 | 1 | 1.0% |
582 | 1 | 1.0% |
575 | 1 | 1.0% |
533 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 54 | |
0 | 42 | |
896 | 1 | 1.0% |
582 | 1 | 1.0% |
575 | 1 | 1.0% |
533 | 1 | 1.0% |
file_name
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
KC_597_DGT_CLT_STATN_BIZAEA_2021 |
---|
Length
Max length | 32 |
---|---|
Median length | 32 |
Mean length | 32 |
Min length | 32 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KC_597_DGT_CLT_STATN_BIZAEA_2021 |
---|---|
2nd row | KC_597_DGT_CLT_STATN_BIZAEA_2021 |
3rd row | KC_597_DGT_CLT_STATN_BIZAEA_2021 |
4th row | KC_597_DGT_CLT_STATN_BIZAEA_2021 |
5th row | KC_597_DGT_CLT_STATN_BIZAEA_2021 |
Common Values
Value | Count | Frequency (%) |
KC_597_DGT_CLT_STATN_BIZAEA_2021 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kc_597_dgt_clt_statn_bizaea_2021 | 100 |
base_ymd
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
20200101 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200101 |
---|---|
2nd row | 20200101 |
3rd row | 20200101 |
4th row | 20200101 |
5th row | 20200101 |
Common Values
Value | Count | Frequency (%) |
20200101 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20200101 | 100 |
ctprvn_nm | sgnr_nm | adstrd_cd | mlsfc | fclt_cnt | sccnt | |
---|---|---|---|---|---|---|
ctprvn_nm | 1.000 | 1.000 | 0.766 | 1.000 | 0.336 | 0.582 |
sgnr_nm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
adstrd_cd | 0.766 | 1.000 | 1.000 | 0.106 | 0.296 | 0.283 |
mlsfc | 1.000 | 1.000 | 0.106 | 1.000 | 0.000 | 0.549 |
fclt_cnt | 0.336 | 1.000 | 0.296 | 0.000 | 1.000 | 0.000 |
sccnt | 0.582 | 1.000 | 0.283 | 0.549 | 0.000 | 1.000 |
sccnt | ctprvn_nm | mlsfc | |
---|---|---|---|
sccnt | 1.000 | 0.247 | 0.640 |
ctprvn_nm | 0.247 | 1.000 | 0.985 |
mlsfc | 0.640 | 0.985 | 1.000 |
adstrd_cd | fclt_cnt | ctprvn_nm | mlsfc | sccnt | |
---|---|---|---|---|---|
adstrd_cd | 1.000 | -0.479 | 0.619 | 0.143 | 0.034 |
fclt_cnt | -0.479 | 1.000 | 0.198 | 0.000 | 0.000 |
ctprvn_nm | 0.619 | 0.198 | 1.000 | 0.985 | 0.247 |
mlsfc | 0.143 | 0.000 | 0.985 | 1.000 | 0.640 |
sccnt | 0.034 | 0.000 | 0.247 | 0.640 | 1.000 |
ctprvn_nm | sgnr_nm | adstrd_cd | sccnt_ym | mlsfc | fclt_cnt | sccnt | file_name | base_ymd | |
---|---|---|---|---|---|---|---|---|---|
0 | 강원도 | 강릉시 | 4215061500 | 202001 | 공공도서관 | 4 | 0 | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
1 | 충청북도 | 진천군 | 4375025000 | 202001 | 문예회관 | 1 | 896 | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
2 | 강원도 | 동해시 | 4217054000 | 202001 | 공공도서관 | 3 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
3 | 강원도 | 삼척시 | 4223057000 | 202001 | 공공도서관 | 3 | 0 | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
4 | 강원도 | 속초시 | 4221056000 | 202001 | 공공도서관 | 3 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
5 | 강원도 | 양구군 | 4280025000 | 202001 | 공공도서관 | 1 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
6 | 강원도 | 양양군 | 4283025000 | 202001 | 공공도서관 | 1 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
7 | 충청북도 | 청주시 | 4311251000 | 202001 | 문예회관 | 2 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
8 | 강원도 | 원주시 | 4213025000 | 202001 | 공공도서관 | 4 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
9 | 강원도 | 인제군 | 4281025000 | 202001 | 공공도서관 | 2 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
ctprvn_nm | sgnr_nm | adstrd_cd | sccnt_ym | mlsfc | fclt_cnt | sccnt | file_name | base_ymd | |
---|---|---|---|---|---|---|---|---|---|
90 | 경상북도 | 김천시 | 4715053600 | 202001 | 공공도서관 | 1 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
91 | 경상북도 | 문경시 | 4728059000 | 202001 | 공공도서관 | 5 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
92 | 경상북도 | 봉화군 | 4792025000 | 202001 | 공공도서관 | 1 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
93 | 경상북도 | 상주시 | 4725052000 | 202001 | 공공도서관 | 2 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
94 | 경상북도 | 성주군 | 4784025000 | 202001 | 공공도서관 | 2 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
95 | 경상북도 | 안동시 | 4717058500 | 202001 | 공공도서관 | 5 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
96 | 경상북도 | 영덕군 | 4777025000 | 202001 | 공공도서관 | 1 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
97 | 경상북도 | 영양군 | 4776025000 | 202001 | 공공도서관 | 1 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
98 | 경상북도 | 영주시 | 4721063000 | 202001 | 공공도서관 | 3 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |
99 | 경상북도 | 영천시 | 4723025000 | 202001 | 공공도서관 | 2 | <NA> | KC_597_DGT_CLT_STATN_BIZAEA_2021 | 20200101 |