Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 1685 |
Duplicate rows (%) | 16.9% |
Total size in memory | 634.8 KiB |
Average record size in memory | 65.0 B |
Variable types
DateTime | 3 |
---|---|
Categorical | 2 |
Numeric | 1 |
Text | 1 |
Dataset
Description | N/A |
---|---|
Author | 충청북도 제천시 |
URL | https://www.data.go.kr/data/15122293/fileData.do |
입금상태 has constant value "" | Constant |
데이터기준일 has constant value "" | Constant |
Dataset has 1685 (16.9%) duplicate rows | Duplicates |
고지구분 is highly imbalanced (56.3%) | Imbalance |
Reproduction
Analysis started | 2024-04-21 09:25:09.148870 |
---|---|
Analysis finished | 2024-04-21 09:25:10.529002 |
Duration | 1.38 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
거래일자
Date
Distinct | 563 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2022-01-01 00:00:00 |
---|---|
Maximum | 2023-07-31 00:00:00 |
입금상태
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
입금 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 입금 |
---|---|
2nd row | 입금 |
3rd row | 입금 |
4th row | 입금 |
5th row | 입금 |
Common Values
Value | Count | Frequency (%) |
입금 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
입금 | 10000 |
납기일자
Date
Distinct | 405 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2022-01-03 00:00:00 |
---|---|
Maximum | 2023-11-27 00:00:00 |
고지구분
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
일반 | |
---|---|
독촉 | |
체납 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 독촉 |
---|---|
2nd row | 일반 |
3rd row | 독촉 |
4th row | 일반 |
5th row | 일반 |
Common Values
Value | Count | Frequency (%) |
일반 | 8145 | |
독촉 | 1854 | 18.5% |
체납 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
일반 | 8145 | |
독촉 | 1854 | 18.5% |
체납 | 1 | < 0.1% |
세목코드
Real number (ℝ)
Distinct | 127 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 247150.81 |
Minimum | 201001 |
---|---|
Maximum | 715002 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 201001 |
---|---|
5-th percentile | 202002 |
Q1 | 205009 |
median | 219216 |
Q3 | 288130 |
95-th percentile | 294099 |
Maximum | 715002 |
Range | 514001 |
Interquartile range (IQR) | 83121 |
Descriptive statistics
Standard deviation | 61101.136 |
---|---|
Coefficient of variation (CV) | 0.24722207 |
Kurtosis | 30.904812 |
Mean | 247150.81 |
Median Absolute Deviation (MAD) | 17214 |
Skewness | 4.386167 |
Sum | 2.4715081 × 109 |
Variance | 3.7333488 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
205007 | 1155 | |
288131 | 1104 | |
211173 | 981 | 9.8% |
288080 | 957 | 9.6% |
219216 | 818 | 8.2% |
202002 | 702 | 7.0% |
288130 | 506 | 5.1% |
205009 | 420 | 4.2% |
294099 | 345 | 3.5% |
206001 | 307 | 3.1% |
Other values (117) | 2705 |
Value | Count | Frequency (%) |
201001 | 77 | 0.8% |
201002 | 1 | < 0.1% |
202001 | 180 | 1.8% |
202002 | 702 | |
202009 | 9 | 0.1% |
202012 | 4 | < 0.1% |
202099 | 4 | < 0.1% |
205004 | 9 | 0.1% |
205006 | 5 | 0.1% |
205007 | 1155 |
Value | Count | Frequency (%) |
715002 | 57 | |
715001 | 41 | 0.4% |
299099 | 58 | |
295064 | 1 | < 0.1% |
295062 | 124 | |
295043 | 2 | < 0.1% |
295038 | 2 | < 0.1% |
295025 | 2 | < 0.1% |
294944 | 4 | < 0.1% |
294907 | 1 | < 0.1% |
세목명
Text
Distinct | 136 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 18 |
---|---|
Median length | 15 |
Mean length | 8.8094 |
Min length | 2 |
Characters and Unicode
Total characters | 88094 |
---|---|
Distinct characters | 195 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 29 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | 자동차손해배상보장법위반과태료 |
---|---|
2nd row | 화물자동차운수사업법위반과징금 |
3rd row | 부가가치세 |
4th row | 차량출입시설 |
5th row | 학사사용료 |
Value | Count | Frequency (%) |
차량출입시설 | 1155 | 11.6% |
자동차검사지연과태료 | 1104 | 11.0% |
학사사용료 | 981 | 9.8% |
자동차손해배상보장법위반과태료 | 956 | 9.6% |
보건진료소진료사업수입 | 818 | 8.2% |
장애인주차구역위반과태료 | 506 | 5.1% |
옥외간판도로공간사용료 | 396 | 4.0% |
시군구재산대부료 | 376 | 3.8% |
그외수입 | 346 | 3.5% |
시군구재산임대료 | 326 | 3.3% |
Other values (126) | 3036 |
Most occurring characters
Value | Count | Frequency (%) |
료 | 8296 | 9.4% |
사 | 5311 | 6.0% |
차 | 3952 | 4.5% |
과 | 3293 | 3.7% |
태 | 3234 | 3.7% |
입 | 2706 | 3.1% |
자 | 2570 | 2.9% |
동 | 2331 | 2.6% |
용 | 2297 | 2.6% |
반 | 2256 | 2.6% |
Other values (185) | 51848 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 88079 | |
Close Punctuation | 5 | < 0.1% |
Open Punctuation | 5 | < 0.1% |
Decimal Number | 2 | < 0.1% |
Dash Punctuation | 2 | < 0.1% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
료 | 8296 | 9.4% |
사 | 5311 | 6.0% |
차 | 3952 | 4.5% |
과 | 3293 | 3.7% |
태 | 3234 | 3.7% |
입 | 2706 | 3.1% |
자 | 2570 | 2.9% |
동 | 2331 | 2.6% |
용 | 2297 | 2.6% |
반 | 2256 | 2.6% |
Other values (180) | 51833 |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 |
Decimal Number
Value | Count | Frequency (%) |
4 | 2 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 88079 | |
Common | 15 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
료 | 8296 | 9.4% |
사 | 5311 | 6.0% |
차 | 3952 | 4.5% |
과 | 3293 | 3.7% |
태 | 3234 | 3.7% |
입 | 2706 | 3.1% |
자 | 2570 | 2.9% |
동 | 2331 | 2.6% |
용 | 2297 | 2.6% |
반 | 2256 | 2.6% |
Other values (180) | 51833 |
Common
Value | Count | Frequency (%) |
) | 5 | |
( | 5 | |
4 | 2 | 13.3% |
- | 2 | 13.3% |
1 | 6.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 88079 | |
ASCII | 15 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
료 | 8296 | 9.4% |
사 | 5311 | 6.0% |
차 | 3952 | 4.5% |
과 | 3293 | 3.7% |
태 | 3234 | 3.7% |
입 | 2706 | 3.1% |
자 | 2570 | 2.9% |
동 | 2331 | 2.6% |
용 | 2297 | 2.6% |
반 | 2256 | 2.6% |
Other values (180) | 51833 |
ASCII
Value | Count | Frequency (%) |
) | 5 | |
( | 5 | |
4 | 2 | 13.3% |
- | 2 | 13.3% |
1 | 6.7% |
데이터기준일
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2023-09-01 00:00:00 |
---|---|
Maximum | 2023-09-01 00:00:00 |
고지구분 | 세목코드 | |
---|---|---|
고지구분 | 1.000 | 0.416 |
세목코드 | 0.416 | 1.000 |
세목코드 | 고지구분 | |
---|---|---|
세목코드 | 1.000 | 0.154 |
고지구분 | 0.154 | 1.000 |
거래일자 | 입금상태 | 납기일자 | 고지구분 | 세목코드 | 세목명 | 데이터기준일 | |
---|---|---|---|---|---|---|---|
1489 | 2022-02-25 | 입금 | 2022-03-07 | 독촉 | 288080 | 자동차손해배상보장법위반과태료 | 2023-09-01 |
8686 | 2022-10-31 | 입금 | 2022-10-31 | 일반 | 288238 | 화물자동차운수사업법위반과징금 | 2023-09-01 |
3553 | 2022-05-27 | 입금 | 2022-05-27 | 독촉 | 299099 | 부가가치세 | 2023-09-01 |
15393 | 2023-06-23 | 입금 | 2023-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 |
9864 | 2022-12-27 | 입금 | 2022-12-27 | 일반 | 211173 | 학사사용료 | 2023-09-01 |
2820 | 2022-04-26 | 입금 | 2022-05-02 | 독촉 | 288131 | 자동차검사지연과태료 | 2023-09-01 |
4721 | 2022-06-20 | 입금 | 2022-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 |
44 | 2022-01-05 | 입금 | 2022-01-24 | 일반 | 288702 | 감염병예방및관리법과태료 | 2023-09-01 |
13456 | 2023-05-10 | 입금 | 2023-05-10 | 일반 | 211173 | 학사사용료 | 2023-09-01 |
12528 | 2023-03-28 | 입금 | 2023-05-22 | 일반 | 251002 | 시군구유재산매각수입금 | 2023-09-01 |
거래일자 | 입금상태 | 납기일자 | 고지구분 | 세목코드 | 세목명 | 데이터기준일 | |
---|---|---|---|---|---|---|---|
3320 | 2022-05-19 | 입금 | 2022-05-31 | 독촉 | 288131 | 자동차검사지연과태료 | 2023-09-01 |
5792 | 2022-06-30 | 입금 | 2022-06-30 | 일반 | 205008 | 사설안내표지판 | 2023-09-01 |
5780 | 2022-06-30 | 입금 | 2022-06-30 | 독촉 | 288080 | 자동차손해배상보장법위반과태료 | 2023-09-01 |
4971 | 2022-06-23 | 입금 | 2022-06-23 | 일반 | 219216 | 보건진료소진료사업수입 | 2023-09-01 |
10039 | 2023-01-04 | 입금 | 2023-01-04 | 일반 | 715001 | 국고보조금등반환금 | 2023-09-01 |
9453 | 2022-12-06 | 입금 | 2023-01-02 | 일반 | 288131 | 자동차검사지연과태료 | 2023-09-01 |
16558 | 2023-07-10 | 입금 | 2023-07-10 | 일반 | 219216 | 보건진료소진료사업수입 | 2023-09-01 |
5631 | 2022-06-30 | 입금 | 2022-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 |
17030 | 2023-07-26 | 입금 | 2023-08-31 | 일반 | 288133 | 쓰레기불법투기과태료 | 2023-09-01 |
14004 | 2023-05-31 | 입금 | 2023-05-31 | 독촉 | 288080 | 자동차손해배상보장법위반과태료 | 2023-09-01 |
Most frequently occurring
거래일자 | 입금상태 | 납기일자 | 고지구분 | 세목코드 | 세목명 | 데이터기준일 | # duplicates | |
---|---|---|---|---|---|---|---|---|
543 | 2022-06-30 | 입금 | 2022-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 95 |
1586 | 2023-06-30 | 입금 | 2023-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 87 |
461 | 2022-06-20 | 입금 | 2022-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 44 |
1491 | 2023-06-19 | 입금 | 2023-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 42 |
444 | 2022-06-17 | 입금 | 2022-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 40 |
536 | 2022-06-29 | 입금 | 2022-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 37 |
431 | 2022-06-16 | 입금 | 2022-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 36 |
1302 | 2023-04-10 | 입금 | 2023-04-10 | 일반 | 211173 | 학사사용료 | 2023-09-01 | 36 |
1467 | 2023-06-16 | 입금 | 2023-06-30 | 일반 | 205007 | 차량출입시설 | 2023-09-01 | 36 |
1365 | 2023-05-10 | 입금 | 2023-05-10 | 일반 | 211173 | 학사사용료 | 2023-09-01 | 35 |