Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 3254 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 162.2 KiB |
Average record size in memory | 51.0 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 3 |
Dataset
Description | 공단의 여객선 항로별 이용 관련 정보로 아래와 같은 데이터를 제공하고 있습니다. (지사명, 항로명, 연_월, 합계, 일반, 도서민) |
---|---|
URL | https://www.data.go.kr/data/15117984/fileData.do |
합계 is highly overall correlated with 일반 and 1 other fields | High correlation |
일반 is highly overall correlated with 합계 and 1 other fields | High correlation |
도서민 is highly overall correlated with 합계 and 1 other fields | High correlation |
합계 has 159 (4.9%) zeros | Zeros |
일반 has 160 (4.9%) zeros | Zeros |
도서민 has 349 (10.7%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 17:06:58.803995 |
---|---|
Analysis finished | 2023-12-12 17:07:00.509234 |
Duration | 1.71 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
지사명
Categorical
Distinct | 12 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.6 KiB |
목포지사 | |
---|---|
인천지사 | |
통영지사 | |
완도지사 | |
여수지사 | |
Other values (7) |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산지사 |
---|---|
2nd row | 부산지사 |
3rd row | 부산지사 |
4th row | 부산지사 |
5th row | 부산지사 |
Common Values
Value | Count | Frequency (%) |
목포지사 | 777 | |
인천지사 | 426 | |
통영지사 | 408 | |
완도지사 | 390 | |
여수지사 | 258 | 7.9% |
보령지사 | 210 | 6.5% |
포항지사 | 196 | 6.0% |
동해지사 | 151 | 4.6% |
제주지사 | 150 | 4.6% |
고흥지사 | 139 | 4.3% |
Other values (2) | 149 | 4.6% |
Length
Value | Count | Frequency (%) |
목포지사 | 777 | |
인천지사 | 426 | |
통영지사 | 408 | |
완도지사 | 390 | |
여수지사 | 258 | 7.9% |
보령지사 | 210 | 6.5% |
포항지사 | 196 | 6.0% |
동해지사 | 151 | 4.6% |
제주지사 | 150 | 4.6% |
고흥지사 | 139 | 4.3% |
Other values (2) | 149 | 4.6% |
항로명
Text
Distinct | 126 |
---|---|
Distinct (%) | 3.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.6 KiB |
Value | Count | Frequency (%) |
울릉(사동)-독도 | 50 | 1.5% |
울릉(도동)-독도 | 45 | 1.4% |
하리-서검 | 32 | 1.0% |
통영-삼천포 | 31 | 1.0% |
제주-우수영 | 30 | 0.9% |
당목-일정 | 30 | 0.9% |
울릉(저동)-독도 | 30 | 0.9% |
완도-덕우 | 30 | 0.9% |
완도-여서 | 30 | 0.9% |
완도-모도 | 30 | 0.9% |
Other values (117) | 2919 |
Most occurring characters
Value | Count | Frequency (%) |
- | 3284 | 17.3% |
도 | 1274 | 6.7% |
목 | 655 | 3.5% |
포 | 604 | 3.2% |
동 | 547 | 2.9% |
( | 434 | 2.3% |
) | 434 | 2.3% |
천 | 408 | 2.2% |
릉 | 374 | 2.0% |
울 | 374 | 2.0% |
Other values (136) | 10551 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 14781 | |
Dash Punctuation | 3284 | 17.3% |
Open Punctuation | 434 | 2.3% |
Close Punctuation | 434 | 2.3% |
Space Separator | 3 | < 0.1% |
Decimal Number | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
도 | 1274 | 8.6% |
목 | 655 | 4.4% |
포 | 604 | 4.1% |
동 | 547 | 3.7% |
천 | 408 | 2.8% |
릉 | 374 | 2.5% |
울 | 374 | 2.5% |
산 | 340 | 2.3% |
주 | 281 | 1.9% |
영 | 261 | 1.8% |
Other values (131) | 9663 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3284 |
Open Punctuation
Value | Count | Frequency (%) |
( | 434 |
Close Punctuation
Value | Count | Frequency (%) |
) | 434 |
Space Separator
Value | Count | Frequency (%) |
3 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 14781 | |
Common | 4158 | 22.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
도 | 1274 | 8.6% |
목 | 655 | 4.4% |
포 | 604 | 4.1% |
동 | 547 | 3.7% |
천 | 408 | 2.8% |
릉 | 374 | 2.5% |
울 | 374 | 2.5% |
산 | 340 | 2.3% |
주 | 281 | 1.9% |
영 | 261 | 1.8% |
Other values (131) | 9663 |
Common
Value | Count | Frequency (%) |
- | 3284 | |
( | 434 | 10.4% |
) | 434 | 10.4% |
3 | 0.1% | |
1 | 3 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 14781 | |
ASCII | 4158 | 22.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 3284 | |
( | 434 | 10.4% |
) | 434 | 10.4% |
3 | 0.1% | |
1 | 3 | 0.1% |
Hangul
Value | Count | Frequency (%) |
도 | 1274 | 8.6% |
목 | 655 | 4.4% |
포 | 604 | 4.1% |
동 | 547 | 3.7% |
천 | 408 | 2.8% |
릉 | 374 | 2.5% |
울 | 374 | 2.5% |
산 | 340 | 2.3% |
주 | 281 | 1.9% |
영 | 261 | 1.8% |
Other values (131) | 9663 |
연-월
Categorical
Distinct | 30 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.6 KiB |
2023-05 | 112 |
---|---|
2021-11 | 111 |
2021-12 | 111 |
2021-10 | 111 |
2021-08 | 110 |
Other values (25) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-01 |
---|---|
2nd row | 2021-02 |
3rd row | 2021-03 |
4th row | 2021-04 |
5th row | 2021-05 |
Common Values
Value | Count | Frequency (%) |
2023-05 | 112 | 3.4% |
2021-11 | 111 | 3.4% |
2021-12 | 111 | 3.4% |
2021-10 | 111 | 3.4% |
2021-08 | 110 | 3.4% |
2021-09 | 110 | 3.4% |
2022-01 | 110 | 3.4% |
2021-07 | 110 | 3.4% |
2021-03 | 110 | 3.4% |
2022-04 | 109 | 3.3% |
Other values (20) | 2150 |
Length
Value | Count | Frequency (%) |
2023-05 | 112 | 3.4% |
2021-12 | 111 | 3.4% |
2021-10 | 111 | 3.4% |
2021-11 | 111 | 3.4% |
2022-01 | 110 | 3.4% |
2021-07 | 110 | 3.4% |
2021-03 | 110 | 3.4% |
2021-09 | 110 | 3.4% |
2021-08 | 110 | 3.4% |
2022-04 | 109 | 3.3% |
Other values (20) | 2150 |
합계
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 2807 |
---|---|
Distinct (%) | 86.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9948.6899 |
Minimum | 0 |
---|---|
Maximum | 200042 |
Zeros | 159 |
Zeros (%) | 4.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 28.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 38.3 |
Q1 | 1386.5 |
median | 5157 |
Q3 | 12180 |
95-th percentile | 37332.9 |
Maximum | 200042 |
Range | 200042 |
Interquartile range (IQR) | 10793.5 |
Descriptive statistics
Standard deviation | 13817.847 |
---|---|
Coefficient of variation (CV) | 1.3889113 |
Kurtosis | 26.896662 |
Mean | 9948.6899 |
Median Absolute Deviation (MAD) | 4214 |
Skewness | 3.6654946 |
Sum | 32373037 |
Variance | 1.9093291 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 159 | 4.9% |
243 | 3 | 0.1% |
1197 | 3 | 0.1% |
302 | 3 | 0.1% |
1163 | 3 | 0.1% |
641 | 3 | 0.1% |
281 | 3 | 0.1% |
1345 | 3 | 0.1% |
3686 | 3 | 0.1% |
849 | 3 | 0.1% |
Other values (2797) | 3068 |
Value | Count | Frequency (%) |
0 | 159 | |
7 | 1 | < 0.1% |
21 | 1 | < 0.1% |
32 | 1 | < 0.1% |
37 | 1 | < 0.1% |
39 | 1 | < 0.1% |
60 | 1 | < 0.1% |
66 | 1 | < 0.1% |
93 | 1 | < 0.1% |
97 | 1 | < 0.1% |
Value | Count | Frequency (%) |
200042 | 1 | |
165165 | 1 | |
160182 | 1 | |
121809 | 1 | |
117633 | 1 | |
116978 | 1 | |
112912 | 1 | |
94608 | 1 | |
92283 | 1 | |
78044 | 1 |
일반
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 2694 |
---|---|
Distinct (%) | 82.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7596.9644 |
Minimum | 0 |
---|---|
Maximum | 197878 |
Zeros | 160 |
Zeros (%) | 4.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 28.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 14.95 |
Q1 | 709 |
median | 3163.5 |
Q3 | 8909 |
95-th percentile | 29543.75 |
Maximum | 197878 |
Range | 197878 |
Interquartile range (IQR) | 8200 |
Descriptive statistics
Standard deviation | 12358.899 |
---|---|
Coefficient of variation (CV) | 1.6268208 |
Kurtosis | 42.121802 |
Mean | 7596.9644 |
Median Absolute Deviation (MAD) | 2761 |
Skewness | 4.7196704 |
Sum | 24720522 |
Variance | 1.5274239 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 160 | 4.9% |
365 | 7 | 0.2% |
417 | 6 | 0.2% |
455 | 5 | 0.2% |
901 | 5 | 0.2% |
354 | 4 | 0.1% |
328 | 4 | 0.1% |
603 | 4 | 0.1% |
308 | 4 | 0.1% |
529 | 4 | 0.1% |
Other values (2684) | 3051 |
Value | Count | Frequency (%) |
0 | 160 | |
9 | 2 | 0.1% |
13 | 1 | < 0.1% |
16 | 2 | 0.1% |
17 | 4 | 0.1% |
18 | 3 | 0.1% |
21 | 1 | < 0.1% |
24 | 2 | 0.1% |
25 | 1 | < 0.1% |
26 | 2 | 0.1% |
Value | Count | Frequency (%) |
197878 | 1 | |
162635 | 1 | |
158053 | 1 | |
119249 | 1 | |
116219 | 1 | |
110269 | 1 | |
108141 | 1 | |
94608 | 1 | |
90421 | 1 | |
77056 | 1 |
도서민
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 2082 |
---|---|
Distinct (%) | 64.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2351.7256 |
Minimum | 0 |
---|---|
Maximum | 26674 |
Zeros | 349 |
Zeros (%) | 10.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 28.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 317 |
median | 938 |
Q3 | 3303.25 |
95-th percentile | 8978 |
Maximum | 26674 |
Range | 26674 |
Interquartile range (IQR) | 2986.25 |
Descriptive statistics
Standard deviation | 3431.6021 |
---|---|
Coefficient of variation (CV) | 1.4591848 |
Kurtosis | 11.1729 |
Mean | 2351.7256 |
Median Absolute Deviation (MAD) | 903.5 |
Skewness | 2.9109961 |
Sum | 7652515 |
Variance | 11775893 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 349 | 10.7% |
2 | 10 | 0.3% |
4 | 9 | 0.3% |
11 | 8 | 0.2% |
417 | 7 | 0.2% |
8 | 7 | 0.2% |
382 | 6 | 0.2% |
317 | 6 | 0.2% |
1 | 6 | 0.2% |
47 | 6 | 0.2% |
Other values (2072) | 2840 |
Value | Count | Frequency (%) |
0 | 349 | |
1 | 6 | 0.2% |
2 | 10 | 0.3% |
3 | 5 | 0.2% |
4 | 9 | 0.3% |
5 | 4 | 0.1% |
6 | 6 | 0.2% |
7 | 5 | 0.2% |
8 | 7 | 0.2% |
10 | 5 | 0.2% |
Value | Count | Frequency (%) |
26674 | 1 | |
25627 | 1 | |
25579 | 1 | |
25081 | 1 | |
24572 | 1 | |
24091 | 1 | |
22671 | 1 | |
22654 | 1 | |
22436 | 1 | |
22118 | 1 |
지사명 | 연-월 | 합계 | 일반 | 도서민 | |
---|---|---|---|---|---|
지사명 | 1.000 | 0.000 | 0.319 | 0.339 | 0.404 |
연-월 | 0.000 | 1.000 | 0.153 | 0.170 | 0.000 |
합계 | 0.319 | 0.153 | 1.000 | 0.975 | 0.445 |
일반 | 0.339 | 0.170 | 0.975 | 1.000 | 0.316 |
도서민 | 0.404 | 0.000 | 0.445 | 0.316 | 1.000 |
연-월 | 지사명 | |
---|---|---|
연-월 | 1.000 | 0.000 |
지사명 | 0.000 | 1.000 |
합계 | 일반 | 도서민 | 지사명 | 연-월 | |
---|---|---|---|---|---|
합계 | 1.000 | 0.976 | 0.653 | 0.141 | 0.057 |
일반 | 0.976 | 1.000 | 0.519 | 0.149 | 0.055 |
도서민 | 0.653 | 0.519 | 1.000 | 0.182 | 0.000 |
지사명 | 0.141 | 0.149 | 0.182 | 1.000 | 0.000 |
연-월 | 0.057 | 0.055 | 0.000 | 0.000 | 1.000 |
지사명 | 항로명 | 연-월 | 합계 | 일반 | 도서민 | |
---|---|---|---|---|---|---|
0 | 부산지사 | 부산-제주 | 2021-01 | 1502 | 1502 | 0 |
1 | 부산지사 | 부산-제주 | 2021-02 | 0 | 0 | 0 |
2 | 부산지사 | 부산-제주 | 2021-03 | 1545 | 1545 | 0 |
3 | 부산지사 | 부산-제주 | 2021-04 | 1749 | 1749 | 0 |
4 | 부산지사 | 부산-제주 | 2021-05 | 2326 | 2326 | 0 |
5 | 부산지사 | 부산-제주 | 2021-06 | 2475 | 2475 | 0 |
6 | 부산지사 | 부산-제주 | 2021-07 | 3145 | 3145 | 0 |
7 | 부산지사 | 부산-제주 | 2021-08 | 3141 | 3141 | 0 |
8 | 부산지사 | 부산-제주 | 2021-09 | 2112 | 2112 | 0 |
9 | 부산지사 | 부산-제주 | 2021-10 | 2677 | 2677 | 0 |
지사명 | 항로명 | 연-월 | 합계 | 일반 | 도서민 | |
---|---|---|---|---|---|---|
3244 | 고흥지사 | 녹동-제주 | 2022-09 | 14160 | 14160 | 0 |
3245 | 고흥지사 | 녹동-제주 | 2022-10 | 18535 | 18535 | 0 |
3246 | 고흥지사 | 녹동-제주 | 2022-11 | 15518 | 15518 | 0 |
3247 | 고흥지사 | 녹동-제주 | 2022-12 | 11256 | 11256 | 0 |
3248 | 고흥지사 | 녹동-제주 | 2023-01 | 13247 | 13247 | 0 |
3249 | 고흥지사 | 녹동-제주 | 2023-02 | 11974 | 11974 | 0 |
3250 | 고흥지사 | 녹동-제주 | 2023-03 | 11422 | 11422 | 0 |
3251 | 고흥지사 | 녹동-제주 | 2023-04 | 12315 | 12315 | 0 |
3252 | 고흥지사 | 녹동-제주 | 2023-05 | 14457 | 14457 | 0 |
3253 | 고흥지사 | 녹동-제주 | 2023-06 | 4917 | 4917 | 0 |