Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 123 |
Missing cells (%) | 0.2% |
Duplicate rows | 390 |
Duplicate rows (%) | 3.9% |
Total size in memory | 664.1 KiB |
Average record size in memory | 68.0 B |
Variable types
Categorical | 2 |
---|---|
Unsupported | 1 |
Text | 1 |
Numeric | 3 |
Dataset
Description | 광주광역시 내 시내버스 승하차 인원정보에 대한 데이터로 일자별, 노선명, 정류장명, 시간별, 승하차별 거래건수를 제공합니다. |
---|---|
Author | 광주광역시 |
URL | https://www.data.go.kr/data/15088456/fileData.do |
일자 has constant value "" | Constant |
Dataset has 390 (3.9%) duplicate rows | Duplicates |
노선명 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-13 12:35:17.963765 |
---|---|
Analysis finished | 2024-04-13 12:35:24.170909 |
Duration | 6.21 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
20240301 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20240301 |
---|---|
2nd row | 20240301 |
3rd row | 20240301 |
4th row | 20240301 |
5th row | 20240301 |
Common Values
Value | Count | Frequency (%) |
20240301 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20240301 | 10000 |
노선명
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
정류장명
Text
Distinct | 922 |
---|---|
Distinct (%) | 9.3% |
Missing | 61 |
Missing (%) | 0.6% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
광주종합버스터미널 | 152 | 1.5% |
경신여고 | 108 | 1.1% |
국립아시아문화전당(구.도청 | 88 | 0.9% |
광천치안센터 | 84 | 0.8% |
남광주역 | 75 | 0.7% |
도로교통공단 | 71 | 0.7% |
대신파크 | 71 | 0.7% |
살레시오여고 | 70 | 0.7% |
운암3단지 | 68 | 0.7% |
진월대주아파트 | 65 | 0.6% |
Other values (918) | 9195 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 1625 | 2.8% |
주 | 1567 | 2.7% |
광 | 1504 | 2.6% |
동 | 1480 | 2.5% |
대 | 1402 | 2.4% |
파 | 1322 | 2.3% |
구 | 1201 | 2.0% |
트 | 1159 | 2.0% |
남 | 1080 | 1.8% |
교 | 1005 | 1.7% |
Other values (363) | 45250 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 54874 | |
Decimal Number | 1137 | 1.9% |
Close Punctuation | 913 | 1.6% |
Open Punctuation | 913 | 1.6% |
Other Punctuation | 342 | 0.6% |
Uppercase Letter | 278 | 0.5% |
Space Separator | 108 | 0.2% |
Dash Punctuation | 30 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 1625 | 3.0% |
주 | 1567 | 2.9% |
광 | 1504 | 2.7% |
동 | 1480 | 2.7% |
대 | 1402 | 2.6% |
파 | 1322 | 2.4% |
구 | 1201 | 2.2% |
트 | 1159 | 2.1% |
남 | 1080 | 2.0% |
교 | 1005 | 1.8% |
Other values (335) | 41529 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 83 | |
C | 72 | |
I | 29 | 10.4% |
B | 29 | 10.4% |
K | 24 | 8.6% |
G | 18 | 6.5% |
T | 13 | 4.7% |
N | 2 | 0.7% |
D | 2 | 0.7% |
L | 2 | 0.7% |
Other values (3) | 4 | 1.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 296 | |
2 | 284 | |
3 | 182 | |
4 | 122 | |
5 | 101 | 8.9% |
9 | 66 | 5.8% |
6 | 43 | 3.8% |
8 | 43 | 3.8% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 193 | |
. | 137 | |
& | 12 | 3.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 913 |
Open Punctuation
Value | Count | Frequency (%) |
( | 913 |
Space Separator
Value | Count | Frequency (%) |
108 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 30 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 54874 | |
Common | 3443 | 5.9% |
Latin | 278 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 1625 | 3.0% |
주 | 1567 | 2.9% |
광 | 1504 | 2.7% |
동 | 1480 | 2.7% |
대 | 1402 | 2.6% |
파 | 1322 | 2.4% |
구 | 1201 | 2.2% |
트 | 1159 | 2.1% |
남 | 1080 | 2.0% |
교 | 1005 | 1.8% |
Other values (335) | 41529 |
Common
Value | Count | Frequency (%) |
) | 913 | |
( | 913 | |
1 | 296 | 8.6% |
2 | 284 | 8.2% |
/ | 193 | 5.6% |
3 | 182 | 5.3% |
. | 137 | 4.0% |
4 | 122 | 3.5% |
108 | 3.1% | |
5 | 101 | 2.9% |
Other values (5) | 194 | 5.6% |
Latin
Value | Count | Frequency (%) |
S | 83 | |
C | 72 | |
I | 29 | 10.4% |
B | 29 | 10.4% |
K | 24 | 8.6% |
G | 18 | 6.5% |
T | 13 | 4.7% |
N | 2 | 0.7% |
D | 2 | 0.7% |
L | 2 | 0.7% |
Other values (3) | 4 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 54874 | |
ASCII | 3721 | 6.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 1625 | 3.0% |
주 | 1567 | 2.9% |
광 | 1504 | 2.7% |
동 | 1480 | 2.7% |
대 | 1402 | 2.6% |
파 | 1322 | 2.4% |
구 | 1201 | 2.2% |
트 | 1159 | 2.1% |
남 | 1080 | 2.0% |
교 | 1005 | 1.8% |
Other values (335) | 41529 |
ASCII
Value | Count | Frequency (%) |
) | 913 | |
( | 913 | |
1 | 296 | 8.0% |
2 | 284 | 7.6% |
/ | 193 | 5.2% |
3 | 182 | 4.9% |
. | 137 | 3.7% |
4 | 122 | 3.3% |
108 | 2.9% | |
5 | 101 | 2.7% |
Other values (18) | 472 |
ARS_ID
Real number (ℝ)
Distinct | 1504 |
---|---|
Distinct (%) | 15.1% |
Missing | 62 |
Missing (%) | 0.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3480.9936 |
Minimum | 1002 |
---|---|
Maximum | 6634 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1002 |
---|---|
5-th percentile | 1071.55 |
Q1 | 2164 |
median | 4077 |
Q3 | 4533.75 |
95-th percentile | 5580.45 |
Maximum | 6634 |
Range | 5632 |
Interquartile range (IQR) | 2369.75 |
Descriptive statistics
Standard deviation | 1508.5589 |
---|---|
Coefficient of variation (CV) | 0.43337021 |
Kurtosis | -1.2335517 |
Mean | 3480.9936 |
Median Absolute Deviation (MAD) | 1185 |
Skewness | -0.1856791 |
Sum | 34594114 |
Variance | 2275749.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2002 | 100 | 1.0% |
4435 | 55 | 0.5% |
4434 | 53 | 0.5% |
2001 | 52 | 0.5% |
1122 | 51 | 0.5% |
1130 | 48 | 0.5% |
2003 | 48 | 0.5% |
1141 | 42 | 0.4% |
1017 | 38 | 0.4% |
3236 | 38 | 0.4% |
Other values (1494) | 9413 | |
(Missing) | 62 | 0.6% |
Value | Count | Frequency (%) |
1002 | 6 | 0.1% |
1003 | 16 | |
1004 | 17 | |
1005 | 9 | |
1006 | 12 | |
1007 | 21 | |
1008 | 16 | |
1009 | 10 | |
1010 | 15 | |
1011 | 1 | < 0.1% |
Value | Count | Frequency (%) |
6634 | 1 | < 0.1% |
6633 | 1 | < 0.1% |
6626 | 1 | < 0.1% |
6625 | 4 | |
6622 | 2 | |
6621 | 2 | |
6619 | 3 | |
6612 | 1 | < 0.1% |
6611 | 1 | < 0.1% |
6468 | 1 | < 0.1% |
시간
Real number (ℝ)
Distinct | 19 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.1921 |
Minimum | 5 |
---|---|
Maximum | 23 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 7 |
Q1 | 11 |
median | 14 |
Q3 | 18 |
95-th percentile | 21 |
Maximum | 23 |
Range | 18 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 4.5286034 |
---|---|
Coefficient of variation (CV) | 0.31909325 |
Kurtosis | -1.0201796 |
Mean | 14.1921 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.040097755 |
Sum | 141921 |
Variance | 20.508248 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16 | 735 | 7.3% |
17 | 720 | 7.2% |
15 | 704 | 7.0% |
13 | 691 | 6.9% |
18 | 671 | 6.7% |
14 | 661 | 6.6% |
12 | 648 | 6.5% |
11 | 625 | 6.2% |
9 | 592 | 5.9% |
10 | 578 | 5.8% |
Other values (9) | 3375 |
Value | Count | Frequency (%) |
5 | 46 | 0.5% |
6 | 309 | |
7 | 415 | |
8 | 559 | |
9 | 592 | |
10 | 578 | |
11 | 625 | |
12 | 648 | |
13 | 691 | |
14 | 661 |
Value | Count | Frequency (%) |
23 | 54 | 0.5% |
22 | 395 | |
21 | 507 | |
20 | 537 | |
19 | 553 | |
18 | 671 | |
17 | 720 | |
16 | 735 | |
15 | 704 | |
14 | 661 |
승하차
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
승차 | |
---|---|
하차 | |
환승 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 하차 |
---|---|
2nd row | 승차 |
3rd row | 승차 |
4th row | 승차 |
5th row | 하차 |
Common Values
Value | Count | Frequency (%) |
승차 | 4954 | |
하차 | 3540 | |
환승 | 1506 | 15.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
승차 | 4954 | |
하차 | 3540 | |
환승 | 1506 | 15.1% |
거래건수
Real number (ℝ)
Distinct | 45 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.5537 |
Minimum | 1 |
---|---|
Maximum | 112 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 3 |
95-th percentile | 8 |
Maximum | 112 |
Range | 111 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 3.3723662 |
---|---|
Coefficient of variation (CV) | 1.3205804 |
Kurtosis | 156.32076 |
Mean | 2.5537 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.3161765 |
Sum | 25537 |
Variance | 11.372854 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 5092 | |
2 | 2034 | 20.3% |
3 | 1040 | 10.4% |
4 | 590 | 5.9% |
5 | 349 | 3.5% |
6 | 218 | 2.2% |
7 | 167 | 1.7% |
8 | 115 | 1.1% |
9 | 85 | 0.9% |
10 | 62 | 0.6% |
Other values (35) | 248 | 2.5% |
Value | Count | Frequency (%) |
1 | 5092 | |
2 | 2034 | 20.3% |
3 | 1040 | 10.4% |
4 | 590 | 5.9% |
5 | 349 | 3.5% |
6 | 218 | 2.2% |
7 | 167 | 1.7% |
8 | 115 | 1.1% |
9 | 85 | 0.9% |
10 | 62 | 0.6% |
Value | Count | Frequency (%) |
112 | 1 | |
62 | 1 | |
55 | 1 | |
48 | 1 | |
47 | 1 | |
46 | 1 | |
45 | 1 | |
43 | 1 | |
42 | 1 | |
40 | 1 |
ARS_ID | 시간 | 승하차 | 거래건수 | |
---|---|---|---|---|
ARS_ID | 1.000 | 0.033 | 0.109 | 0.061 |
시간 | 0.033 | 1.000 | 0.065 | 0.051 |
승하차 | 0.109 | 0.065 | 1.000 | 0.082 |
거래건수 | 0.061 | 0.051 | 0.082 | 1.000 |
ARS_ID | 시간 | 거래건수 | 승하차 | |
---|---|---|---|---|
ARS_ID | 1.000 | 0.007 | -0.031 | 0.064 |
시간 | 0.007 | 1.000 | 0.023 | 0.038 |
거래건수 | -0.031 | 0.023 | 1.000 | 0.054 |
승하차 | 0.064 | 0.038 | 0.054 | 1.000 |
일자 | 노선명 | 정류장명 | ARS_ID | 시간 | 승하차 | 거래건수 | |
---|---|---|---|---|---|---|---|
89655 | 20240301 | 진월07 | 진월대주아파트 | 3242 | 14 | 하차 | 10 |
58225 | 20240301 | 수완12 | 대성초교 | 3138 | 18 | 승차 | 2 |
32035 | 20240301 | 봉선27 | 운암시장 | 4449 | 21 | 승차 | 7 |
62130 | 20240301 | 순환01(운천저수지) | 금호초교 | 2076 | 13 | 승차 | 1 |
33225 | 20240301 | 봉선37 | 남광주역 | 1142 | 22 | 하차 | 2 |
53172 | 20240301 | 송정98 | 광주여대 | 5274 | 20 | 하차 | 1 |
25934 | 20240301 | 문흥18 | 신가동 | 5303 | 15 | 승차 | 4 |
79104 | 20240301 | 좌석02 | 도산역 | 5008 | 9 | 승차 | 3 |
36878 | 20240301 | 상무64 | 비엔날레전시관 | 4597 | 19 | 승차 | 1 |
21464 | 20240301 | 매월16 | 현대자동차 | 2008 | 17 | 환승 | 6 |
일자 | 노선명 | 정류장명 | ARS_ID | 시간 | 승하차 | 거래건수 | |
---|---|---|---|---|---|---|---|
7995 | 20240301 | 금호36 | 각화무등파크 | 4173 | 18 | 하차 | 1 |
67060 | 20240301 | 운림50 | 방림삼거리 | 3084 | 20 | 하차 | 1 |
7825 | 20240301 | 금남59 | 화정남초교 | 2315 | 21 | 승차 | 1 |
36693 | 20240301 | 상무64 | 광주종합버스터미널 | 2002 | 16 | 환승 | 2 |
16258 | 20240301 | 마을760 | 돌고개역(동) | 2164 | 16 | 승차 | 1 |
85624 | 20240301 | 지원45 | 풍암저수지 | 2264 | 21 | 승차 | 1 |
4098 | 20240301 | 금남55 | 북구청 | 4553 | 16 | 승차 | 3 |
55134 | 20240301 | 송정98 | 월곡일신아파트 | 5314 | 15 | 환승 | 1 |
77559 | 20240301 | 일곡38 | 요한병원 | 4526 | 20 | 승차 | 1 |
89911 | 20240301 | 진월07 | <NA> | <NA> | 8 | 환승 | 1 |
Most frequently occurring
일자 | 정류장명 | ARS_ID | 시간 | 승하차 | 거래건수 | # duplicates | |
---|---|---|---|---|---|---|---|
171 | 20240301 | 문화전당역 | 1130 | 17 | 환승 | 1 | 4 |
3 | 20240301 | CBS방송국 | 2013 | 18 | 승차 | 2 | 3 |
13 | 20240301 | 경신여고 | 4434 | 16 | 하차 | 2 | 3 |
59 | 20240301 | 광주종합버스터미널 | 2002 | 7 | 하차 | 1 | 3 |
61 | 20240301 | 광주종합버스터미널 | 2002 | 9 | 환승 | 2 | 3 |
62 | 20240301 | 광주종합버스터미널 | 2002 | 12 | 승차 | 8 | 3 |
66 | 20240301 | 광주종합버스터미널 | 2002 | 16 | 환승 | 2 | 3 |
70 | 20240301 | 광주지방기상청 | 4447 | 22 | 하차 | 1 | 3 |
81 | 20240301 | 국립아시아문화전당(구.도청) | 1123 | 17 | 하차 | 1 | 3 |
96 | 20240301 | 남광주사거리 | 1139 | 17 | 환승 | 3 | 3 |