Overview

Dataset statistics

Number of variables1
Number of observations3648
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory28.6 KiB
Average record size in memory8.0 B

Variable types

Categorical1

Dataset

Description2011년 대구 지역 버스전용차로 지점교통량
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=3074818&dataSetDetailId=30748181ebdf1dd41763&provdMethod=FILE

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
2a가3나 is highly imbalanced (99.6%)Imbalance

Reproduction

Analysis started2024-04-21 19:50:17.993611
Analysis finished2024-04-21 19:50:18.394892
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

2a가3나
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size28.6 KiB
52
3647 
지점교통량DB_ID
 
1

Length

Max length10
Median length2
Mean length2.002193
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row지점교통량DB_ID
2nd row52
3rd row52
4th row52
5th row52

Common Values

ValueCountFrequency (%)
52 3647
> 99.9%
지점교통량DB_ID 1
 
< 0.1%

Length

2024-04-22T04:50:18.531433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T04:50:18.723309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
52 3647
> 99.9%
지점교통량db_id 1
 
< 0.1%

Missing values

2024-04-22T04:50:18.353572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

2a가3나
0지점교통량DB_ID
152
252
352
452
552
652
752
852
952
2a가3나
363852
363952
364052
364152
364252
364352
364452
364552
364652
364752

Duplicate rows

Most frequently occurring

2a가3나# duplicates
0523647