Amino acid dipeptide frequency for Podoviridae sp. ctda_1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
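The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: it assumes overlapping dipeptides are counted, that counts are pooled over all proteins, and that "expected" means the uniform value of 2.5‰. The names `dipeptide_permille`, `classify`, `STANDARD` and `EXPECTED` are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED = 1000.0 / len(STANDARD) ** 2  # 2.5


def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies in per mille, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freqs):
    """Label each standard dipeptide relative to the uniform expectation."""
    labels = {}
    for a, b in product(STANDARD, repeat=2):
        f = freqs.get(a + b, 0.0)
        if f > EXPECTED:
            labels[a + b] = "over"    # would be shown in red
        elif f < EXPECTED:
            labels[a + b] = "under"   # would be shown in blue
        else:
            labels[a + b] = "equal"
    return labels
```

For example, `classify(dipeptide_permille(["MKVLAAGA", "MAAK"]))` returns a dictionary with 400 labels. A comparison against an expectation derived from the single amino acid composition would be an equally plausible design; the uniform baseline is used here because it is the one the text mentions.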

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.855 ± 1.408
AlaCys: 0.802 ± 0.263
AlaAsp: 5.475 ± 0.694
AlaGlu: 6.419 ± 0.55
AlaPhe: 3.351 ± 0.38
AlaGly: 7.41 ± 0.91
AlaHis: 1.652 ± 0.295
AlaIle: 3.917 ± 0.56
AlaLys: 5.522 ± 0.74
AlaLeu: 8.354 ± 1.316
AlaMet: 3.021 ± 0.587
AlaAsn: 4.153 ± 0.59
AlaPro: 4.295 ± 0.534
AlaGln: 4.578 ± 0.873
AlaArg: 4.72 ± 0.502
AlaSer: 4.767 ± 0.453
AlaThr: 5.475 ± 0.512
AlaVal: 7.221 ± 0.714
AlaTrp: 0.944 ± 0.217
AlaTyr: 2.973 ± 0.251
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.897 ± 0.249
CysCys: 0.378 ± 0.23
CysAsp: 0.425 ± 0.182
CysGlu: 0.85 ± 0.276
CysPhe: 0.472 ± 0.172
CysGly: 0.802 ± 0.227
CysHis: 0.189 ± 0.086
CysIle: 0.425 ± 0.146
CysLys: 0.472 ± 0.16
CysLeu: 0.566 ± 0.224
CysMet: 0.094 ± 0.077
CysAsn: 0.189 ± 0.087
CysPro: 0.33 ± 0.132
CysGln: 0.189 ± 0.128
CysArg: 0.33 ± 0.134
CysSer: 0.661 ± 0.217
CysThr: 0.661 ± 0.191
CysVal: 0.472 ± 0.152
CysTrp: 0.189 ± 0.123
CysTyr: 0.236 ± 0.109
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.23 ± 0.801
AspCys: 0.755 ± 0.244
AspAsp: 3.776 ± 0.512
AspGlu: 3.917 ± 0.475
AspPhe: 2.265 ± 0.304
AspGly: 5.286 ± 0.416
AspHis: 1.369 ± 0.282
AspIle: 3.021 ± 0.396
AspLys: 3.021 ± 0.406
AspLeu: 5.239 ± 0.797
AspMet: 1.51 ± 0.223
AspAsn: 2.643 ± 0.42
AspPro: 3.398 ± 0.39
AspGln: 2.879 ± 0.393
AspArg: 3.587 ± 0.413
AspSer: 3.351 ± 0.452
AspThr: 3.304 ± 0.348
AspVal: 4.531 ± 0.485
AspTrp: 0.802 ± 0.194
AspTyr: 1.888 ± 0.369
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.457 ± 0.913
GluCys: 0.425 ± 0.142
GluAsp: 3.021 ± 0.421
GluGlu: 4.531 ± 0.518
GluPhe: 3.021 ± 0.457
GluGly: 4.2 ± 0.367
GluHis: 1.652 ± 0.361
GluIle: 3.776 ± 0.42
GluLys: 3.115 ± 0.386
GluLeu: 5.239 ± 0.424
GluMet: 1.888 ± 0.295
GluAsn: 1.557 ± 0.247
GluPro: 2.265 ± 0.299
GluGln: 3.021 ± 0.411
GluArg: 3.257 ± 0.353
GluSer: 3.021 ± 0.415
GluThr: 3.115 ± 0.297
GluVal: 5.333 ± 0.49
GluTrp: 0.802 ± 0.195
GluTyr: 1.699 ± 0.385
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.785 ± 0.357
PheCys: 0.472 ± 0.214
PheAsp: 2.454 ± 0.332
PheGlu: 1.746 ± 0.272
PhePhe: 0.944 ± 0.243
PheGly: 3.209 ± 0.338
PheHis: 0.425 ± 0.163
PheIle: 1.982 ± 0.328
PheLys: 1.982 ± 0.216
PheLeu: 2.454 ± 0.353
PheMet: 1.557 ± 0.271
PheAsn: 2.077 ± 0.333
PhePro: 0.991 ± 0.213
PheGln: 1.699 ± 0.26
PheArg: 2.454 ± 0.316
PheSer: 1.793 ± 0.289
PheThr: 2.643 ± 0.455
PheVal: 2.596 ± 0.418
PheTrp: 0.236 ± 0.107
PheTyr: 0.897 ± 0.22
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.23 ± 0.865
GlyCys: 0.708 ± 0.257
GlyAsp: 3.681 ± 0.448
GlyGlu: 4.814 ± 0.451
GlyPhe: 3.021 ± 0.445
GlyGly: 4.625 ± 0.683
GlyHis: 1.746 ± 0.341
GlyIle: 4.106 ± 0.408
GlyLys: 3.917 ± 0.695
GlyLeu: 5.664 ± 0.512
GlyMet: 2.737 ± 0.318
GlyAsn: 4.012 ± 0.406
GlyPro: 1.888 ± 0.381
GlyGln: 3.398 ± 0.376
GlyArg: 4.861 ± 0.643
GlySer: 5.522 ± 0.758
GlyThr: 4.672 ± 0.474
GlyVal: 5.758 ± 0.536
GlyTrp: 1.274 ± 0.309
GlyTyr: 2.218 ± 0.346
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.322 ± 0.247
HisCys: 0.142 ± 0.085
HisAsp: 1.557 ± 0.28
HisGlu: 1.274 ± 0.291
HisPhe: 0.897 ± 0.219
HisGly: 2.313 ± 0.475
HisHis: 0.614 ± 0.226
HisIle: 0.944 ± 0.205
HisLys: 1.18 ± 0.257
HisLeu: 1.982 ± 0.347
HisMet: 0.566 ± 0.129
HisAsn: 0.708 ± 0.159
HisPro: 1.038 ± 0.3
HisGln: 0.944 ± 0.26
HisArg: 1.746 ± 0.335
HisSer: 0.944 ± 0.238
HisThr: 0.897 ± 0.174
HisVal: 1.51 ± 0.343
HisTrp: 0.283 ± 0.118
HisTyr: 0.802 ± 0.205
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.295 ± 0.621
IleCys: 0.519 ± 0.162
IleAsp: 3.776 ± 0.303
IleGlu: 3.965 ± 0.378
IlePhe: 1.888 ± 0.345
IleGly: 4.012 ± 0.464
IleHis: 0.897 ± 0.22
IleIle: 2.737 ± 0.49
IleLys: 2.596 ± 0.473
IleLeu: 3.021 ± 0.469
IleMet: 1.086 ± 0.195
IleAsn: 2.36 ± 0.31
IlePro: 2.454 ± 0.351
IleGln: 1.982 ± 0.295
IleArg: 3.021 ± 0.365
IleSer: 3.445 ± 0.326
IleThr: 2.832 ± 0.375
IleVal: 3.776 ± 0.398
IleTrp: 0.472 ± 0.226
IleTyr: 1.652 ± 0.34
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.664 ± 0.599
LysCys: 0.094 ± 0.064
LysAsp: 3.068 ± 0.382
LysGlu: 3.398 ± 0.394
LysPhe: 1.652 ± 0.303
LysGly: 2.69 ± 0.396
LysHis: 0.991 ± 0.259
LysIle: 2.785 ± 0.405
LysLys: 4.153 ± 0.598
LysLeu: 4.767 ± 0.582
LysMet: 1.793 ± 0.291
LysAsn: 2.407 ± 0.356
LysPro: 2.596 ± 0.501
LysGln: 2.407 ± 0.319
LysArg: 2.407 ± 0.411
LysSer: 3.257 ± 0.318
LysThr: 3.257 ± 0.494
LysVal: 4.295 ± 0.508
LysTrp: 1.086 ± 0.325
LysTyr: 1.699 ± 0.222
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.118 ± 1.049
LeuCys: 0.519 ± 0.169
LeuAsp: 5.852 ± 0.518
LeuGlu: 4.295 ± 0.458
LeuPhe: 2.643 ± 0.379
LeuGly: 6.277 ± 0.619
LeuHis: 1.888 ± 0.231
LeuIle: 3.729 ± 0.437
LeuLys: 4.389 ± 0.478
LeuLeu: 6.136 ± 0.73
LeuMet: 3.021 ± 0.344
LeuAsn: 3.729 ± 0.414
LeuPro: 4.295 ± 0.575
LeuGln: 3.304 ± 0.431
LeuArg: 4.436 ± 0.464
LeuSer: 5.239 ± 0.595
LeuThr: 5.9 ± 0.514
LeuVal: 5.192 ± 0.459
LeuTrp: 0.614 ± 0.155
LeuTyr: 1.935 ± 0.274
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.879 ± 0.388
MetCys: 0.142 ± 0.086
MetAsp: 1.699 ± 0.243
MetGlu: 2.265 ± 0.298
MetPhe: 0.991 ± 0.182
MetGly: 1.746 ± 0.31
MetHis: 0.519 ± 0.143
MetIle: 1.369 ± 0.244
MetLys: 1.699 ± 0.261
MetLeu: 2.596 ± 0.402
MetMet: 0.661 ± 0.196
MetAsn: 1.463 ± 0.232
MetPro: 1.369 ± 0.211
MetGln: 1.463 ± 0.289
MetArg: 1.888 ± 0.306
MetSer: 2.029 ± 0.334
MetThr: 2.124 ± 0.401
MetVal: 1.935 ± 0.438
MetTrp: 0.189 ± 0.089
MetTyr: 0.472 ± 0.144
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.72 ± 0.668
AsnCys: 0.33 ± 0.134
AsnAsp: 2.832 ± 0.583
AsnGlu: 2.69 ± 0.318
AsnPhe: 1.51 ± 0.209
AsnGly: 3.115 ± 0.327
AsnHis: 1.133 ± 0.209
AsnIle: 1.605 ± 0.264
AsnLys: 2.313 ± 0.378
AsnLeu: 3.917 ± 0.546
AsnMet: 1.416 ± 0.24
AsnAsn: 1.793 ± 0.255
AsnPro: 2.832 ± 0.384
AsnGln: 2.501 ± 0.385
AsnArg: 2.596 ± 0.286
AsnSer: 2.454 ± 0.291
AsnThr: 2.785 ± 0.346
AsnVal: 3.115 ± 0.382
AsnTrp: 0.708 ± 0.205
AsnTyr: 1.322 ± 0.233
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.106 ± 0.407
ProCys: 0.283 ± 0.114
ProAsp: 2.407 ± 0.292
ProGlu: 3.257 ± 0.356
ProPhe: 1.699 ± 0.304
ProGly: 3.068 ± 0.477
ProHis: 0.802 ± 0.208
ProIle: 2.029 ± 0.372
ProLys: 2.218 ± 0.315
ProLeu: 3.587 ± 0.391
ProMet: 1.557 ± 0.324
ProAsn: 1.699 ± 0.299
ProPro: 1.416 ± 0.345
ProGln: 2.313 ± 0.429
ProArg: 1.463 ± 0.281
ProSer: 2.926 ± 0.47
ProThr: 3.445 ± 0.386
ProVal: 3.776 ± 0.558
ProTrp: 0.472 ± 0.148
ProTyr: 1.18 ± 0.329
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.389 ± 0.706
GlnCys: 0.189 ± 0.101
GlnAsp: 2.313 ± 0.258
GlnGlu: 1.935 ± 0.286
GlnPhe: 1.652 ± 0.262
GlnGly: 3.115 ± 0.41
GlnHis: 1.18 ± 0.237
GlnIle: 2.596 ± 0.26
GlnLys: 1.841 ± 0.31
GlnLeu: 4.861 ± 0.44
GlnMet: 1.322 ± 0.236
GlnAsn: 2.265 ± 0.505
GlnPro: 1.746 ± 0.309
GlnGln: 2.69 ± 0.48
GlnArg: 2.501 ± 0.453
GlnSer: 2.407 ± 0.381
GlnThr: 2.785 ± 0.499
GlnVal: 2.785 ± 0.372
GlnTrp: 0.85 ± 0.225
GlnTyr: 1.557 ± 0.226
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.861 ± 0.482
ArgCys: 0.85 ± 0.245
ArgAsp: 2.926 ± 0.463
ArgGlu: 3.115 ± 0.355
ArgPhe: 1.557 ± 0.228
ArgGly: 3.917 ± 0.506
ArgHis: 1.322 ± 0.301
ArgIle: 2.879 ± 0.373
ArgLys: 2.879 ± 0.369
ArgLeu: 5.239 ± 0.553
ArgMet: 1.086 ± 0.224
ArgAsn: 2.785 ± 0.366
ArgPro: 2.313 ± 0.395
ArgGln: 2.407 ± 0.296
ArgArg: 2.549 ± 0.35
ArgSer: 4.153 ± 0.632
ArgThr: 3.115 ± 0.389
ArgVal: 2.879 ± 0.341
ArgTrp: 0.85 ± 0.206
ArgTyr: 2.124 ± 0.33
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.286 ± 0.704
SerCys: 0.802 ± 0.245
SerAsp: 4.012 ± 0.478
SerGlu: 3.162 ± 0.269
SerPhe: 1.793 ± 0.217
SerGly: 5.05 ± 0.493
SerHis: 1.038 ± 0.196
SerIle: 3.351 ± 0.426
SerLys: 3.115 ± 0.382
SerLeu: 4.436 ± 0.819
SerMet: 1.888 ± 0.392
SerAsn: 3.021 ± 0.403
SerPro: 2.407 ± 0.402
SerGln: 1.652 ± 0.293
SerArg: 3.398 ± 0.307
SerSer: 4.2 ± 0.645
SerThr: 3.729 ± 0.576
SerVal: 4.814 ± 0.686
SerTrp: 0.661 ± 0.147
SerTyr: 2.171 ± 0.293
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.286 ± 0.645
ThrCys: 0.661 ± 0.196
ThrAsp: 4.861 ± 0.449
ThrGlu: 3.634 ± 0.357
ThrPhe: 2.501 ± 0.342
ThrGly: 4.908 ± 0.589
ThrHis: 1.605 ± 0.269
ThrIle: 4.012 ± 0.427
ThrLys: 3.54 ± 0.458
ThrLeu: 4.342 ± 0.397
ThrMet: 1.227 ± 0.244
ThrAsn: 2.926 ± 0.446
ThrPro: 2.69 ± 0.239
ThrGln: 2.501 ± 0.34
ThrArg: 2.454 ± 0.365
ThrSer: 3.209 ± 0.381
ThrThr: 3.493 ± 0.431
ThrVal: 4.248 ± 0.465
ThrTrp: 0.85 ± 0.195
ThrTyr: 1.463 ± 0.252
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.608 ± 0.746
ValCys: 0.519 ± 0.155
ValAsp: 5.616 ± 0.632
ValGlu: 5.003 ± 0.54
ValPhe: 2.265 ± 0.353
ValGly: 5.9 ± 0.74
ValHis: 1.793 ± 0.306
ValIle: 3.634 ± 0.385
ValLys: 4.2 ± 0.468
ValLeu: 5.616 ± 0.76
ValMet: 1.699 ± 0.246
ValAsn: 3.445 ± 0.484
ValPro: 3.351 ± 0.451
ValGln: 2.926 ± 0.336
ValArg: 4.153 ± 0.504
ValSer: 4.342 ± 0.435
ValThr: 3.87 ± 0.495
ValVal: 5.05 ± 0.906
ValTrp: 0.802 ± 0.185
ValTyr: 1.463 ± 0.28
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.227 ± 0.246
TrpCys: 0.094 ± 0.059
TrpAsp: 0.85 ± 0.229
TrpGlu: 0.755 ± 0.224
TrpPhe: 0.472 ± 0.133
TrpGly: 1.038 ± 0.229
TrpHis: 0.283 ± 0.125
TrpIle: 0.566 ± 0.181
TrpLys: 0.566 ± 0.143
TrpLeu: 1.274 ± 0.273
TrpMet: 0.472 ± 0.149
TrpAsn: 1.086 ± 0.222
TrpPro: 0.33 ± 0.097
TrpGln: 0.661 ± 0.224
TrpArg: 0.755 ± 0.23
TrpSer: 0.614 ± 0.136
TrpThr: 0.519 ± 0.168
TrpVal: 0.802 ± 0.173
TrpTrp: 0.047 ± 0.055
TrpTyr: 0.236 ± 0.102
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.69 ± 0.305
TyrCys: 0.142 ± 0.074
TyrAsp: 2.171 ± 0.309
TyrGlu: 1.369 ± 0.247
TyrPhe: 0.802 ± 0.199
TyrGly: 2.171 ± 0.366
TyrHis: 0.566 ± 0.186
TyrIle: 1.322 ± 0.263
TyrLys: 1.746 ± 0.299
TyrLeu: 2.218 ± 0.41
TyrMet: 0.85 ± 0.176
TyrAsn: 1.369 ± 0.204
TyrPro: 1.699 ± 0.391
TyrGln: 1.463 ± 0.251
TyrArg: 1.274 ± 0.257
TyrSer: 1.652 ± 0.328
TyrThr: 1.793 ± 0.323
TyrVal: 2.171 ± 0.254
TyrTrp: 0.519 ± 0.145
TyrTyr: 0.472 ± 0.128
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 66 proteins (21189 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
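A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the pipeline used for the page: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed on each resample, and that the reported error is the standard deviation of those estimates. The function name `bootstrap_error` and its parameters are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread of one dipeptide frequency (per mille) by
    resampling whole proteins with replacement."""
    rng = random.Random(seed)
    # Count overlapping dipeptides once per protein, then reuse the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across resamples.
    return stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. With only 100 resamples the result still depends noticeably on the random seed.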

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski