Amino acid dipepetide frequency for Gordonia phage Ziko

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.774AlaAla: 8.774 ± 1.257
0.711AlaCys: 0.711 ± 0.205
4.932AlaAsp: 4.932 ± 0.464
5.312AlaGlu: 5.312 ± 0.49
2.656AlaPhe: 2.656 ± 0.324
6.877AlaGly: 6.877 ± 1.028
1.233AlaHis: 1.233 ± 0.297
5.312AlaIle: 5.312 ± 0.555
5.122AlaLys: 5.122 ± 0.582
7.778AlaLeu: 7.778 ± 0.619
2.751AlaMet: 2.751 ± 0.309
4.126AlaAsn: 4.126 ± 0.461
2.371AlaPro: 2.371 ± 0.384
3.842AlaGln: 3.842 ± 0.372
5.075AlaArg: 5.075 ± 0.456
5.217AlaSer: 5.217 ± 0.576
6.071AlaThr: 6.071 ± 0.559
5.075AlaVal: 5.075 ± 0.538
1.186AlaTrp: 1.186 ± 0.245
2.893AlaTyr: 2.893 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
1.328CysAla: 1.328 ± 0.284
0.332CysCys: 0.332 ± 0.138
1.328CysAsp: 1.328 ± 0.274
0.664CysGlu: 0.664 ± 0.211
0.617CysPhe: 0.617 ± 0.157
1.755CysGly: 1.755 ± 0.335
0.379CysHis: 0.379 ± 0.139
0.522CysIle: 0.522 ± 0.181
0.711CysLys: 0.711 ± 0.194
1.043CysLeu: 1.043 ± 0.243
0.142CysMet: 0.142 ± 0.085
0.522CysAsn: 0.522 ± 0.177
0.996CysPro: 0.996 ± 0.248
0.569CysGln: 0.569 ± 0.192
0.711CysArg: 0.711 ± 0.225
1.281CysSer: 1.281 ± 0.309
0.759CysThr: 0.759 ± 0.217
0.854CysVal: 0.854 ± 0.198
0.19CysTrp: 0.19 ± 0.103
0.427CysTyr: 0.427 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
5.407AspAla: 5.407 ± 0.684
0.901AspCys: 0.901 ± 0.204
5.502AspAsp: 5.502 ± 0.527
4.743AspGlu: 4.743 ± 0.534
2.371AspPhe: 2.371 ± 0.344
6.213AspGly: 6.213 ± 0.498
1.707AspHis: 1.707 ± 0.295
2.94AspIle: 2.94 ± 0.359
3.462AspLys: 3.462 ± 0.329
5.122AspLeu: 5.122 ± 0.603
1.992AspMet: 1.992 ± 0.259
2.656AspAsn: 2.656 ± 0.377
3.652AspPro: 3.652 ± 0.441
2.276AspGln: 2.276 ± 0.309
2.988AspArg: 2.988 ± 0.38
3.747AspSer: 3.747 ± 0.396
3.083AspThr: 3.083 ± 0.357
4.458AspVal: 4.458 ± 0.502
1.66AspTrp: 1.66 ± 0.322
2.324AspTyr: 2.324 ± 0.369
0.0AspXaa: 0.0 ± 0.0
Glu
6.545GluAla: 6.545 ± 0.481
1.423GluCys: 1.423 ± 0.297
5.596GluAsp: 5.596 ± 0.612
5.17GluGlu: 5.17 ± 0.609
3.035GluPhe: 3.035 ± 0.373
4.126GluGly: 4.126 ± 0.37
1.281GluHis: 1.281 ± 0.214
3.272GluIle: 3.272 ± 0.386
3.415GluLys: 3.415 ± 0.443
7.351GluLeu: 7.351 ± 0.544
2.703GluMet: 2.703 ± 0.356
3.699GluAsn: 3.699 ± 0.402
2.039GluPro: 2.039 ± 0.302
2.846GluGln: 2.846 ± 0.372
3.936GluArg: 3.936 ± 0.418
3.32GluSer: 3.32 ± 0.364
3.51GluThr: 3.51 ± 0.459
4.6GluVal: 4.6 ± 0.521
1.802GluTrp: 1.802 ± 0.327
2.514GluTyr: 2.514 ± 0.44
0.0GluXaa: 0.0 ± 0.0
Phe
2.419PheAla: 2.419 ± 0.322
0.759PheCys: 0.759 ± 0.18
2.846PheAsp: 2.846 ± 0.319
2.466PheGlu: 2.466 ± 0.325
0.901PhePhe: 0.901 ± 0.205
2.798PheGly: 2.798 ± 0.362
0.664PheHis: 0.664 ± 0.197
2.371PheIle: 2.371 ± 0.339
1.802PheLys: 1.802 ± 0.291
2.561PheLeu: 2.561 ± 0.452
1.138PheMet: 1.138 ± 0.267
1.375PheAsn: 1.375 ± 0.251
1.423PhePro: 1.423 ± 0.249
0.949PheGln: 0.949 ± 0.204
1.375PheArg: 1.375 ± 0.256
2.182PheSer: 2.182 ± 0.321
1.66PheThr: 1.66 ± 0.279
1.945PheVal: 1.945 ± 0.259
0.522PheTrp: 0.522 ± 0.182
0.949PheTyr: 0.949 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
5.976GlyAla: 5.976 ± 0.826
1.043GlyCys: 1.043 ± 0.214
3.984GlyAsp: 3.984 ± 0.504
5.217GlyGlu: 5.217 ± 0.43
2.846GlyPhe: 2.846 ± 0.332
7.114GlyGly: 7.114 ± 1.398
1.802GlyHis: 1.802 ± 0.345
4.695GlyIle: 4.695 ± 0.546
4.126GlyLys: 4.126 ± 0.47
5.691GlyLeu: 5.691 ± 0.573
2.466GlyMet: 2.466 ± 0.32
2.846GlyAsn: 2.846 ± 0.417
2.703GlyPro: 2.703 ± 0.321
3.225GlyGln: 3.225 ± 0.42
4.079GlyArg: 4.079 ± 0.44
5.217GlySer: 5.217 ± 0.63
6.26GlyThr: 6.26 ± 0.725
4.648GlyVal: 4.648 ± 0.595
1.802GlyTrp: 1.802 ± 0.283
3.794GlyTyr: 3.794 ± 0.406
0.0GlyXaa: 0.0 ± 0.0
His
0.854HisAla: 0.854 ± 0.273
0.522HisCys: 0.522 ± 0.188
1.043HisAsp: 1.043 ± 0.257
1.85HisGlu: 1.85 ± 0.261
0.617HisPhe: 0.617 ± 0.166
1.518HisGly: 1.518 ± 0.296
0.522HisHis: 0.522 ± 0.169
0.901HisIle: 0.901 ± 0.17
0.854HisLys: 0.854 ± 0.199
1.565HisLeu: 1.565 ± 0.323
0.664HisMet: 0.664 ± 0.195
0.664HisAsn: 0.664 ± 0.171
1.565HisPro: 1.565 ± 0.281
0.806HisGln: 0.806 ± 0.253
1.186HisArg: 1.186 ± 0.288
1.233HisSer: 1.233 ± 0.253
1.281HisThr: 1.281 ± 0.218
1.233HisVal: 1.233 ± 0.23
0.379HisTrp: 0.379 ± 0.108
0.522HisTyr: 0.522 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
4.174IleAla: 4.174 ± 0.343
0.854IleCys: 0.854 ± 0.176
4.174IleAsp: 4.174 ± 0.463
4.316IleGlu: 4.316 ± 0.434
0.996IlePhe: 0.996 ± 0.207
3.842IleGly: 3.842 ± 0.561
1.091IleHis: 1.091 ± 0.229
1.85IleIle: 1.85 ± 0.283
2.324IleLys: 2.324 ± 0.334
3.747IleLeu: 3.747 ± 0.55
0.806IleMet: 0.806 ± 0.208
1.707IleAsn: 1.707 ± 0.246
3.083IlePro: 3.083 ± 0.316
2.466IleGln: 2.466 ± 0.329
2.846IleArg: 2.846 ± 0.333
3.462IleSer: 3.462 ± 0.437
2.988IleThr: 2.988 ± 0.379
3.794IleVal: 3.794 ± 0.452
1.281IleTrp: 1.281 ± 0.224
1.565IleTyr: 1.565 ± 0.279
0.0IleXaa: 0.0 ± 0.0
Lys
4.411LysAla: 4.411 ± 0.539
0.854LysCys: 0.854 ± 0.208
3.557LysAsp: 3.557 ± 0.478
4.174LysGlu: 4.174 ± 0.406
1.328LysPhe: 1.328 ± 0.236
3.747LysGly: 3.747 ± 0.444
0.854LysHis: 0.854 ± 0.195
2.039LysIle: 2.039 ± 0.323
2.087LysLys: 2.087 ± 0.367
3.415LysLeu: 3.415 ± 0.404
1.755LysMet: 1.755 ± 0.34
1.755LysAsn: 1.755 ± 0.268
2.324LysPro: 2.324 ± 0.402
1.518LysGln: 1.518 ± 0.268
3.794LysArg: 3.794 ± 0.444
2.703LysSer: 2.703 ± 0.494
2.466LysThr: 2.466 ± 0.33
2.846LysVal: 2.846 ± 0.428
0.901LysTrp: 0.901 ± 0.19
1.755LysTyr: 1.755 ± 0.206
0.0LysXaa: 0.0 ± 0.0
Leu
8.347LeuAla: 8.347 ± 0.762
1.091LeuCys: 1.091 ± 0.264
5.075LeuAsp: 5.075 ± 0.365
6.45LeuGlu: 6.45 ± 0.554
2.466LeuPhe: 2.466 ± 0.403
5.17LeuGly: 5.17 ± 0.598
1.375LeuHis: 1.375 ± 0.3
3.842LeuIle: 3.842 ± 0.414
3.557LeuLys: 3.557 ± 0.454
5.834LeuLeu: 5.834 ± 0.659
1.565LeuMet: 1.565 ± 0.303
3.557LeuAsn: 3.557 ± 0.448
3.936LeuPro: 3.936 ± 0.38
2.94LeuGln: 2.94 ± 0.37
4.553LeuArg: 4.553 ± 0.418
5.454LeuSer: 5.454 ± 0.414
4.031LeuThr: 4.031 ± 0.4
4.648LeuVal: 4.648 ± 0.68
1.85LeuTrp: 1.85 ± 0.337
2.371LeuTyr: 2.371 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
2.703MetAla: 2.703 ± 0.371
0.237MetCys: 0.237 ± 0.102
2.324MetAsp: 2.324 ± 0.337
1.945MetGlu: 1.945 ± 0.353
0.854MetPhe: 0.854 ± 0.2
1.755MetGly: 1.755 ± 0.347
0.285MetHis: 0.285 ± 0.108
1.66MetIle: 1.66 ± 0.302
1.233MetLys: 1.233 ± 0.221
1.85MetLeu: 1.85 ± 0.298
0.237MetMet: 0.237 ± 0.097
1.66MetAsn: 1.66 ± 0.339
1.66MetPro: 1.66 ± 0.326
0.901MetGln: 0.901 ± 0.192
1.47MetArg: 1.47 ± 0.273
1.802MetSer: 1.802 ± 0.271
2.466MetThr: 2.466 ± 0.3
1.755MetVal: 1.755 ± 0.344
0.332MetTrp: 0.332 ± 0.138
0.996MetTyr: 0.996 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
3.035AsnAla: 3.035 ± 0.439
0.759AsnCys: 0.759 ± 0.24
1.945AsnAsp: 1.945 ± 0.219
3.699AsnGlu: 3.699 ± 0.494
1.186AsnPhe: 1.186 ± 0.268
4.411AsnGly: 4.411 ± 0.501
1.47AsnHis: 1.47 ± 0.294
1.755AsnIle: 1.755 ± 0.238
1.47AsnLys: 1.47 ± 0.247
2.466AsnLeu: 2.466 ± 0.322
0.996AsnMet: 0.996 ± 0.192
1.518AsnAsn: 1.518 ± 0.306
2.466AsnPro: 2.466 ± 0.316
1.186AsnGln: 1.186 ± 0.221
2.419AsnArg: 2.419 ± 0.326
3.178AsnSer: 3.178 ± 0.341
1.992AsnThr: 1.992 ± 0.32
2.846AsnVal: 2.846 ± 0.391
0.664AsnTrp: 0.664 ± 0.186
1.47AsnTyr: 1.47 ± 0.232
0.0AsnXaa: 0.0 ± 0.0
Pro
3.035ProAla: 3.035 ± 0.392
0.901ProCys: 0.901 ± 0.222
3.652ProAsp: 3.652 ± 0.396
3.557ProGlu: 3.557 ± 0.381
1.233ProPhe: 1.233 ± 0.282
3.178ProGly: 3.178 ± 0.459
0.806ProHis: 0.806 ± 0.213
2.846ProIle: 2.846 ± 0.356
1.755ProLys: 1.755 ± 0.315
3.083ProLeu: 3.083 ± 0.381
1.755ProMet: 1.755 ± 0.318
1.992ProAsn: 1.992 ± 0.305
1.328ProPro: 1.328 ± 0.262
1.518ProGln: 1.518 ± 0.239
1.707ProArg: 1.707 ± 0.291
3.035ProSer: 3.035 ± 0.367
2.561ProThr: 2.561 ± 0.37
3.557ProVal: 3.557 ± 0.455
0.901ProTrp: 0.901 ± 0.166
1.423ProTyr: 1.423 ± 0.206
0.0ProXaa: 0.0 ± 0.0
Gln
3.272GlnAla: 3.272 ± 0.419
0.427GlnCys: 0.427 ± 0.133
1.945GlnAsp: 1.945 ± 0.336
3.13GlnGlu: 3.13 ± 0.389
1.328GlnPhe: 1.328 ± 0.238
2.94GlnGly: 2.94 ± 0.345
0.664GlnHis: 0.664 ± 0.177
1.945GlnIle: 1.945 ± 0.313
1.897GlnLys: 1.897 ± 0.32
2.608GlnLeu: 2.608 ± 0.338
1.043GlnMet: 1.043 ± 0.209
1.091GlnAsn: 1.091 ± 0.237
1.755GlnPro: 1.755 ± 0.287
1.565GlnGln: 1.565 ± 0.37
2.466GlnArg: 2.466 ± 0.343
2.371GlnSer: 2.371 ± 0.329
1.375GlnThr: 1.375 ± 0.233
1.897GlnVal: 1.897 ± 0.307
1.091GlnTrp: 1.091 ± 0.213
1.233GlnTyr: 1.233 ± 0.252
0.0GlnXaa: 0.0 ± 0.0
Arg
5.17ArgAla: 5.17 ± 0.516
1.043ArgCys: 1.043 ± 0.286
2.798ArgAsp: 2.798 ± 0.421
4.458ArgGlu: 4.458 ± 0.389
1.613ArgPhe: 1.613 ± 0.248
3.936ArgGly: 3.936 ± 0.411
1.091ArgHis: 1.091 ± 0.226
2.846ArgIle: 2.846 ± 0.385
3.794ArgLys: 3.794 ± 0.563
4.458ArgLeu: 4.458 ± 0.446
1.707ArgMet: 1.707 ± 0.341
2.419ArgAsn: 2.419 ± 0.34
1.613ArgPro: 1.613 ± 0.222
2.087ArgGln: 2.087 ± 0.297
3.794ArgArg: 3.794 ± 0.522
3.699ArgSer: 3.699 ± 0.386
2.371ArgThr: 2.371 ± 0.349
3.272ArgVal: 3.272 ± 0.333
1.186ArgTrp: 1.186 ± 0.285
1.755ArgTyr: 1.755 ± 0.308
0.0ArgXaa: 0.0 ± 0.0
Ser
6.403SerAla: 6.403 ± 0.604
0.711SerCys: 0.711 ± 0.192
4.411SerAsp: 4.411 ± 0.508
3.699SerGlu: 3.699 ± 0.503
2.182SerPhe: 2.182 ± 0.264
6.118SerGly: 6.118 ± 0.902
1.281SerHis: 1.281 ± 0.33
3.13SerIle: 3.13 ± 0.364
2.514SerLys: 2.514 ± 0.351
5.075SerLeu: 5.075 ± 0.477
1.755SerMet: 1.755 ± 0.276
2.751SerAsn: 2.751 ± 0.315
2.561SerPro: 2.561 ± 0.401
1.66SerGln: 1.66 ± 0.283
2.94SerArg: 2.94 ± 0.356
4.268SerSer: 4.268 ± 0.526
3.984SerThr: 3.984 ± 0.538
4.79SerVal: 4.79 ± 0.544
1.186SerTrp: 1.186 ± 0.281
2.371SerTyr: 2.371 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
4.79ThrAla: 4.79 ± 0.828
0.711ThrCys: 0.711 ± 0.179
3.13ThrAsp: 3.13 ± 0.421
3.225ThrGlu: 3.225 ± 0.411
2.893ThrPhe: 2.893 ± 0.313
4.363ThrGly: 4.363 ± 0.559
0.711ThrHis: 0.711 ± 0.207
3.842ThrIle: 3.842 ± 0.454
3.035ThrLys: 3.035 ± 0.454
4.268ThrLeu: 4.268 ± 0.453
1.186ThrMet: 1.186 ± 0.269
2.371ThrAsn: 2.371 ± 0.327
3.083ThrPro: 3.083 ± 0.36
1.707ThrGln: 1.707 ± 0.294
3.035ThrArg: 3.035 ± 0.343
3.035ThrSer: 3.035 ± 0.453
3.32ThrThr: 3.32 ± 0.461
5.407ThrVal: 5.407 ± 0.61
1.47ThrTrp: 1.47 ± 0.295
2.324ThrTyr: 2.324 ± 0.36
0.0ThrXaa: 0.0 ± 0.0
Val
6.166ValAla: 6.166 ± 0.639
0.854ValCys: 0.854 ± 0.211
4.648ValAsp: 4.648 ± 0.467
4.363ValGlu: 4.363 ± 0.459
2.608ValPhe: 2.608 ± 0.334
5.454ValGly: 5.454 ± 0.472
1.186ValHis: 1.186 ± 0.228
3.604ValIle: 3.604 ± 0.403
2.94ValLys: 2.94 ± 0.368
4.648ValLeu: 4.648 ± 0.507
1.802ValMet: 1.802 ± 0.28
2.276ValAsn: 2.276 ± 0.262
2.798ValPro: 2.798 ± 0.35
2.514ValGln: 2.514 ± 0.313
3.32ValArg: 3.32 ± 0.335
4.885ValSer: 4.885 ± 0.561
4.411ValThr: 4.411 ± 0.701
6.26ValVal: 6.26 ± 0.635
0.901ValTrp: 0.901 ± 0.177
1.755ValTyr: 1.755 ± 0.397
0.0ValXaa: 0.0 ± 0.0
Trp
1.85TrpAla: 1.85 ± 0.259
0.379TrpCys: 0.379 ± 0.151
1.518TrpAsp: 1.518 ± 0.277
1.043TrpGlu: 1.043 ± 0.229
0.427TrpPhe: 0.427 ± 0.141
1.707TrpGly: 1.707 ± 0.281
0.664TrpHis: 0.664 ± 0.164
0.949TrpIle: 0.949 ± 0.232
0.854TrpLys: 0.854 ± 0.191
2.371TrpLeu: 2.371 ± 0.36
0.664TrpMet: 0.664 ± 0.188
0.759TrpAsn: 0.759 ± 0.215
0.711TrpPro: 0.711 ± 0.224
0.617TrpGln: 0.617 ± 0.161
1.281TrpArg: 1.281 ± 0.253
1.423TrpSer: 1.423 ± 0.254
1.328TrpThr: 1.328 ± 0.332
1.138TrpVal: 1.138 ± 0.236
0.711TrpTrp: 0.711 ± 0.173
0.474TrpTyr: 0.474 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.656TyrAla: 2.656 ± 0.321
0.617TyrCys: 0.617 ± 0.181
2.798TyrAsp: 2.798 ± 0.357
2.703TyrGlu: 2.703 ± 0.471
0.949TyrPhe: 0.949 ± 0.216
2.514TyrGly: 2.514 ± 0.377
0.711TyrHis: 0.711 ± 0.198
1.043TyrIle: 1.043 ± 0.223
1.328TyrLys: 1.328 ± 0.261
3.178TyrLeu: 3.178 ± 0.403
0.949TyrMet: 0.949 ± 0.2
1.233TyrAsn: 1.233 ± 0.286
1.755TyrPro: 1.755 ± 0.311
0.854TyrGln: 0.854 ± 0.196
2.134TyrArg: 2.134 ± 0.277
2.276TyrSer: 2.276 ± 0.309
1.992TyrThr: 1.992 ± 0.306
2.371TyrVal: 2.371 ± 0.279
0.854TyrTrp: 0.854 ± 0.232
1.091TyrTyr: 1.091 ± 0.25
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 149 proteins (21086 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski