Amino acid dipepetide frequency for Shewanella sp. phage 1/44

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.968AlaAla: 7.968 ± 1.328
1.202AlaCys: 1.202 ± 0.282
4.49AlaAsp: 4.49 ± 0.71
3.225AlaGlu: 3.225 ± 0.686
2.53AlaPhe: 2.53 ± 0.358
5.755AlaGly: 5.755 ± 0.722
1.012AlaHis: 1.012 ± 0.277
5.818AlaIle: 5.818 ± 0.769
5.312AlaLys: 5.312 ± 0.88
6.514AlaLeu: 6.514 ± 0.755
2.403AlaMet: 2.403 ± 0.455
4.427AlaAsn: 4.427 ± 0.562
5.186AlaPro: 5.186 ± 2.021
3.035AlaGln: 3.035 ± 0.56
2.909AlaArg: 2.909 ± 0.36
5.375AlaSer: 5.375 ± 0.524
5.439AlaThr: 5.439 ± 0.937
5.122AlaVal: 5.122 ± 0.526
0.949AlaTrp: 0.949 ± 0.234
2.783AlaTyr: 2.783 ± 0.551
0.0AlaXaa: 0.0 ± 0.0
Cys
0.443CysAla: 0.443 ± 0.215
0.379CysCys: 0.379 ± 0.188
1.265CysAsp: 1.265 ± 0.307
1.012CysGlu: 1.012 ± 0.277
0.506CysPhe: 0.506 ± 0.203
0.696CysGly: 0.696 ± 0.229
0.569CysHis: 0.569 ± 0.19
0.759CysIle: 0.759 ± 0.192
0.949CysLys: 0.949 ± 0.34
1.265CysLeu: 1.265 ± 0.273
0.696CysMet: 0.696 ± 0.19
0.379CysAsn: 0.379 ± 0.144
0.506CysPro: 0.506 ± 0.174
0.19CysGln: 0.19 ± 0.105
1.075CysArg: 1.075 ± 0.313
0.569CysSer: 0.569 ± 0.184
0.506CysThr: 0.506 ± 0.159
1.138CysVal: 1.138 ± 0.28
0.126CysTrp: 0.126 ± 0.086
0.506CysTyr: 0.506 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
5.375AspAla: 5.375 ± 0.774
0.632AspCys: 0.632 ± 0.189
4.553AspAsp: 4.553 ± 0.521
3.794AspGlu: 3.794 ± 0.439
3.288AspPhe: 3.288 ± 0.448
5.059AspGly: 5.059 ± 0.652
1.075AspHis: 1.075 ± 0.284
4.49AspIle: 4.49 ± 0.455
2.719AspLys: 2.719 ± 0.425
5.249AspLeu: 5.249 ± 0.813
2.024AspMet: 2.024 ± 0.364
2.783AspAsn: 2.783 ± 0.358
2.024AspPro: 2.024 ± 0.426
1.518AspGln: 1.518 ± 0.377
1.897AspArg: 1.897 ± 0.459
4.363AspSer: 4.363 ± 0.585
4.49AspThr: 4.49 ± 0.592
5.944AspVal: 5.944 ± 0.583
1.075AspTrp: 1.075 ± 0.305
2.213AspTyr: 2.213 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
4.237GluAla: 4.237 ± 0.727
0.696GluCys: 0.696 ± 0.232
1.075GluAsp: 1.075 ± 0.297
1.581GluGlu: 1.581 ± 0.401
2.277GluPhe: 2.277 ± 0.415
2.213GluGly: 2.213 ± 0.386
1.454GluHis: 1.454 ± 0.368
4.047GluIle: 4.047 ± 0.604
2.403GluLys: 2.403 ± 0.594
8.284GluLeu: 8.284 ± 1.23
1.391GluMet: 1.391 ± 0.313
2.277GluAsn: 2.277 ± 0.318
1.707GluPro: 1.707 ± 0.375
2.719GluGln: 2.719 ± 0.679
2.15GluArg: 2.15 ± 0.41
3.099GluSer: 3.099 ± 0.423
2.656GluThr: 2.656 ± 0.518
3.731GluVal: 3.731 ± 0.534
0.822GluTrp: 0.822 ± 0.237
2.593GluTyr: 2.593 ± 0.475
0.0GluXaa: 0.0 ± 0.0
Phe
3.035PheAla: 3.035 ± 0.4
0.316PheCys: 0.316 ± 0.161
2.593PheAsp: 2.593 ± 0.382
2.087PheGlu: 2.087 ± 0.346
0.759PhePhe: 0.759 ± 0.279
2.403PheGly: 2.403 ± 0.447
0.443PheHis: 0.443 ± 0.26
2.34PheIle: 2.34 ± 0.409
3.035PheLys: 3.035 ± 0.437
2.024PheLeu: 2.024 ± 0.34
0.822PheMet: 0.822 ± 0.202
2.719PheAsn: 2.719 ± 0.472
1.265PhePro: 1.265 ± 0.293
1.138PheGln: 1.138 ± 0.251
1.581PheArg: 1.581 ± 0.345
2.213PheSer: 2.213 ± 0.465
3.541PheThr: 3.541 ± 0.551
2.15PheVal: 2.15 ± 0.425
0.063PheTrp: 0.063 ± 0.058
1.202PheTyr: 1.202 ± 0.285
0.0PheXaa: 0.0 ± 0.0
Gly
3.478GlyAla: 3.478 ± 0.485
1.012GlyCys: 1.012 ± 0.201
5.375GlyAsp: 5.375 ± 0.508
3.668GlyGlu: 3.668 ± 0.598
2.909GlyPhe: 2.909 ± 0.434
4.68GlyGly: 4.68 ± 0.641
1.265GlyHis: 1.265 ± 0.325
3.035GlyIle: 3.035 ± 0.488
4.68GlyLys: 4.68 ± 0.606
6.64GlyLeu: 6.64 ± 0.941
2.34GlyMet: 2.34 ± 0.442
4.174GlyAsn: 4.174 ± 0.534
0.506GlyPro: 0.506 ± 0.194
1.96GlyGln: 1.96 ± 0.359
2.656GlyArg: 2.656 ± 0.424
3.352GlySer: 3.352 ± 0.491
5.312GlyThr: 5.312 ± 0.623
6.387GlyVal: 6.387 ± 0.713
1.075GlyTrp: 1.075 ± 0.268
3.225GlyTyr: 3.225 ± 0.505
0.0GlyXaa: 0.0 ± 0.0
His
0.759HisAla: 0.759 ± 0.296
0.379HisCys: 0.379 ± 0.216
1.454HisAsp: 1.454 ± 0.337
1.391HisGlu: 1.391 ± 0.384
0.822HisPhe: 0.822 ± 0.244
1.454HisGly: 1.454 ± 0.249
0.253HisHis: 0.253 ± 0.151
1.012HisIle: 1.012 ± 0.223
0.696HisLys: 0.696 ± 0.246
2.15HisLeu: 2.15 ± 0.421
0.19HisMet: 0.19 ± 0.101
1.075HisAsn: 1.075 ± 0.265
0.443HisPro: 0.443 ± 0.201
0.632HisGln: 0.632 ± 0.256
0.949HisArg: 0.949 ± 0.321
1.265HisSer: 1.265 ± 0.315
1.075HisThr: 1.075 ± 0.259
1.454HisVal: 1.454 ± 0.289
0.19HisTrp: 0.19 ± 0.115
0.949HisTyr: 0.949 ± 0.251
0.0HisXaa: 0.0 ± 0.0
Ile
5.249IleAla: 5.249 ± 0.558
0.632IleCys: 0.632 ± 0.213
4.427IleAsp: 4.427 ± 0.524
5.249IleGlu: 5.249 ± 0.586
1.644IlePhe: 1.644 ± 0.269
4.49IleGly: 4.49 ± 0.661
0.632IleHis: 0.632 ± 0.163
3.731IleIle: 3.731 ± 0.539
4.933IleLys: 4.933 ± 0.492
3.035IleLeu: 3.035 ± 0.464
1.771IleMet: 1.771 ± 0.379
4.996IleAsn: 4.996 ± 0.638
2.466IlePro: 2.466 ± 0.499
1.834IleGln: 1.834 ± 0.267
3.225IleArg: 3.225 ± 0.41
4.743IleSer: 4.743 ± 0.801
5.692IleThr: 5.692 ± 0.565
4.553IleVal: 4.553 ± 0.609
0.632IleTrp: 0.632 ± 0.239
1.707IleTyr: 1.707 ± 0.274
0.0IleXaa: 0.0 ± 0.0
Lys
5.439LysAla: 5.439 ± 1.085
1.138LysCys: 1.138 ± 0.324
2.593LysAsp: 2.593 ± 0.384
2.656LysGlu: 2.656 ± 0.774
2.087LysPhe: 2.087 ± 0.385
3.731LysGly: 3.731 ± 0.404
1.391LysHis: 1.391 ± 0.314
3.352LysIle: 3.352 ± 0.388
2.846LysLys: 2.846 ± 0.482
6.387LysLeu: 6.387 ± 0.906
1.897LysMet: 1.897 ± 0.414
2.213LysAsn: 2.213 ± 0.293
3.288LysPro: 3.288 ± 0.567
2.783LysGln: 2.783 ± 0.485
3.352LysArg: 3.352 ± 0.503
5.186LysSer: 5.186 ± 0.584
3.984LysThr: 3.984 ± 0.537
4.174LysVal: 4.174 ± 0.44
0.885LysTrp: 0.885 ± 0.262
1.518LysTyr: 1.518 ± 0.317
0.0LysXaa: 0.0 ± 0.0
Leu
7.842LeuAla: 7.842 ± 1.012
1.075LeuCys: 1.075 ± 0.255
4.806LeuAsp: 4.806 ± 0.537
4.3LeuGlu: 4.3 ± 0.936
2.34LeuPhe: 2.34 ± 0.465
4.806LeuGly: 4.806 ± 0.636
1.391LeuHis: 1.391 ± 0.329
5.881LeuIle: 5.881 ± 0.653
5.565LeuLys: 5.565 ± 0.823
5.565LeuLeu: 5.565 ± 0.575
2.34LeuMet: 2.34 ± 0.369
5.881LeuAsn: 5.881 ± 0.646
3.415LeuPro: 3.415 ± 0.514
2.277LeuGln: 2.277 ± 0.378
4.047LeuArg: 4.047 ± 0.587
5.628LeuSer: 5.628 ± 0.751
6.893LeuThr: 6.893 ± 1.04
5.249LeuVal: 5.249 ± 0.549
0.759LeuTrp: 0.759 ± 0.224
2.466LeuTyr: 2.466 ± 0.386
0.0LeuXaa: 0.0 ± 0.0
Met
2.466MetAla: 2.466 ± 0.491
0.19MetCys: 0.19 ± 0.116
0.949MetAsp: 0.949 ± 0.321
0.569MetGlu: 0.569 ± 0.208
0.569MetPhe: 0.569 ± 0.172
1.202MetGly: 1.202 ± 0.282
0.506MetHis: 0.506 ± 0.2
2.466MetIle: 2.466 ± 0.535
1.454MetLys: 1.454 ± 0.271
2.34MetLeu: 2.34 ± 0.432
1.138MetMet: 1.138 ± 0.318
1.454MetAsn: 1.454 ± 0.268
1.328MetPro: 1.328 ± 0.353
0.822MetGln: 0.822 ± 0.224
1.075MetArg: 1.075 ± 0.31
2.213MetSer: 2.213 ± 0.467
2.909MetThr: 2.909 ± 0.306
2.024MetVal: 2.024 ± 0.337
0.316MetTrp: 0.316 ± 0.132
1.328MetTyr: 1.328 ± 0.293
0.0MetXaa: 0.0 ± 0.0
Asn
3.984AsnAla: 3.984 ± 0.665
1.012AsnCys: 1.012 ± 0.317
3.984AsnAsp: 3.984 ± 0.454
3.162AsnGlu: 3.162 ± 0.459
2.087AsnPhe: 2.087 ± 0.378
4.869AsnGly: 4.869 ± 0.618
1.265AsnHis: 1.265 ± 0.3
3.352AsnIle: 3.352 ± 0.479
3.162AsnLys: 3.162 ± 0.513
3.921AsnLeu: 3.921 ± 0.607
1.012AsnMet: 1.012 ± 0.279
3.035AsnAsn: 3.035 ± 0.567
2.656AsnPro: 2.656 ± 0.517
2.656AsnGln: 2.656 ± 0.43
1.897AsnArg: 1.897 ± 0.315
4.111AsnSer: 4.111 ± 0.556
3.415AsnThr: 3.415 ± 0.417
4.3AsnVal: 4.3 ± 0.473
1.012AsnTrp: 1.012 ± 0.227
1.518AsnTyr: 1.518 ± 0.348
0.0AsnXaa: 0.0 ± 0.0
Pro
4.743ProAla: 4.743 ± 1.231
0.443ProCys: 0.443 ± 0.154
2.403ProAsp: 2.403 ± 0.463
2.087ProGlu: 2.087 ± 0.322
1.075ProPhe: 1.075 ± 0.296
2.15ProGly: 2.15 ± 0.504
0.632ProHis: 0.632 ± 0.244
2.34ProIle: 2.34 ± 0.627
2.53ProLys: 2.53 ± 0.485
3.035ProLeu: 3.035 ± 0.525
1.391ProMet: 1.391 ± 0.386
1.96ProAsn: 1.96 ± 0.392
1.897ProPro: 1.897 ± 0.451
1.518ProGln: 1.518 ± 0.776
1.265ProArg: 1.265 ± 0.297
2.783ProSer: 2.783 ± 0.439
3.035ProThr: 3.035 ± 0.443
2.783ProVal: 2.783 ± 0.634
0.316ProTrp: 0.316 ± 0.182
1.328ProTyr: 1.328 ± 0.423
0.0ProXaa: 0.0 ± 0.0
Gln
3.352GlnAla: 3.352 ± 0.663
0.506GlnCys: 0.506 ± 0.186
1.138GlnAsp: 1.138 ± 0.27
1.707GlnGlu: 1.707 ± 0.336
1.075GlnPhe: 1.075 ± 0.219
1.834GlnGly: 1.834 ± 0.331
1.265GlnHis: 1.265 ± 0.328
2.277GlnIle: 2.277 ± 0.342
1.581GlnLys: 1.581 ± 0.319
3.984GlnLeu: 3.984 ± 0.603
0.569GlnMet: 0.569 ± 0.199
1.265GlnAsn: 1.265 ± 0.371
1.644GlnPro: 1.644 ± 0.351
2.024GlnGln: 2.024 ± 0.575
2.213GlnArg: 2.213 ± 0.327
2.593GlnSer: 2.593 ± 0.408
1.265GlnThr: 1.265 ± 0.23
2.656GlnVal: 2.656 ± 0.358
1.012GlnTrp: 1.012 ± 0.256
1.265GlnTyr: 1.265 ± 0.301
0.0GlnXaa: 0.0 ± 0.0
Arg
4.363ArgAla: 4.363 ± 0.619
0.885ArgCys: 0.885 ± 0.194
2.846ArgAsp: 2.846 ± 0.495
2.593ArgGlu: 2.593 ± 0.603
2.213ArgPhe: 2.213 ± 0.46
3.099ArgGly: 3.099 ± 0.568
0.696ArgHis: 0.696 ± 0.188
2.213ArgIle: 2.213 ± 0.448
3.415ArgLys: 3.415 ± 0.54
3.035ArgLeu: 3.035 ± 0.52
1.138ArgMet: 1.138 ± 0.253
2.466ArgAsn: 2.466 ± 0.347
1.138ArgPro: 1.138 ± 0.254
1.391ArgGln: 1.391 ± 0.268
1.771ArgArg: 1.771 ± 0.45
1.897ArgSer: 1.897 ± 0.296
1.707ArgThr: 1.707 ± 0.339
3.035ArgVal: 3.035 ± 0.493
0.379ArgTrp: 0.379 ± 0.157
2.15ArgTyr: 2.15 ± 0.397
0.0ArgXaa: 0.0 ± 0.0
Ser
4.68SerAla: 4.68 ± 0.522
0.632SerCys: 0.632 ± 0.207
5.565SerAsp: 5.565 ± 0.575
2.972SerGlu: 2.972 ± 0.574
2.656SerPhe: 2.656 ± 0.477
5.755SerGly: 5.755 ± 0.853
1.138SerHis: 1.138 ± 0.25
5.755SerIle: 5.755 ± 0.66
4.174SerLys: 4.174 ± 0.557
5.122SerLeu: 5.122 ± 0.463
1.454SerMet: 1.454 ± 0.296
3.731SerAsn: 3.731 ± 0.529
1.834SerPro: 1.834 ± 0.445
2.403SerGln: 2.403 ± 0.333
2.972SerArg: 2.972 ± 0.424
3.668SerSer: 3.668 ± 0.488
4.174SerThr: 4.174 ± 0.65
5.059SerVal: 5.059 ± 0.662
0.885SerTrp: 0.885 ± 0.176
2.024SerTyr: 2.024 ± 0.327
0.0SerXaa: 0.0 ± 0.0
Thr
5.565ThrAla: 5.565 ± 0.678
0.569ThrCys: 0.569 ± 0.203
5.502ThrAsp: 5.502 ± 0.775
3.478ThrGlu: 3.478 ± 0.435
2.783ThrPhe: 2.783 ± 0.581
5.186ThrGly: 5.186 ± 0.747
1.391ThrHis: 1.391 ± 0.281
4.616ThrIle: 4.616 ± 0.513
3.794ThrLys: 3.794 ± 0.549
5.186ThrLeu: 5.186 ± 0.655
1.328ThrMet: 1.328 ± 0.313
3.731ThrAsn: 3.731 ± 0.571
3.921ThrPro: 3.921 ± 0.564
2.15ThrGln: 2.15 ± 0.257
2.34ThrArg: 2.34 ± 0.448
4.869ThrSer: 4.869 ± 0.558
4.68ThrThr: 4.68 ± 0.615
5.439ThrVal: 5.439 ± 0.789
0.822ThrTrp: 0.822 ± 0.306
1.897ThrTyr: 1.897 ± 0.32
0.0ThrXaa: 0.0 ± 0.0
Val
4.743ValAla: 4.743 ± 0.549
0.759ValCys: 0.759 ± 0.255
6.008ValAsp: 6.008 ± 0.688
3.415ValGlu: 3.415 ± 0.457
2.656ValPhe: 2.656 ± 0.435
4.743ValGly: 4.743 ± 0.501
1.075ValHis: 1.075 ± 0.297
4.996ValIle: 4.996 ± 0.607
5.059ValLys: 5.059 ± 0.694
5.439ValLeu: 5.439 ± 0.615
2.34ValMet: 2.34 ± 0.55
4.869ValAsn: 4.869 ± 0.4
2.277ValPro: 2.277 ± 0.637
2.593ValGln: 2.593 ± 0.379
2.909ValArg: 2.909 ± 0.493
5.312ValSer: 5.312 ± 0.527
5.375ValThr: 5.375 ± 0.701
4.616ValVal: 4.616 ± 0.651
0.632ValTrp: 0.632 ± 0.183
2.909ValTyr: 2.909 ± 0.473
0.0ValXaa: 0.0 ± 0.0
Trp
0.949TrpAla: 0.949 ± 0.296
0.316TrpCys: 0.316 ± 0.158
1.328TrpAsp: 1.328 ± 0.308
0.443TrpGlu: 0.443 ± 0.157
0.253TrpPhe: 0.253 ± 0.125
0.443TrpGly: 0.443 ± 0.183
0.379TrpHis: 0.379 ± 0.16
0.822TrpIle: 0.822 ± 0.228
0.822TrpLys: 0.822 ± 0.263
1.454TrpLeu: 1.454 ± 0.391
0.316TrpMet: 0.316 ± 0.181
0.379TrpAsn: 0.379 ± 0.197
0.569TrpPro: 0.569 ± 0.208
0.316TrpGln: 0.316 ± 0.154
0.569TrpArg: 0.569 ± 0.224
1.012TrpSer: 1.012 ± 0.332
0.822TrpThr: 0.822 ± 0.196
0.885TrpVal: 0.885 ± 0.291
0.253TrpTrp: 0.253 ± 0.113
0.443TrpTyr: 0.443 ± 0.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.783TyrAla: 2.783 ± 0.538
0.759TyrCys: 0.759 ± 0.204
2.213TyrAsp: 2.213 ± 0.273
2.024TyrGlu: 2.024 ± 0.327
1.202TyrPhe: 1.202 ± 0.299
3.162TyrGly: 3.162 ± 0.507
0.759TyrHis: 0.759 ± 0.203
2.34TyrIle: 2.34 ± 0.377
1.771TyrLys: 1.771 ± 0.331
1.897TyrLeu: 1.897 ± 0.297
0.696TyrMet: 0.696 ± 0.187
2.593TyrAsn: 2.593 ± 0.494
1.644TyrPro: 1.644 ± 0.449
1.328TyrGln: 1.328 ± 0.339
1.581TyrArg: 1.581 ± 0.267
2.403TyrSer: 2.403 ± 0.472
2.277TyrThr: 2.277 ± 0.337
2.024TyrVal: 2.024 ± 0.484
0.569TyrTrp: 0.569 ± 0.215
1.391TyrTyr: 1.391 ± 0.342
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (15814 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski