Amino acid dipeptide frequency for Synechococcus virus S-ESS1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
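As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice not to count pairs that span two proteins are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping windows of length 2 taken within each
    protein; no pair is formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not sequences from this proteome.
freqs = dipeptide_per_mille(["MAAK", "AAG"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The two toy sequences contain five dipeptides in total, two of which are AA, so AA accounts for 400 ‰ of them.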

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
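The expected value under this uniform model, and a simple over/under comparison against it, can be sketched as follows. The function names are hypothetical, and comparing directly against the expected value is an assumption: the page does not state the exact rule used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all are equally likely."""
    return 1000.0 / alphabet_size ** 2


def representation(observed_per_mille, alphabet_size=20):
    """Classify an observed frequency relative to the uniform expectation.

    Assumption: a plain comparison with the expected value, ignoring the
    reported error.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille())        # 2.5 per mille, i.e. 0.25%
print(representation(14.709))      # AlaAla value from the table
print(representation(0.217))       # CysCys value from the table
```

With 21 symbols (including Xaa) the same formula gives 1000/441, roughly 2.27 ‰.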

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
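For readers who prefer fractions or percentages, the unit conversion is a one-liner; the helper names below are made up for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


# AlaAla value taken from the table: 14.709 per mille.
print(per_mille_to_fraction(14.709))
print(per_mille_to_percent(14.709))
```

So the AlaAla entry of 14.709 ‰ corresponds to about 1.47% of all dipeptides.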

For more information, see the sequence space article on Wikipedia.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 14.709 ± 2.295 | 0.652 ± 0.233 | 5.942 ± 0.782 | 7.97 ± 0.938 | 3.55 ± 0.54 | 8.478 ± 0.696 | 1.594 ± 0.324 | 5.869 ± 0.843 | 4.71 ± 0.582 | 7.681 ± 0.922 | 3.333 ± 0.474 | 2.681 ± 0.346 | 3.623 ± 0.523 | 4.565 ± 0.7 | 8.84 ± 1.531 | 6.159 ± 0.627 | 4.71 ± 0.957 | 6.231 ± 0.818 | 1.522 ± 0.374 | 2.753 ± 0.437 | 0.0 ± 0.0 |
| Cys | 0.507 ± 0.193 | 0.217 ± 0.139 | 0.435 ± 0.172 | 0.87 ± 0.227 | 0.362 ± 0.152 | 0.797 ± 0.267 | 0.362 ± 0.197 | 0.217 ± 0.113 | 0.217 ± 0.147 | 0.435 ± 0.199 | 0.362 ± 0.157 | 0.217 ± 0.112 | 1.087 ± 0.291 | 0.435 ± 0.175 | 0.87 ± 0.246 | 0.435 ± 0.174 | 0.435 ± 0.144 | 0.652 ± 0.236 | 0.145 ± 0.093 | 0.29 ± 0.163 | 0.0 ± 0.0 |
| Asp | 5.942 ± 0.722 | 1.087 ± 0.327 | 4.782 ± 0.763 | 3.985 ± 0.642 | 2.464 ± 0.345 | 4.71 ± 0.518 | 1.811 ± 0.387 | 2.826 ± 0.507 | 2.174 ± 0.432 | 5.434 ± 0.644 | 2.536 ± 0.349 | 2.246 ± 0.358 | 3.695 ± 0.632 | 2.609 ± 0.416 | 4.565 ± 0.719 | 2.826 ± 0.469 | 2.609 ± 0.392 | 3.55 ± 0.467 | 0.942 ± 0.334 | 2.246 ± 0.339 | 0.0 ± 0.0 |
| Glu | 9.999 ± 1.274 | 0.217 ± 0.158 | 4.927 ± 0.642 | 6.666 ± 0.846 | 2.246 ± 0.364 | 4.927 ± 0.556 | 0.87 ± 0.3 | 2.971 ± 0.502 | 2.681 ± 0.56 | 6.159 ± 0.905 | 2.391 ± 0.452 | 2.174 ± 0.381 | 2.319 ± 0.478 | 2.753 ± 0.607 | 4.637 ± 0.605 | 3.913 ± 0.592 | 3.768 ± 0.531 | 6.304 ± 0.712 | 1.449 ± 0.298 | 1.594 ± 0.326 | 0.0 ± 0.0 |
| Phe | 3.261 ± 0.611 | 0.652 ± 0.21 | 2.609 ± 0.482 | 2.101 ± 0.515 | 1.232 ± 0.373 | 2.391 ± 0.413 | 0.507 ± 0.208 | 1.739 ± 0.292 | 1.377 ± 0.311 | 2.971 ± 0.424 | 0.942 ± 0.286 | 1.014 ± 0.23 | 1.449 ± 0.307 | 1.667 ± 0.399 | 2.753 ± 0.435 | 2.971 ± 0.388 | 1.667 ± 0.292 | 1.884 ± 0.342 | 0.87 ± 0.284 | 0.797 ± 0.186 | 0.0 ± 0.0 |
| Gly | 6.521 ± 0.725 | 1.159 ± 0.33 | 4.13 ± 0.615 | 5.072 ± 0.493 | 3.261 ± 0.518 | 5.724 ± 1.148 | 1.739 ± 0.402 | 3.55 ± 0.442 | 4.13 ± 0.691 | 7.898 ± 1.08 | 2.029 ± 0.467 | 2.464 ± 0.43 | 3.84 ± 0.653 | 2.971 ± 0.5 | 5.072 ± 0.747 | 4.71 ± 0.588 | 4.71 ± 0.726 | 5.579 ± 0.602 | 1.449 ± 0.294 | 2.464 ± 0.453 | 0.0 ± 0.0 |
| His | 1.522 ± 0.393 | 0.362 ± 0.151 | 1.232 ± 0.25 | 1.304 ± 0.288 | 0.87 ± 0.279 | 1.594 ± 0.346 | 0.652 ± 0.29 | 1.159 ± 0.364 | 0.797 ± 0.233 | 1.449 ± 0.248 | 0.652 ± 0.194 | 0.797 ± 0.177 | 1.522 ± 0.321 | 0.797 ± 0.234 | 1.377 ± 0.337 | 0.87 ± 0.246 | 1.159 ± 0.341 | 1.594 ± 0.421 | 0.435 ± 0.179 | 0.435 ± 0.187 | 0.0 ± 0.0 |
| Ile | 5.072 ± 0.764 | 0.435 ± 0.185 | 3.768 ± 0.567 | 4.348 ± 0.456 | 1.667 ± 0.315 | 2.826 ± 0.574 | 0.652 ± 0.205 | 1.811 ± 0.517 | 2.971 ± 0.502 | 2.753 ± 0.459 | 1.377 ± 0.334 | 2.319 ± 0.446 | 2.391 ± 0.453 | 1.739 ± 0.437 | 3.55 ± 0.547 | 2.681 ± 0.477 | 2.826 ± 0.441 | 3.406 ± 0.433 | 0.507 ± 0.255 | 1.377 ± 0.286 | 0.0 ± 0.0 |
| Lys | 4.42 ± 0.673 | 0.29 ± 0.181 | 2.536 ± 0.47 | 3.406 ± 0.504 | 1.884 ± 0.37 | 2.898 ± 0.525 | 1.087 ± 0.228 | 1.811 ± 0.375 | 2.898 ± 0.555 | 2.971 ± 0.58 | 1.087 ± 0.293 | 1.594 ± 0.333 | 3.478 ± 0.53 | 1.232 ± 0.315 | 3.768 ± 0.665 | 2.319 ± 0.462 | 2.753 ± 0.561 | 3.55 ± 0.407 | 1.159 ± 0.282 | 0.942 ± 0.208 | 0.0 ± 0.0 |
| Leu | 8.405 ± 0.948 | 0.507 ± 0.152 | 5.072 ± 0.633 | 6.087 ± 0.647 | 1.739 ± 0.379 | 5.072 ± 0.652 | 1.594 ± 0.376 | 3.406 ± 0.476 | 3.623 ± 0.651 | 4.71 ± 0.546 | 2.101 ± 0.468 | 2.464 ± 0.436 | 3.695 ± 0.608 | 2.753 ± 0.475 | 6.884 ± 0.812 | 4.927 ± 0.711 | 5.217 ± 0.687 | 4.637 ± 0.647 | 1.087 ± 0.307 | 1.956 ± 0.509 | 0.0 ± 0.0 |
| Met | 3.406 ± 0.524 | 0.362 ± 0.165 | 1.377 ± 0.34 | 2.319 ± 0.435 | 0.507 ± 0.209 | 2.391 ± 0.432 | 0.29 ± 0.158 | 1.811 ± 0.355 | 1.159 ± 0.353 | 2.319 ± 0.452 | 1.087 ± 0.373 | 0.942 ± 0.259 | 1.377 ± 0.353 | 0.725 ± 0.233 | 1.884 ± 0.412 | 1.449 ± 0.326 | 2.319 ± 0.372 | 1.811 ± 0.403 | 0.362 ± 0.144 | 0.507 ± 0.181 | 0.0 ± 0.0 |
| Asn | 3.623 ± 0.584 | 0.435 ± 0.154 | 2.101 ± 0.347 | 1.304 ± 0.363 | 1.667 ± 0.363 | 3.333 ± 0.477 | 0.507 ± 0.185 | 1.594 ± 0.36 | 1.087 ± 0.283 | 2.246 ± 0.449 | 0.58 ± 0.245 | 1.014 ± 0.283 | 2.464 ± 0.399 | 0.87 ± 0.305 | 3.116 ± 0.543 | 1.522 ± 0.31 | 1.594 ± 0.448 | 1.449 ± 0.273 | 0.942 ± 0.241 | 1.087 ± 0.306 | 0.0 ± 0.0 |
| Pro | 5.0 ± 0.799 | 0.435 ± 0.166 | 3.478 ± 0.421 | 3.695 ± 0.597 | 1.667 ± 0.395 | 4.348 ± 0.548 | 1.667 ± 0.336 | 2.681 ± 0.505 | 2.753 ± 0.498 | 3.043 ± 0.455 | 1.232 ± 0.29 | 1.594 ± 0.371 | 2.971 ± 0.43 | 1.667 ± 0.337 | 2.826 ± 0.51 | 2.971 ± 0.413 | 2.536 ± 0.454 | 3.188 ± 0.449 | 1.232 ± 0.247 | 1.884 ± 0.384 | 0.0 ± 0.0 |
| Gln | 4.782 ± 0.696 | 0.145 ± 0.102 | 1.811 ± 0.299 | 2.536 ± 0.523 | 1.232 ± 0.353 | 2.681 ± 0.392 | 0.87 ± 0.246 | 1.739 ± 0.402 | 2.101 ± 0.415 | 2.536 ± 0.437 | 1.159 ± 0.283 | 1.377 ± 0.289 | 2.029 ± 0.448 | 2.101 ± 0.435 | 2.174 ± 0.346 | 1.739 ± 0.399 | 2.319 ± 0.423 | 2.029 ± 0.323 | 1.159 ± 0.329 | 0.87 ± 0.302 | 0.0 ± 0.0 |
| Arg | 7.391 ± 1.35 | 0.652 ± 0.229 | 4.42 ± 0.51 | 5.0 ± 0.638 | 2.681 ± 0.453 | 4.71 ± 0.703 | 2.609 ± 0.539 | 4.203 ± 0.636 | 4.058 ± 0.538 | 5.0 ± 0.917 | 2.101 ± 0.34 | 2.609 ± 0.431 | 3.985 ± 0.653 | 2.898 ± 0.509 | 5.145 ± 0.574 | 4.13 ± 0.447 | 3.84 ± 0.587 | 4.71 ± 0.572 | 1.159 ± 0.305 | 1.884 ± 0.396 | 0.0 ± 0.0 |
| Ser | 5.072 ± 0.574 | 0.58 ± 0.193 | 3.406 ± 0.404 | 3.985 ± 0.597 | 2.391 ± 0.481 | 4.637 ± 0.689 | 1.014 ± 0.256 | 2.898 ± 0.42 | 2.319 ± 0.389 | 5.0 ± 0.505 | 1.304 ± 0.356 | 1.739 ± 0.338 | 3.188 ± 0.601 | 2.029 ± 0.417 | 4.71 ± 0.721 | 3.55 ± 0.602 | 2.971 ± 0.526 | 3.406 ± 0.551 | 0.797 ± 0.276 | 1.884 ± 0.307 | 0.0 ± 0.0 |
| Thr | 5.0 ± 0.664 | 0.435 ± 0.165 | 3.116 ± 0.486 | 3.84 ± 0.437 | 2.101 ± 0.392 | 6.449 ± 0.712 | 0.725 ± 0.263 | 3.623 ± 0.519 | 2.174 ± 0.341 | 3.768 ± 0.54 | 1.377 ± 0.302 | 1.594 ± 0.342 | 2.971 ± 0.454 | 1.884 ± 0.314 | 3.84 ± 0.545 | 2.536 ± 0.543 | 3.84 ± 0.515 | 3.768 ± 0.549 | 0.652 ± 0.204 | 1.811 ± 0.415 | 0.0 ± 0.0 |
| Val | 7.028 ± 1.005 | 0.362 ± 0.162 | 4.42 ± 0.482 | 5.724 ± 0.785 | 2.029 ± 0.436 | 5.724 ± 0.678 | 1.232 ± 0.234 | 2.536 ± 0.417 | 2.898 ± 0.471 | 5.507 ± 0.562 | 1.304 ± 0.456 | 2.174 ± 0.395 | 2.464 ± 0.649 | 1.739 ± 0.362 | 4.058 ± 0.523 | 4.203 ± 0.648 | 3.985 ± 0.664 | 3.406 ± 0.586 | 1.159 ± 0.366 | 2.319 ± 0.382 | 0.0 ± 0.0 |
| Trp | 1.667 ± 0.347 | 0.217 ± 0.12 | 1.014 ± 0.238 | 0.87 ± 0.317 | 0.797 ± 0.228 | 1.449 ± 0.378 | 0.362 ± 0.155 | 1.014 ± 0.268 | 0.87 ± 0.27 | 1.884 ± 0.385 | 0.507 ± 0.2 | 0.725 ± 0.242 | 0.797 ± 0.188 | 0.87 ± 0.283 | 1.014 ± 0.319 | 1.014 ± 0.246 | 0.58 ± 0.233 | 1.377 ± 0.341 | 0.0 ± 0.0 | 0.435 ± 0.159 | 0.0 ± 0.0 |
| Tyr | 2.246 ± 0.413 | 0.145 ± 0.103 | 2.464 ± 0.478 | 1.956 ± 0.425 | 0.507 ± 0.163 | 3.333 ± 0.505 | 0.435 ± 0.163 | 1.014 ± 0.29 | 1.087 ± 0.365 | 2.174 ± 0.315 | 0.87 ± 0.232 | 0.797 ± 0.196 | 1.522 ± 0.432 | 1.087 ± 0.272 | 2.101 ± 0.319 | 1.956 ± 0.28 | 1.667 ± 0.331 | 1.739 ± 0.399 | 0.435 ± 0.182 | 0.652 ± 0.248 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 52 proteins (13802 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
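A minimal sketch of a protein-level bootstrap is given below. It assumes that each replicate draws whole proteins with replacement (as many as the proteome contains) and that the reported error is the standard deviation of the frequency across replicates; the page states only that bootstrapping with 100 replicates at the protein level was used, so both points, as well as the function names and the fixed seed, are assumptions.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        count += sum(1 for i in range(n_pairs) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the dipeptide frequency across bootstrap replicates.

    Assumption: whole proteins are resampled with replacement, so residues
    within a protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter codes, not real data.
toy = ["MAAK", "AAG", "GAAL", "MKV"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the within-protein composition intact, which is presumably why the error is described as being estimated at the protein level.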

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski