Amino acid dipeptide frequency for Dishui Lake virophage 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
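A minimal sketch of how such dipeptide frequencies could be counted is given below. It is an illustration under stated assumptions, not the database's actual pipeline: the function name `dipeptide_per_mille`, the toy sequences, the use of overlapping pairs that never span two proteins, and the normalisation by the total number of pairs are all assumptions. It uses one-letter codes, with `X` standing in for Xaa.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is mapped to 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 along the sequence.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy proteome; 'U' is non-standard and is counted as Xaa.
toy_proteins = ["MKKLA", "MKUA"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

With the toy input there are seven pairs in total, two of which are `MK`, so `toy_freqs["MK"]` is 2/7 expressed in per mille.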

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
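The arithmetic behind the expected value can be sketched as follows. This is only an illustration: the helper names are hypothetical, and treating the uniform expectation as the exact threshold between over- and underrepresented dipeptides is an assumption, not a confirmed description of how the table is coloured.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_per_mille, expected_per_mille=None):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if expected_per_mille is None:
        expected_per_mille = expected_uniform_per_mille()
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected_20 = expected_uniform_per_mille(20)  # 400 possible dipeptides
expected_21 = expected_uniform_per_mille(21)  # 441 possible dipeptides with Xaa

# Two values taken from the table: GlyGly (10.306) and CysHis (0.0).
label_glygly = classify(10.306)
label_cyshis = classify(0.0)
```

Under these assumptions GlyGly falls above the 2.5‰ expectation and CysHis below it.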

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.178 ± 0.566
AlaCys: 0.294 ± 0.175
AlaAsp: 3.975 ± 0.953
AlaGlu: 3.239 ± 0.677
AlaPhe: 1.767 ± 0.4
AlaGly: 5.595 ± 1.55
AlaHis: 0.589 ± 0.22
AlaIle: 3.975 ± 0.735
AlaLys: 3.975 ± 0.972
AlaLeu: 4.27 ± 0.458
AlaMet: 2.061 ± 0.594
AlaAsn: 3.681 ± 1.078
AlaPro: 3.534 ± 0.992
AlaGln: 3.092 ± 1.131
AlaArg: 1.767 ± 0.559
AlaSer: 3.092 ± 0.825
AlaThr: 6.331 ± 1.348
AlaVal: 2.503 ± 0.704
AlaTrp: 0.736 ± 0.376
AlaTyr: 1.031 ± 0.315
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.147 ± 0.128
CysCys: 0.589 ± 0.236
CysAsp: 0.883 ± 0.292
CysGlu: 0.294 ± 0.175
CysPhe: 0.294 ± 0.207
CysGly: 0.883 ± 0.311
CysHis: 0.0 ± 0.0
CysIle: 0.589 ± 0.303
CysLys: 1.031 ± 0.287
CysLeu: 0.736 ± 0.294
CysMet: 0.0 ± 0.0
CysAsn: 1.472 ± 0.372
CysPro: 0.442 ± 0.228
CysGln: 0.0 ± 0.0
CysArg: 0.589 ± 0.278
CysSer: 0.589 ± 0.211
CysThr: 0.0 ± 0.0
CysVal: 0.589 ± 0.35
CysTrp: 0.0 ± 0.0
CysTyr: 0.589 ± 0.207
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.65 ± 0.632
AspCys: 0.442 ± 0.27
AspAsp: 3.239 ± 0.753
AspGlu: 3.534 ± 0.697
AspPhe: 3.828 ± 0.716
AspGly: 3.239 ± 0.628
AspHis: 0.294 ± 0.207
AspIle: 5.153 ± 1.196
AspLys: 6.037 ± 1.614
AspLeu: 7.067 ± 0.706
AspMet: 1.325 ± 0.49
AspAsn: 3.534 ± 0.9
AspPro: 2.945 ± 0.564
AspGln: 2.061 ± 0.715
AspArg: 2.208 ± 0.531
AspSer: 2.061 ± 0.703
AspThr: 4.417 ± 0.8
AspVal: 3.092 ± 0.728
AspTrp: 1.031 ± 0.473
AspTyr: 3.828 ± 0.617
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.356 ± 0.444
GluCys: 0.589 ± 0.269
GluAsp: 3.534 ± 0.71
GluGlu: 2.797 ± 0.774
GluPhe: 2.061 ± 0.616
GluGly: 2.945 ± 0.636
GluHis: 0.442 ± 0.271
GluIle: 4.27 ± 0.784
GluLys: 6.773 ± 1.451
GluLeu: 4.122 ± 0.51
GluMet: 1.914 ± 0.512
GluAsn: 3.681 ± 0.918
GluPro: 1.62 ± 0.453
GluGln: 2.503 ± 0.514
GluArg: 2.797 ± 0.699
GluSer: 2.061 ± 0.368
GluThr: 2.945 ± 0.674
GluVal: 2.797 ± 0.746
GluTrp: 1.472 ± 0.39
GluTyr: 3.239 ± 0.588
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.914 ± 0.53
PheCys: 0.442 ± 0.271
PheAsp: 2.061 ± 0.368
PheGlu: 1.914 ± 0.399
PhePhe: 2.061 ± 0.529
PheGly: 1.472 ± 0.585
PheHis: 0.147 ± 0.151
PheIle: 2.503 ± 0.58
PheLys: 3.092 ± 0.726
PheLeu: 2.65 ± 0.666
PheMet: 0.736 ± 0.225
PheAsn: 3.828 ± 0.71
PhePro: 1.472 ± 0.494
PheGln: 2.945 ± 0.944
PheArg: 1.178 ± 0.278
PheSer: 3.386 ± 0.824
PheThr: 2.65 ± 0.562
PheVal: 2.503 ± 0.705
PheTrp: 0.736 ± 0.422
PheTyr: 0.736 ± 0.31
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.417 ± 0.899
GlyCys: 0.883 ± 0.335
GlyAsp: 5.595 ± 1.224
GlyGlu: 3.828 ± 0.661
GlyPhe: 1.62 ± 0.415
GlyGly: 10.306 ± 3.166
GlyHis: 0.736 ± 0.321
GlyIle: 3.828 ± 0.719
GlyLys: 5.006 ± 1.195
GlyLeu: 6.478 ± 1.481
GlyMet: 3.681 ± 1.635
GlyAsn: 3.534 ± 0.936
GlyPro: 0.736 ± 0.396
GlyGln: 2.503 ± 1.06
GlyArg: 3.239 ± 0.819
GlySer: 4.417 ± 0.759
GlyThr: 4.27 ± 0.976
GlyVal: 4.711 ± 0.67
GlyTrp: 0.589 ± 0.275
GlyTyr: 2.356 ± 0.619
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.442 ± 0.269
HisCys: 0.0 ± 0.0
HisAsp: 0.294 ± 0.235
HisGlu: 0.589 ± 0.409
HisPhe: 0.294 ± 0.201
HisGly: 0.589 ± 0.408
HisHis: 0.294 ± 0.207
HisIle: 0.442 ± 0.274
HisLys: 0.589 ± 0.376
HisLeu: 0.736 ± 0.284
HisMet: 0.736 ± 0.519
HisAsn: 0.736 ± 0.358
HisPro: 0.883 ± 0.283
HisGln: 0.294 ± 0.18
HisArg: 0.294 ± 0.188
HisSer: 1.031 ± 0.342
HisThr: 1.325 ± 0.306
HisVal: 0.736 ± 0.236
HisTrp: 0.294 ± 0.253
HisTyr: 0.294 ± 0.159
HisXaa: 0.147 ± 0.127
Ile
IleAla: 2.797 ± 0.602
IleCys: 0.883 ± 0.314
IleAsp: 5.3 ± 1.004
IleGlu: 3.534 ± 0.644
IlePhe: 2.356 ± 0.588
IleGly: 4.122 ± 0.609
IleHis: 0.442 ± 0.213
IleIle: 4.859 ± 1.091
IleLys: 5.595 ± 1.109
IleLeu: 5.595 ± 0.633
IleMet: 1.767 ± 0.585
IleAsn: 5.153 ± 1.142
IlePro: 3.534 ± 0.651
IleGln: 3.681 ± 0.854
IleArg: 1.914 ± 0.525
IleSer: 4.564 ± 1.021
IleThr: 6.478 ± 0.73
IleVal: 3.828 ± 0.673
IleTrp: 0.736 ± 0.306
IleTyr: 4.564 ± 0.998
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.006 ± 0.944
LysCys: 0.442 ± 0.271
LysAsp: 5.448 ± 0.992
LysGlu: 7.951 ± 1.462
LysPhe: 2.65 ± 0.809
LysGly: 6.037 ± 0.944
LysHis: 0.589 ± 0.23
LysIle: 6.625 ± 1.183
LysLys: 6.625 ± 1.711
LysLeu: 5.448 ± 1.165
LysMet: 2.061 ± 0.414
LysAsn: 5.006 ± 0.962
LysPro: 2.65 ± 0.814
LysGln: 3.239 ± 0.815
LysArg: 3.386 ± 0.785
LysSer: 3.681 ± 0.62
LysThr: 3.239 ± 0.513
LysVal: 4.711 ± 0.841
LysTrp: 0.589 ± 0.3
LysTyr: 4.564 ± 1.273
LysXaa: 0.147 ± 0.127
Leu
LeuAla: 3.975 ± 0.813
LeuCys: 1.031 ± 0.45
LeuAsp: 5.3 ± 0.83
LeuGlu: 3.239 ± 0.66
LeuPhe: 1.767 ± 0.443
LeuGly: 6.037 ± 1.359
LeuHis: 0.589 ± 0.317
LeuIle: 4.27 ± 0.761
LeuLys: 8.392 ± 1.61
LeuLeu: 5.889 ± 0.656
LeuMet: 1.472 ± 0.402
LeuAsn: 6.037 ± 0.905
LeuPro: 5.448 ± 0.853
LeuGln: 3.681 ± 0.627
LeuArg: 2.945 ± 0.571
LeuSer: 6.331 ± 1.318
LeuThr: 5.153 ± 0.551
LeuVal: 4.27 ± 0.781
LeuTrp: 0.589 ± 0.217
LeuTyr: 4.122 ± 0.774
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.914 ± 0.632
MetCys: 0.294 ± 0.349
MetAsp: 0.736 ± 0.34
MetGlu: 0.589 ± 0.322
MetPhe: 0.883 ± 0.276
MetGly: 1.325 ± 0.416
MetHis: 0.0 ± 0.0
MetIle: 1.178 ± 0.472
MetLys: 1.325 ± 0.406
MetLeu: 2.208 ± 0.528
MetMet: 0.147 ± 0.141
MetAsn: 2.356 ± 0.602
MetPro: 1.62 ± 0.475
MetGln: 0.0 ± 0.0
MetArg: 1.325 ± 0.487
MetSer: 3.534 ± 1.269
MetThr: 1.767 ± 0.384
MetVal: 0.736 ± 0.275
MetTrp: 0.0 ± 0.0
MetTyr: 0.736 ± 0.337
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.534 ± 0.822
AsnCys: 0.442 ± 0.286
AsnAsp: 4.27 ± 0.802
AsnGlu: 5.006 ± 1.252
AsnPhe: 3.092 ± 0.586
AsnGly: 4.27 ± 0.777
AsnHis: 1.178 ± 0.266
AsnIle: 7.214 ± 1.337
AsnLys: 4.417 ± 1.129
AsnLeu: 5.153 ± 0.834
AsnMet: 1.472 ± 0.461
AsnAsn: 7.656 ± 1.82
AsnPro: 5.3 ± 0.694
AsnGln: 3.092 ± 0.852
AsnArg: 1.031 ± 0.328
AsnSer: 3.681 ± 0.987
AsnThr: 5.742 ± 1.132
AsnVal: 3.386 ± 0.897
AsnTrp: 0.442 ± 0.236
AsnTyr: 2.797 ± 0.716
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.975 ± 1.476
ProCys: 0.294 ± 0.207
ProAsp: 2.356 ± 0.786
ProGlu: 2.65 ± 0.787
ProPhe: 1.914 ± 0.411
ProGly: 0.147 ± 0.127
ProHis: 0.883 ± 0.31
ProIle: 3.975 ± 0.652
ProLys: 3.092 ± 0.764
ProLeu: 5.153 ± 0.746
ProMet: 0.736 ± 0.219
ProAsn: 2.797 ± 0.656
ProPro: 2.65 ± 0.867
ProGln: 3.681 ± 1.826
ProArg: 1.178 ± 0.285
ProSer: 3.828 ± 0.616
ProThr: 4.122 ± 0.902
ProVal: 3.681 ± 0.703
ProTrp: 0.0 ± 0.0
ProTyr: 1.767 ± 0.551
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.239 ± 0.949
GlnCys: 0.147 ± 0.128
GlnAsp: 2.945 ± 0.575
GlnGlu: 1.472 ± 0.419
GlnPhe: 0.442 ± 0.266
GlnGly: 3.386 ± 0.88
GlnHis: 0.883 ± 0.274
GlnIle: 2.797 ± 0.865
GlnLys: 2.356 ± 0.446
GlnLeu: 3.828 ± 0.799
GlnMet: 0.736 ± 0.32
GlnAsn: 3.681 ± 0.927
GlnPro: 4.122 ± 1.473
GlnGln: 3.534 ± 1.278
GlnArg: 1.914 ± 0.466
GlnSer: 3.681 ± 0.876
GlnThr: 2.356 ± 0.85
GlnVal: 1.178 ± 0.49
GlnTrp: 0.147 ± 0.146
GlnTyr: 1.325 ± 0.39
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.65 ± 0.505
ArgCys: 0.147 ± 0.128
ArgAsp: 1.914 ± 0.637
ArgGlu: 1.472 ± 0.476
ArgPhe: 1.62 ± 0.423
ArgGly: 2.503 ± 0.58
ArgHis: 1.178 ± 0.351
ArgIle: 2.797 ± 0.419
ArgLys: 2.503 ± 0.58
ArgLeu: 3.534 ± 0.973
ArgMet: 0.883 ± 0.327
ArgAsn: 3.239 ± 0.495
ArgPro: 1.325 ± 0.371
ArgGln: 1.472 ± 0.362
ArgArg: 1.62 ± 0.625
ArgSer: 1.325 ± 0.259
ArgThr: 1.62 ± 0.396
ArgVal: 1.472 ± 0.429
ArgTrp: 0.589 ± 0.304
ArgTyr: 1.767 ± 0.564
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.975 ± 1.65
SerCys: 0.294 ± 0.213
SerAsp: 3.681 ± 0.723
SerGlu: 2.208 ± 0.645
SerPhe: 3.239 ± 0.451
SerGly: 7.509 ± 2.56
SerHis: 1.031 ± 0.451
SerIle: 4.564 ± 0.933
SerLys: 6.478 ± 1.73
SerLeu: 4.711 ± 0.564
SerMet: 1.031 ± 0.606
SerAsn: 3.534 ± 0.924
SerPro: 2.356 ± 0.816
SerGln: 1.914 ± 0.63
SerArg: 2.061 ± 0.496
SerSer: 5.006 ± 1.029
SerThr: 2.797 ± 0.598
SerVal: 6.184 ± 0.962
SerTrp: 0.736 ± 0.335
SerTyr: 2.503 ± 0.647
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.037 ± 1.39
ThrCys: 1.62 ± 0.566
ThrAsp: 3.975 ± 0.785
ThrGlu: 3.534 ± 0.701
ThrPhe: 3.092 ± 0.61
ThrGly: 6.037 ± 1.308
ThrHis: 0.589 ± 0.278
ThrIle: 4.27 ± 0.922
ThrLys: 2.945 ± 0.784
ThrLeu: 3.975 ± 0.605
ThrMet: 0.736 ± 0.261
ThrAsn: 3.828 ± 1.042
ThrPro: 3.681 ± 0.78
ThrGln: 2.65 ± 0.658
ThrArg: 1.472 ± 0.289
ThrSer: 3.975 ± 0.676
ThrThr: 5.006 ± 1.148
ThrVal: 0.294 ± 0.254
ThrTrp: 0.883 ± 0.307
ThrTyr: 3.828 ± 0.694
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.239 ± 0.7
ValCys: 0.442 ± 0.187
ValAsp: 2.503 ± 0.515
ValGlu: 3.975 ± 0.95
ValPhe: 2.65 ± 0.732
ValGly: 3.092 ± 0.76
ValHis: 0.442 ± 0.241
ValIle: 3.828 ± 0.943
ValLys: 3.681 ± 0.564
ValLeu: 4.711 ± 0.899
ValMet: 0.294 ± 0.183
ValAsn: 4.122 ± 0.798
ValPro: 2.797 ± 0.818
ValGln: 2.208 ± 0.37
ValArg: 2.503 ± 0.751
ValSer: 4.859 ± 0.687
ValThr: 0.147 ± 0.129
ValVal: 2.65 ± 0.655
ValTrp: 0.442 ± 0.188
ValTyr: 3.239 ± 0.676
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.442 ± 0.188
TrpCys: 0.294 ± 0.196
TrpAsp: 0.294 ± 0.18
TrpGlu: 0.883 ± 0.369
TrpPhe: 0.589 ± 0.229
TrpGly: 0.442 ± 0.248
TrpHis: 0.0 ± 0.0
TrpIle: 1.031 ± 0.382
TrpLys: 1.472 ± 0.369
TrpLeu: 0.589 ± 0.289
TrpMet: 0.147 ± 0.152
TrpAsn: 1.325 ± 0.322
TrpPro: 0.0 ± 0.0
TrpGln: 0.147 ± 0.128
TrpArg: 0.294 ± 0.159
TrpSer: 0.736 ± 0.426
TrpThr: 0.294 ± 0.177
TrpVal: 0.736 ± 0.229
TrpTrp: 0.442 ± 0.229
TrpTyr: 0.589 ± 0.314
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.65 ± 0.573
TyrCys: 0.147 ± 0.129
TyrAsp: 3.681 ± 0.737
TyrGlu: 2.208 ± 0.572
TyrPhe: 2.208 ± 0.706
TyrGly: 2.797 ± 0.532
TyrHis: 0.736 ± 0.262
TyrIle: 3.534 ± 0.68
TyrLys: 4.564 ± 0.76
TyrLeu: 3.681 ± 0.723
TyrMet: 0.589 ± 0.223
TyrAsn: 3.975 ± 0.746
TyrPro: 1.767 ± 0.469
TyrGln: 1.62 ± 0.392
TyrArg: 1.914 ± 0.443
TyrSer: 3.975 ± 0.675
TyrThr: 1.767 ± 0.568
TyrVal: 1.767 ± 0.432
TyrTrp: 0.294 ± 0.24
TyrTyr: 2.208 ± 0.632
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.147 ± 0.127
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.147 ± 0.127
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.442 ± 0.381
Statistics based on 16 proteins (6793 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
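Protein-level bootstrapping of this kind can be sketched as follows. This is a hedged illustration rather than the database's own code: the function name `bootstrap_error`, the fixed seed, the toy sequences, and the choice of the sample standard deviation as the reported error are all assumptions. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error.

```python
import random
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation over n_rounds resamples,
    each drawing len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        hits = total = 0
        for seq in sample:
            # Overlapping adjacent pairs within one protein.
            pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
            hits += pairs.count(dipeptide)
            total += len(pairs)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteomes in one-letter code.
identical = ["MKKLA"] * 4              # every resample gives the same frequency
varied = ["MKKKK", "AAAAA", "MKAKA"]   # resamples differ in KK content

error_identical = bootstrap_error(identical, "KK")
error_varied = bootstrap_error(varied, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.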

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski