Amino acid dipepetide frequency for Phaeocystis globosa virus virophage

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.513AlaAla: 8.513 ± 2.22
0.378AlaCys: 0.378 ± 0.294
3.027AlaAsp: 3.027 ± 1.033
5.675AlaGlu: 5.675 ± 1.525
3.216AlaPhe: 3.216 ± 0.693
6.243AlaGly: 6.243 ± 1.882
0.568AlaHis: 0.568 ± 0.4
4.729AlaIle: 4.729 ± 0.743
4.54AlaLys: 4.54 ± 0.813
3.594AlaLeu: 3.594 ± 0.972
1.703AlaMet: 1.703 ± 0.567
4.351AlaAsn: 4.351 ± 1.042
2.838AlaPro: 2.838 ± 0.592
2.27AlaGln: 2.27 ± 0.582
2.838AlaArg: 2.838 ± 0.675
3.216AlaSer: 3.216 ± 0.964
6.621AlaThr: 6.621 ± 1.613
4.54AlaVal: 4.54 ± 0.999
0.378AlaTrp: 0.378 ± 0.259
2.081AlaTyr: 2.081 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
0.189CysAla: 0.189 ± 0.167
0.0CysCys: 0.0 ± 0.0
0.568CysAsp: 0.568 ± 0.249
0.757CysGlu: 0.757 ± 0.233
0.378CysPhe: 0.378 ± 0.222
1.135CysGly: 1.135 ± 0.64
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.568CysLys: 0.568 ± 0.274
2.081CysLeu: 2.081 ± 0.621
0.0CysMet: 0.0 ± 0.0
0.189CysAsn: 0.189 ± 0.187
0.378CysPro: 0.378 ± 0.25
0.0CysGln: 0.0 ± 0.0
0.757CysArg: 0.757 ± 0.291
0.946CysSer: 0.946 ± 0.431
0.0CysThr: 0.0 ± 0.0
0.757CysVal: 0.757 ± 0.365
0.0CysTrp: 0.0 ± 0.0
0.378CysTyr: 0.378 ± 0.297
0.0CysXaa: 0.0 ± 0.0
Asp
2.459AspAla: 2.459 ± 0.97
0.568AspCys: 0.568 ± 0.309
4.351AspAsp: 4.351 ± 0.866
4.919AspGlu: 4.919 ± 1.712
2.27AspPhe: 2.27 ± 0.691
4.162AspGly: 4.162 ± 1.036
0.189AspHis: 0.189 ± 0.183
4.54AspIle: 4.54 ± 0.953
5.486AspLys: 5.486 ± 1.431
6.054AspLeu: 6.054 ± 1.004
1.513AspMet: 1.513 ± 0.574
2.649AspAsn: 2.649 ± 0.714
1.513AspPro: 1.513 ± 0.532
0.568AspGln: 0.568 ± 0.255
2.081AspArg: 2.081 ± 0.446
2.459AspSer: 2.459 ± 0.515
3.405AspThr: 3.405 ± 0.71
3.784AspVal: 3.784 ± 1.185
0.189AspTrp: 0.189 ± 0.193
3.784AspTyr: 3.784 ± 0.709
0.0AspXaa: 0.0 ± 0.0
Glu
5.675GluAla: 5.675 ± 1.419
0.946GluCys: 0.946 ± 0.413
3.784GluAsp: 3.784 ± 1.003
7.756GluGlu: 7.756 ± 2.307
3.594GluPhe: 3.594 ± 0.883
3.594GluGly: 3.594 ± 0.674
1.324GluHis: 1.324 ± 0.511
6.243GluIle: 6.243 ± 1.017
4.351GluLys: 4.351 ± 1.177
5.675GluLeu: 5.675 ± 0.729
1.892GluMet: 1.892 ± 0.9
4.54GluAsn: 4.54 ± 0.827
2.27GluPro: 2.27 ± 1.109
1.892GluGln: 1.892 ± 0.767
3.405GluArg: 3.405 ± 0.707
1.703GluSer: 1.703 ± 0.573
4.54GluThr: 4.54 ± 0.916
6.621GluVal: 6.621 ± 1.091
0.757GluTrp: 0.757 ± 0.33
3.405GluTyr: 3.405 ± 1.139
0.0GluXaa: 0.0 ± 0.0
Phe
3.216PheAla: 3.216 ± 0.847
0.757PheCys: 0.757 ± 0.333
2.081PheAsp: 2.081 ± 0.394
1.703PheGlu: 1.703 ± 0.521
0.568PhePhe: 0.568 ± 0.249
2.838PheGly: 2.838 ± 1.186
0.568PheHis: 0.568 ± 0.386
2.27PheIle: 2.27 ± 0.688
3.784PheLys: 3.784 ± 0.735
2.838PheLeu: 2.838 ± 0.975
1.703PheMet: 1.703 ± 0.503
2.649PheAsn: 2.649 ± 0.625
1.324PhePro: 1.324 ± 0.543
1.892PheGln: 1.892 ± 0.455
1.703PheArg: 1.703 ± 0.452
3.027PheSer: 3.027 ± 0.563
2.081PheThr: 2.081 ± 0.675
3.027PheVal: 3.027 ± 0.801
0.0PheTrp: 0.0 ± 0.0
0.189PheTyr: 0.189 ± 0.233
0.0PheXaa: 0.0 ± 0.0
Gly
3.405GlyAla: 3.405 ± 1.164
0.757GlyCys: 0.757 ± 0.298
4.729GlyAsp: 4.729 ± 1.203
3.973GlyGlu: 3.973 ± 0.903
3.027GlyPhe: 3.027 ± 0.954
5.486GlyGly: 5.486 ± 1.113
1.513GlyHis: 1.513 ± 0.424
5.297GlyIle: 5.297 ± 1.898
4.351GlyLys: 4.351 ± 0.743
5.108GlyLeu: 5.108 ± 1.06
1.703GlyMet: 1.703 ± 0.267
4.162GlyAsn: 4.162 ± 1.65
0.189GlyPro: 0.189 ± 0.188
1.324GlyGln: 1.324 ± 0.593
3.027GlyArg: 3.027 ± 0.575
5.486GlySer: 5.486 ± 0.988
5.486GlyThr: 5.486 ± 1.86
4.919GlyVal: 4.919 ± 0.999
0.757GlyTrp: 0.757 ± 0.371
1.892GlyTyr: 1.892 ± 0.472
0.0GlyXaa: 0.0 ± 0.0
His
0.946HisAla: 0.946 ± 0.482
0.0HisCys: 0.0 ± 0.0
0.189HisAsp: 0.189 ± 0.167
1.135HisGlu: 1.135 ± 0.442
0.946HisPhe: 0.946 ± 0.644
1.513HisGly: 1.513 ± 0.541
0.568HisHis: 0.568 ± 0.242
0.568HisIle: 0.568 ± 0.225
1.703HisLys: 1.703 ± 0.364
1.324HisLeu: 1.324 ± 0.504
0.946HisMet: 0.946 ± 0.297
1.135HisAsn: 1.135 ± 0.319
0.757HisPro: 0.757 ± 0.314
0.378HisGln: 0.378 ± 0.218
0.568HisArg: 0.568 ± 0.223
1.892HisSer: 1.892 ± 0.43
1.892HisThr: 1.892 ± 0.537
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.946HisTyr: 0.946 ± 0.42
0.0HisXaa: 0.0 ± 0.0
Ile
4.729IleAla: 4.729 ± 0.805
0.189IleCys: 0.189 ± 0.227
4.919IleAsp: 4.919 ± 0.904
4.54IleGlu: 4.54 ± 1.214
1.513IlePhe: 1.513 ± 0.591
5.108IleGly: 5.108 ± 2.258
1.703IleHis: 1.703 ± 0.614
3.973IleIle: 3.973 ± 0.872
5.865IleLys: 5.865 ± 1.099
3.784IleLeu: 3.784 ± 0.667
0.946IleMet: 0.946 ± 0.462
4.919IleAsn: 4.919 ± 0.936
1.324IlePro: 1.324 ± 0.541
2.459IleGln: 2.459 ± 0.59
3.216IleArg: 3.216 ± 0.99
5.675IleSer: 5.675 ± 1.08
6.81IleThr: 6.81 ± 1.05
2.459IleVal: 2.459 ± 1.316
0.757IleTrp: 0.757 ± 0.402
2.27IleTyr: 2.27 ± 0.813
0.0IleXaa: 0.0 ± 0.0
Lys
4.729LysAla: 4.729 ± 0.66
0.757LysCys: 0.757 ± 0.398
4.351LysAsp: 4.351 ± 0.941
7.378LysGlu: 7.378 ± 1.682
3.594LysPhe: 3.594 ± 1.016
3.784LysGly: 3.784 ± 0.938
2.081LysHis: 2.081 ± 0.479
5.108LysIle: 5.108 ± 1.271
10.026LysLys: 10.026 ± 2.734
5.675LysLeu: 5.675 ± 1.037
1.513LysMet: 1.513 ± 0.485
5.486LysAsn: 5.486 ± 2.022
2.459LysPro: 2.459 ± 0.802
3.027LysGln: 3.027 ± 0.713
3.216LysArg: 3.216 ± 1.043
5.865LysSer: 5.865 ± 1.613
6.81LysThr: 6.81 ± 0.934
3.405LysVal: 3.405 ± 0.788
0.378LysTrp: 0.378 ± 0.249
3.216LysTyr: 3.216 ± 0.847
0.0LysXaa: 0.0 ± 0.0
Leu
7.378LeuAla: 7.378 ± 1.063
0.568LeuCys: 0.568 ± 0.359
3.594LeuAsp: 3.594 ± 0.985
5.108LeuGlu: 5.108 ± 1.072
1.324LeuPhe: 1.324 ± 0.383
6.054LeuGly: 6.054 ± 1.085
1.135LeuHis: 1.135 ± 0.81
3.784LeuIle: 3.784 ± 1.022
6.243LeuLys: 6.243 ± 1.211
5.675LeuLeu: 5.675 ± 0.71
2.081LeuMet: 2.081 ± 0.747
5.865LeuAsn: 5.865 ± 1.388
3.594LeuPro: 3.594 ± 1.148
3.594LeuGln: 3.594 ± 0.934
2.649LeuArg: 2.649 ± 0.565
6.621LeuSer: 6.621 ± 0.957
5.486LeuThr: 5.486 ± 0.854
2.649LeuVal: 2.649 ± 0.637
1.135LeuTrp: 1.135 ± 0.411
1.513LeuTyr: 1.513 ± 0.42
0.0LeuXaa: 0.0 ± 0.0
Met
1.324MetAla: 1.324 ± 0.456
0.189MetCys: 0.189 ± 0.163
0.757MetAsp: 0.757 ± 0.427
0.946MetGlu: 0.946 ± 0.346
1.135MetPhe: 1.135 ± 0.353
1.135MetGly: 1.135 ± 0.31
0.0MetHis: 0.0 ± 0.0
1.703MetIle: 1.703 ± 0.64
2.649MetLys: 2.649 ± 0.638
1.513MetLeu: 1.513 ± 0.469
0.757MetMet: 0.757 ± 0.317
2.459MetAsn: 2.459 ± 0.588
1.135MetPro: 1.135 ± 0.548
0.0MetGln: 0.0 ± 0.0
0.757MetArg: 0.757 ± 0.307
1.324MetSer: 1.324 ± 0.432
2.081MetThr: 2.081 ± 0.491
1.135MetVal: 1.135 ± 0.475
0.568MetTrp: 0.568 ± 0.502
1.135MetTyr: 1.135 ± 0.551
0.0MetXaa: 0.0 ± 0.0
Asn
3.784AsnAla: 3.784 ± 0.795
0.757AsnCys: 0.757 ± 0.346
3.594AsnAsp: 3.594 ± 0.823
6.432AsnGlu: 6.432 ± 1.425
2.649AsnPhe: 2.649 ± 0.654
3.973AsnGly: 3.973 ± 0.948
1.703AsnHis: 1.703 ± 0.68
6.432AsnIle: 6.432 ± 1.011
4.351AsnLys: 4.351 ± 0.798
5.297AsnLeu: 5.297 ± 0.604
1.703AsnMet: 1.703 ± 0.696
4.54AsnAsn: 4.54 ± 0.928
2.649AsnPro: 2.649 ± 0.915
1.513AsnGln: 1.513 ± 0.589
3.216AsnArg: 3.216 ± 1.117
4.919AsnSer: 4.919 ± 1.234
4.919AsnThr: 4.919 ± 0.736
3.784AsnVal: 3.784 ± 0.723
0.189AsnTrp: 0.189 ± 0.163
4.351AsnTyr: 4.351 ± 1.423
0.0AsnXaa: 0.0 ± 0.0
Pro
0.946ProAla: 0.946 ± 0.384
0.757ProCys: 0.757 ± 0.319
1.892ProAsp: 1.892 ± 0.781
3.027ProGlu: 3.027 ± 1.048
1.135ProPhe: 1.135 ± 0.485
0.378ProGly: 0.378 ± 0.284
0.189ProHis: 0.189 ± 0.227
1.513ProIle: 1.513 ± 0.655
3.027ProLys: 3.027 ± 0.681
2.459ProLeu: 2.459 ± 0.898
0.378ProMet: 0.378 ± 0.235
2.081ProAsn: 2.081 ± 0.684
2.27ProPro: 2.27 ± 1.477
1.513ProGln: 1.513 ± 0.49
2.081ProArg: 2.081 ± 0.731
3.027ProSer: 3.027 ± 1.048
2.459ProThr: 2.459 ± 0.714
3.405ProVal: 3.405 ± 0.999
0.0ProTrp: 0.0 ± 0.0
0.946ProTyr: 0.946 ± 0.446
0.0ProXaa: 0.0 ± 0.0
Gln
1.892GlnAla: 1.892 ± 0.636
0.0GlnCys: 0.0 ± 0.0
2.27GlnAsp: 2.27 ± 0.495
3.027GlnGlu: 3.027 ± 1.067
0.946GlnPhe: 0.946 ± 0.446
2.27GlnGly: 2.27 ± 0.589
1.135GlnHis: 1.135 ± 0.376
2.081GlnIle: 2.081 ± 0.584
2.459GlnLys: 2.459 ± 0.831
1.513GlnLeu: 1.513 ± 0.283
0.757GlnMet: 0.757 ± 0.43
2.27GlnAsn: 2.27 ± 0.541
0.568GlnPro: 0.568 ± 0.255
1.703GlnGln: 1.703 ± 0.748
1.513GlnArg: 1.513 ± 0.73
2.459GlnSer: 2.459 ± 0.719
1.892GlnThr: 1.892 ± 0.58
1.513GlnVal: 1.513 ± 0.387
0.0GlnTrp: 0.0 ± 0.0
0.946GlnTyr: 0.946 ± 0.42
0.0GlnXaa: 0.0 ± 0.0
Arg
2.459ArgAla: 2.459 ± 0.67
0.0ArgCys: 0.0 ± 0.0
3.216ArgAsp: 3.216 ± 0.98
3.405ArgGlu: 3.405 ± 0.93
2.649ArgPhe: 2.649 ± 0.509
2.649ArgGly: 2.649 ± 0.478
0.757ArgHis: 0.757 ± 0.368
4.54ArgIle: 4.54 ± 1.159
2.649ArgLys: 2.649 ± 0.707
3.973ArgLeu: 3.973 ± 1.074
1.135ArgMet: 1.135 ± 0.427
4.729ArgAsn: 4.729 ± 2.069
0.757ArgPro: 0.757 ± 0.469
1.892ArgGln: 1.892 ± 0.533
1.513ArgArg: 1.513 ± 0.59
2.27ArgSer: 2.27 ± 0.622
2.27ArgThr: 2.27 ± 0.812
2.838ArgVal: 2.838 ± 0.788
0.378ArgTrp: 0.378 ± 0.235
0.568ArgTyr: 0.568 ± 0.298
0.0ArgXaa: 0.0 ± 0.0
Ser
4.729SerAla: 4.729 ± 1.054
1.135SerCys: 1.135 ± 0.579
2.838SerAsp: 2.838 ± 0.928
2.459SerGlu: 2.459 ± 0.397
2.27SerPhe: 2.27 ± 0.485
4.54SerGly: 4.54 ± 0.806
1.513SerHis: 1.513 ± 0.37
5.108SerIle: 5.108 ± 0.973
5.108SerLys: 5.108 ± 1.059
4.919SerLeu: 4.919 ± 1.662
0.757SerMet: 0.757 ± 0.241
5.108SerAsn: 5.108 ± 0.661
2.649SerPro: 2.649 ± 0.671
2.459SerGln: 2.459 ± 0.922
3.405SerArg: 3.405 ± 0.723
6.81SerSer: 6.81 ± 1.373
3.784SerThr: 3.784 ± 0.665
3.594SerVal: 3.594 ± 0.957
0.757SerTrp: 0.757 ± 0.448
3.027SerTyr: 3.027 ± 0.987
0.0SerXaa: 0.0 ± 0.0
Thr
5.675ThrAla: 5.675 ± 1.712
0.568ThrCys: 0.568 ± 0.356
4.54ThrAsp: 4.54 ± 1.002
5.108ThrGlu: 5.108 ± 1.065
3.027ThrPhe: 3.027 ± 0.771
5.675ThrGly: 5.675 ± 2.188
1.135ThrHis: 1.135 ± 0.43
3.216ThrIle: 3.216 ± 0.912
5.108ThrLys: 5.108 ± 1.125
5.486ThrLeu: 5.486 ± 1.059
1.135ThrMet: 1.135 ± 0.351
5.675ThrAsn: 5.675 ± 0.813
2.838ThrPro: 2.838 ± 0.914
1.513ThrGln: 1.513 ± 0.545
3.216ThrArg: 3.216 ± 1.206
4.351ThrSer: 4.351 ± 1.017
5.865ThrThr: 5.865 ± 1.734
2.649ThrVal: 2.649 ± 1.007
0.0ThrTrp: 0.0 ± 0.0
2.081ThrTyr: 2.081 ± 0.668
0.0ThrXaa: 0.0 ± 0.0
Val
6.243ValAla: 6.243 ± 1.294
0.189ValCys: 0.189 ± 0.163
3.973ValAsp: 3.973 ± 0.823
3.973ValGlu: 3.973 ± 0.594
2.27ValPhe: 2.27 ± 0.521
4.162ValGly: 4.162 ± 1.554
0.757ValHis: 0.757 ± 0.358
2.838ValIle: 2.838 ± 0.435
5.108ValLys: 5.108 ± 1.401
5.297ValLeu: 5.297 ± 1.103
0.946ValMet: 0.946 ± 0.489
4.351ValAsn: 4.351 ± 0.894
2.649ValPro: 2.649 ± 0.735
1.892ValGln: 1.892 ± 0.739
2.459ValArg: 2.459 ± 0.772
3.027ValSer: 3.027 ± 1.139
0.757ValThr: 0.757 ± 0.35
4.162ValVal: 4.162 ± 0.946
0.189ValTrp: 0.189 ± 0.187
2.459ValTyr: 2.459 ± 0.705
0.0ValXaa: 0.0 ± 0.0
Trp
0.378TrpAla: 0.378 ± 0.226
0.0TrpCys: 0.0 ± 0.0
0.189TrpAsp: 0.189 ± 0.236
0.189TrpGlu: 0.189 ± 0.167
0.378TrpPhe: 0.378 ± 0.249
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.378TrpIle: 0.378 ± 0.243
0.568TrpLys: 0.568 ± 0.264
1.135TrpLeu: 1.135 ± 0.394
0.378TrpMet: 0.378 ± 0.189
0.378TrpAsn: 0.378 ± 0.257
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.135TrpArg: 1.135 ± 0.709
0.189TrpSer: 0.189 ± 0.227
0.568TrpThr: 0.568 ± 0.212
0.189TrpVal: 0.189 ± 0.167
0.0TrpTrp: 0.0 ± 0.0
0.568TrpTyr: 0.568 ± 0.328
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.649TyrAla: 2.649 ± 0.903
0.568TyrCys: 0.568 ± 0.296
2.081TyrAsp: 2.081 ± 0.445
2.649TyrGlu: 2.649 ± 1.037
1.324TyrPhe: 1.324 ± 0.52
1.892TyrGly: 1.892 ± 0.533
0.568TyrHis: 0.568 ± 0.336
2.838TyrIle: 2.838 ± 1.187
4.919TyrLys: 4.919 ± 1.602
2.649TyrLeu: 2.649 ± 0.784
0.568TyrMet: 0.568 ± 0.32
3.216TyrAsn: 3.216 ± 1.023
1.324TyrPro: 1.324 ± 0.571
1.324TyrGln: 1.324 ± 0.526
1.892TyrArg: 1.892 ± 0.548
1.703TyrSer: 1.703 ± 0.59
1.135TyrThr: 1.135 ± 0.343
2.27TyrVal: 2.27 ± 0.757
0.189TyrTrp: 0.189 ± 0.233
1.703TyrTyr: 1.703 ± 0.686
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (5287 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski