Amino acid dipeptide frequency for Cystoviridae sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
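The counting described above can be sketched in Python. This is a minimal illustration and not the code used to build the table: the function names, the use of overlapping residue pairs within each protein, the choice of the total pair count as denominator, and the uniform 2.5‰ baseline for the over/under labels are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter code
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one protein with two or more residues")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def label(value, expected=UNIFORM_PER_MILLE):
    """Classify a frequency against the uniform baseline (assumed reference)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


freqs = dipeptide_per_mille(["AAC", "acb"])  # toy input, not real proteins
```

With the toy input, the pairs are AA, AC, AC and CX (the unknown residue `b` becomes X), so `freqs["AC"]` is 500‰ and every absent pair is 0‰.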

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
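As a small worked example of the unit conversion, the snippet below turns one table value (LysGlu, 9.001‰) into a fraction and a percentage. The helper names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille -> percent (1 per mille = 0.1 %)."""
    return value * 0.1


lys_glu = 9.001  # LysGlu value taken from the table, in per mille
fraction = per_mille_to_fraction(lys_glu)
percent = per_mille_to_percent(lys_glu)
print(f"LysGlu: {fraction:.6f} as a fraction, {percent:.4f} %")
```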

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.552 ± 0.869
AlaCys: 1.862 ± 0.646
AlaAsp: 2.483 ± 0.68
AlaGlu: 4.655 ± 1.454
AlaPhe: 1.862 ± 1.108
AlaGly: 3.104 ± 0.772
AlaHis: 0.621 ± 0.412
AlaIle: 3.724 ± 1.105
AlaLys: 5.587 ± 1.05
AlaLeu: 4.035 ± 1.141
AlaMet: 1.862 ± 0.938
AlaAsn: 2.173 ± 0.879
AlaPro: 1.241 ± 0.429
AlaGln: 2.793 ± 0.933
AlaArg: 3.414 ± 1.201
AlaSer: 2.793 ± 0.568
AlaThr: 2.173 ± 0.98
AlaVal: 1.862 ± 0.813
AlaTrp: 0.31 ± 0.221
AlaTyr: 1.552 ± 0.651
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.931 ± 0.35
CysCys: 0.621 ± 0.412
CysAsp: 1.241 ± 0.735
CysGlu: 2.793 ± 0.761
CysPhe: 0.31 ± 0.221
CysGly: 1.241 ± 0.776
CysHis: 0.31 ± 0.343
CysIle: 0.931 ± 0.635
CysLys: 1.552 ± 0.793
CysLeu: 0.931 ± 0.725
CysMet: 0.621 ± 0.311
CysAsn: 2.173 ± 0.668
CysPro: 0.931 ± 0.496
CysGln: 0.621 ± 0.444
CysArg: 0.31 ± 0.343
CysSer: 1.241 ± 0.498
CysThr: 1.862 ± 0.683
CysVal: 1.862 ± 0.783
CysTrp: 0.31 ± 0.299
CysTyr: 0.621 ± 0.432
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.414 ± 1.006
AspCys: 1.552 ± 0.528
AspAsp: 2.173 ± 0.7
AspGlu: 3.724 ± 1.248
AspPhe: 3.414 ± 0.886
AspGly: 1.862 ± 0.859
AspHis: 1.552 ± 0.468
AspIle: 5.587 ± 1.139
AspLys: 4.655 ± 1.121
AspLeu: 2.173 ± 1.099
AspMet: 1.241 ± 0.451
AspAsn: 3.724 ± 1.141
AspPro: 2.173 ± 0.542
AspGln: 2.793 ± 0.733
AspArg: 1.241 ± 0.452
AspSer: 3.104 ± 1.401
AspThr: 2.483 ± 0.895
AspVal: 3.104 ± 1.05
AspTrp: 1.241 ± 0.882
AspTyr: 4.035 ± 0.764
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.518 ± 1.213
GluCys: 1.862 ± 0.808
GluAsp: 5.897 ± 1.175
GluGlu: 6.518 ± 0.663
GluPhe: 3.104 ± 0.725
GluGly: 6.828 ± 1.047
GluHis: 2.793 ± 0.916
GluIle: 5.587 ± 1.04
GluLys: 6.828 ± 0.872
GluLeu: 5.587 ± 0.982
GluMet: 2.483 ± 0.842
GluAsn: 3.414 ± 0.976
GluPro: 1.862 ± 0.717
GluGln: 1.552 ± 0.955
GluArg: 1.552 ± 0.955
GluSer: 4.345 ± 0.721
GluThr: 5.276 ± 1.092
GluVal: 5.587 ± 1.307
GluTrp: 2.173 ± 0.639
GluTyr: 2.483 ± 0.694
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.552 ± 0.885
PheCys: 0.621 ± 0.387
PheAsp: 4.035 ± 1.032
PheGlu: 3.414 ± 0.696
PhePhe: 2.793 ± 1.061
PheGly: 3.104 ± 1.448
PheHis: 0.31 ± 0.326
PheIle: 1.552 ± 0.759
PheLys: 3.104 ± 0.877
PheLeu: 2.173 ± 0.745
PheMet: 0.931 ± 0.537
PheAsn: 3.414 ± 1.208
PhePro: 2.483 ± 0.611
PheGln: 1.862 ± 0.476
PheArg: 2.173 ± 0.973
PheSer: 2.793 ± 1.092
PheThr: 2.793 ± 0.923
PheVal: 1.862 ± 0.729
PheTrp: 1.552 ± 0.853
PheTyr: 3.104 ± 1.276
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.483 ± 0.699
GlyCys: 1.862 ± 0.921
GlyAsp: 4.035 ± 0.798
GlyGlu: 5.276 ± 0.833
GlyPhe: 2.173 ± 0.942
GlyGly: 4.655 ± 0.904
GlyHis: 0.931 ± 0.668
GlyIle: 4.655 ± 1.437
GlyLys: 6.518 ± 1.195
GlyLeu: 4.966 ± 1.419
GlyMet: 2.173 ± 0.73
GlyAsn: 3.724 ± 0.795
GlyPro: 0.0 ± 0.0
GlyGln: 1.552 ± 0.635
GlyArg: 1.862 ± 0.623
GlySer: 4.966 ± 0.939
GlyThr: 2.793 ± 0.762
GlyVal: 4.345 ± 0.977
GlyTrp: 1.862 ± 0.652
GlyTyr: 3.104 ± 0.814
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.931 ± 0.646
HisCys: 0.621 ± 0.407
HisAsp: 0.931 ± 0.628
HisGlu: 1.552 ± 0.851
HisPhe: 1.241 ± 0.715
HisGly: 1.552 ± 0.777
HisHis: 0.621 ± 0.425
HisIle: 1.862 ± 0.531
HisLys: 1.241 ± 0.583
HisLeu: 2.483 ± 0.726
HisMet: 0.931 ± 0.383
HisAsn: 1.241 ± 0.564
HisPro: 0.621 ± 0.475
HisGln: 1.552 ± 0.73
HisArg: 0.621 ± 0.659
HisSer: 2.483 ± 0.811
HisThr: 1.241 ± 0.398
HisVal: 1.552 ± 0.623
HisTrp: 0.621 ± 0.419
HisTyr: 2.173 ± 0.9
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.241 ± 0.464
IleCys: 1.241 ± 0.479
IleAsp: 2.483 ± 1.304
IleGlu: 4.655 ± 0.94
IlePhe: 2.173 ± 0.907
IleGly: 2.483 ± 0.639
IleHis: 2.173 ± 0.628
IleIle: 4.345 ± 1.014
IleLys: 5.587 ± 1.144
IleLeu: 4.966 ± 0.967
IleMet: 2.173 ± 0.589
IleAsn: 3.724 ± 1.033
IlePro: 3.724 ± 0.743
IleGln: 2.173 ± 0.768
IleArg: 3.724 ± 0.908
IleSer: 4.966 ± 0.969
IleThr: 4.966 ± 0.734
IleVal: 2.483 ± 0.657
IleTrp: 0.0 ± 0.0
IleTyr: 4.035 ± 1.388
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.793 ± 0.695
LysCys: 1.862 ± 0.959
LysAsp: 6.518 ± 0.863
LysGlu: 9.001 ± 1.523
LysPhe: 3.724 ± 1.037
LysGly: 4.345 ± 0.995
LysHis: 2.173 ± 1.028
LysIle: 3.724 ± 1.394
LysLys: 5.276 ± 1.247
LysLeu: 8.07 ± 1.194
LysMet: 1.862 ± 0.924
LysAsn: 3.414 ± 0.99
LysPro: 1.241 ± 0.548
LysGln: 3.104 ± 0.741
LysArg: 4.966 ± 1.25
LysSer: 5.587 ± 1.092
LysThr: 4.035 ± 0.787
LysVal: 5.897 ± 1.205
LysTrp: 2.793 ± 1.113
LysTyr: 3.104 ± 1.077
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.035 ± 0.714
LeuCys: 1.241 ± 0.684
LeuAsp: 4.345 ± 1.212
LeuGlu: 6.207 ± 1.712
LeuPhe: 1.862 ± 0.632
LeuGly: 5.276 ± 0.819
LeuHis: 2.173 ± 0.883
LeuIle: 3.724 ± 1.193
LeuLys: 6.207 ± 1.282
LeuLeu: 4.035 ± 1.029
LeuMet: 1.552 ± 0.543
LeuAsn: 5.897 ± 1.373
LeuPro: 3.724 ± 1.253
LeuGln: 0.931 ± 0.35
LeuArg: 5.276 ± 1.502
LeuSer: 4.966 ± 0.645
LeuThr: 4.345 ± 0.785
LeuVal: 4.035 ± 1.178
LeuTrp: 0.31 ± 0.221
LeuTyr: 3.414 ± 0.777
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.621 ± 0.438
MetCys: 0.31 ± 0.285
MetAsp: 0.621 ± 0.446
MetGlu: 1.552 ± 0.707
MetPhe: 1.862 ± 0.867
MetGly: 1.552 ± 0.651
MetHis: 0.31 ± 0.285
MetIle: 2.793 ± 0.978
MetLys: 1.552 ± 0.63
MetLeu: 1.862 ± 0.851
MetMet: 0.31 ± 0.221
MetAsn: 2.483 ± 0.943
MetPro: 0.621 ± 0.286
MetGln: 0.931 ± 0.646
MetArg: 0.931 ± 0.569
MetSer: 2.483 ± 1.172
MetThr: 1.862 ± 0.641
MetVal: 0.931 ± 0.505
MetTrp: 0.0 ± 0.0
MetTyr: 0.621 ± 0.441
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.345 ± 1.298
AsnCys: 0.931 ± 0.574
AsnAsp: 1.552 ± 0.515
AsnGlu: 2.793 ± 0.904
AsnPhe: 2.173 ± 0.64
AsnGly: 4.966 ± 0.93
AsnHis: 1.552 ± 0.534
AsnIle: 2.793 ± 1.004
AsnLys: 5.897 ± 1.002
AsnLeu: 4.966 ± 1.254
AsnMet: 1.552 ± 0.624
AsnAsn: 4.345 ± 1.208
AsnPro: 1.241 ± 0.57
AsnGln: 2.173 ± 0.839
AsnArg: 2.173 ± 0.663
AsnSer: 3.104 ± 1.624
AsnThr: 3.104 ± 0.671
AsnVal: 4.966 ± 0.924
AsnTrp: 0.621 ± 0.311
AsnTyr: 2.793 ± 0.9
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.793 ± 0.68
ProCys: 0.0 ± 0.0
ProAsp: 1.241 ± 0.484
ProGlu: 2.173 ± 0.712
ProPhe: 2.483 ± 1.053
ProGly: 0.621 ± 0.475
ProHis: 0.931 ± 0.383
ProIle: 1.862 ± 0.75
ProLys: 0.931 ± 0.429
ProLeu: 1.241 ± 0.528
ProMet: 0.931 ± 0.35
ProAsn: 1.862 ± 0.631
ProPro: 0.931 ± 0.455
ProGln: 1.552 ± 0.531
ProArg: 1.552 ± 0.664
ProSer: 2.483 ± 0.526
ProThr: 2.483 ± 0.658
ProVal: 2.173 ± 0.63
ProTrp: 0.31 ± 0.285
ProTyr: 1.552 ± 0.597
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.104 ± 0.951
GlnCys: 0.621 ± 0.393
GlnAsp: 2.793 ± 1.36
GlnGlu: 4.035 ± 0.898
GlnPhe: 0.931 ± 0.392
GlnGly: 2.173 ± 0.482
GlnHis: 0.931 ± 0.576
GlnIle: 2.483 ± 0.68
GlnLys: 2.173 ± 0.883
GlnLeu: 3.104 ± 0.812
GlnMet: 0.621 ± 0.286
GlnAsn: 1.241 ± 0.533
GlnPro: 0.31 ± 0.285
GlnGln: 0.621 ± 0.424
GlnArg: 1.862 ± 0.87
GlnSer: 1.552 ± 0.536
GlnThr: 1.862 ± 0.728
GlnVal: 2.173 ± 0.842
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.862 ± 0.71
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.104 ± 0.91
ArgCys: 1.552 ± 0.715
ArgAsp: 0.931 ± 0.548
ArgGlu: 4.966 ± 1.515
ArgPhe: 2.173 ± 0.652
ArgGly: 2.483 ± 0.709
ArgHis: 1.241 ± 0.482
ArgIle: 3.104 ± 1.016
ArgLys: 2.793 ± 0.983
ArgLeu: 3.724 ± 1.063
ArgMet: 1.241 ± 0.354
ArgAsn: 2.483 ± 0.711
ArgPro: 0.931 ± 0.432
ArgGln: 2.173 ± 0.981
ArgArg: 0.931 ± 0.534
ArgSer: 1.862 ± 0.543
ArgThr: 1.552 ± 0.896
ArgVal: 3.104 ± 0.793
ArgTrp: 0.621 ± 0.467
ArgTyr: 1.862 ± 0.861
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.483 ± 1.143
SerCys: 1.552 ± 0.699
SerAsp: 2.173 ± 0.907
SerGlu: 3.414 ± 0.931
SerPhe: 4.655 ± 1.093
SerGly: 7.138 ± 1.145
SerHis: 1.241 ± 0.629
SerIle: 2.793 ± 0.72
SerLys: 6.207 ± 1.272
SerLeu: 5.587 ± 1.43
SerMet: 0.31 ± 0.285
SerAsn: 3.104 ± 0.984
SerPro: 1.552 ± 0.553
SerGln: 2.483 ± 0.572
SerArg: 2.483 ± 0.945
SerSer: 3.414 ± 0.843
SerThr: 2.483 ± 0.729
SerVal: 4.655 ± 1.108
SerTrp: 1.241 ± 0.773
SerTyr: 4.966 ± 1.513
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.241 ± 0.882
ThrCys: 0.931 ± 0.505
ThrAsp: 3.104 ± 0.913
ThrGlu: 4.655 ± 0.671
ThrPhe: 1.552 ± 0.495
ThrGly: 3.414 ± 0.749
ThrHis: 2.483 ± 0.828
ThrIle: 4.966 ± 0.869
ThrLys: 4.035 ± 1.443
ThrLeu: 4.035 ± 1.566
ThrMet: 0.621 ± 0.393
ThrAsn: 3.104 ± 0.99
ThrPro: 3.414 ± 0.931
ThrGln: 2.173 ± 0.996
ThrArg: 2.793 ± 0.97
ThrSer: 1.862 ± 0.569
ThrThr: 1.862 ± 1.198
ThrVal: 3.104 ± 0.78
ThrTrp: 1.241 ± 0.728
ThrTyr: 3.104 ± 1.083
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.414 ± 1.228
ValCys: 0.621 ± 0.554
ValAsp: 6.207 ± 1.194
ValGlu: 4.035 ± 0.983
ValPhe: 2.793 ± 1.03
ValGly: 3.414 ± 0.664
ValHis: 0.931 ± 0.507
ValIle: 4.035 ± 1.41
ValLys: 7.138 ± 1.224
ValLeu: 2.793 ± 1.138
ValMet: 1.241 ± 0.739
ValAsn: 4.035 ± 1.115
ValPro: 1.241 ± 0.54
ValGln: 0.931 ± 0.695
ValArg: 2.793 ± 0.813
ValSer: 5.587 ± 1.307
ValThr: 2.793 ± 0.952
ValVal: 6.828 ± 2.011
ValTrp: 1.552 ± 0.658
ValTyr: 0.931 ± 0.692
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.241 ± 0.608
TrpCys: 0.31 ± 0.299
TrpAsp: 0.621 ± 0.387
TrpGlu: 2.793 ± 1.237
TrpPhe: 0.0 ± 0.0
TrpGly: 0.931 ± 0.523
TrpHis: 0.621 ± 0.286
TrpIle: 1.241 ± 0.766
TrpLys: 2.173 ± 1.14
TrpLeu: 2.483 ± 0.888
TrpMet: 0.621 ± 0.388
TrpAsn: 0.931 ± 0.507
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.241 ± 0.396
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.931 ± 0.47
TrpTrp: 0.621 ± 0.516
TrpTyr: 0.621 ± 0.441
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.862 ± 0.724
TyrCys: 1.241 ± 0.486
TyrAsp: 1.862 ± 0.811
TyrGlu: 4.035 ± 0.766
TyrPhe: 4.345 ± 1.192
TyrGly: 2.793 ± 0.724
TyrHis: 2.173 ± 0.698
TyrIle: 1.552 ± 0.568
TyrLys: 4.345 ± 1.561
TyrLeu: 4.345 ± 0.917
TyrMet: 0.621 ± 0.311
TyrAsn: 1.552 ± 0.615
TyrPro: 1.862 ± 0.556
TyrGln: 2.483 ± 0.647
TyrArg: 0.931 ± 0.674
TyrSer: 4.345 ± 1.471
TyrThr: 3.724 ± 1.209
TyrVal: 1.862 ± 0.603
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.552 ± 0.683
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 15 proteins (3223 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
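One way such a protein-level bootstrap could work is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the spread of those estimates is reported. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions for this illustration; the database's exact procedure is not described here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each round draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)          # fixed seed keeps the sketch repeatable
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not the 15 proteins behind the table.
mean_ke, sd_ke = bootstrap_error(["MKEKEL", "AKEGGS", "MSTVVA"], "KE")
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, so the estimated error reflects variation between proteins.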

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski