Amino acid dipepetide frequency for Sulfolobus virus-like particle SSV2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
1.406AlaAsp: 1.406 ± 0.426
3.214AlaGlu: 3.214 ± 0.927
1.808AlaPhe: 1.808 ± 0.59
3.415AlaGly: 3.415 ± 1.325
0.402AlaHis: 0.402 ± 0.26
5.223AlaIle: 5.223 ± 1.302
3.616AlaLys: 3.616 ± 1.167
6.027AlaLeu: 6.027 ± 1.042
0.603AlaMet: 0.603 ± 0.254
4.018AlaAsn: 4.018 ± 0.817
1.808AlaPro: 1.808 ± 0.503
1.406AlaGln: 1.406 ± 0.492
1.406AlaArg: 1.406 ± 0.548
3.214AlaSer: 3.214 ± 0.844
2.21AlaThr: 2.21 ± 0.699
4.62AlaVal: 4.62 ± 1.088
1.004AlaTrp: 1.004 ± 0.45
3.214AlaTyr: 3.214 ± 0.647
0.0AlaXaa: 0.0 ± 0.0
Cys
0.201CysAla: 0.201 ± 0.176
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.201CysPhe: 0.201 ± 0.205
1.004CysGly: 1.004 ± 0.49
0.0CysHis: 0.0 ± 0.0
0.402CysIle: 0.402 ± 0.275
0.201CysLys: 0.201 ± 0.19
1.004CysLeu: 1.004 ± 0.508
0.0CysMet: 0.0 ± 0.0
0.201CysAsn: 0.201 ± 0.223
1.004CysPro: 1.004 ± 0.592
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.201CysSer: 0.201 ± 0.189
0.0CysThr: 0.0 ± 0.0
0.402CysVal: 0.402 ± 0.254
0.201CysTrp: 0.201 ± 0.176
0.201CysTyr: 0.201 ± 0.223
0.0CysXaa: 0.0 ± 0.0
Asp
1.808AspAla: 1.808 ± 0.849
0.0AspCys: 0.0 ± 0.0
1.205AspAsp: 1.205 ± 0.621
1.607AspGlu: 1.607 ± 0.547
1.607AspPhe: 1.607 ± 0.645
2.611AspGly: 2.611 ± 0.777
1.808AspHis: 1.808 ± 0.978
2.21AspIle: 2.21 ± 0.599
2.21AspLys: 2.21 ± 0.667
4.821AspLeu: 4.821 ± 1.582
1.808AspMet: 1.808 ± 0.689
0.804AspAsn: 0.804 ± 0.412
0.201AspPro: 0.201 ± 0.183
0.201AspGln: 0.201 ± 0.189
2.21AspArg: 2.21 ± 1.061
1.808AspSer: 1.808 ± 0.556
1.406AspThr: 1.406 ± 0.376
2.21AspVal: 2.21 ± 0.711
0.402AspTrp: 0.402 ± 0.229
1.808AspTyr: 1.808 ± 0.605
0.0AspXaa: 0.0 ± 0.0
Glu
2.812GluAla: 2.812 ± 0.823
0.603GluCys: 0.603 ± 0.355
2.009GluAsp: 2.009 ± 0.742
6.227GluGlu: 6.227 ± 1.962
2.411GluPhe: 2.411 ± 0.632
2.411GluGly: 2.411 ± 0.724
1.004GluHis: 1.004 ± 0.431
5.022GluIle: 5.022 ± 1.448
4.018GluLys: 4.018 ± 1.249
7.634GluLeu: 7.634 ± 1.886
1.607GluMet: 1.607 ± 0.427
2.411GluAsn: 2.411 ± 1.042
1.205GluPro: 1.205 ± 0.592
1.808GluGln: 1.808 ± 0.568
2.21GluArg: 2.21 ± 0.855
3.013GluSer: 3.013 ± 1.051
1.808GluThr: 1.808 ± 0.737
3.817GluVal: 3.817 ± 1.207
0.402GluTrp: 0.402 ± 0.243
3.214GluTyr: 3.214 ± 0.812
0.0GluXaa: 0.0 ± 0.0
Phe
2.611PheAla: 2.611 ± 0.703
0.402PheCys: 0.402 ± 0.277
2.009PheAsp: 2.009 ± 0.534
2.611PheGlu: 2.611 ± 0.763
2.812PhePhe: 2.812 ± 0.815
2.611PheGly: 2.611 ± 0.689
0.402PheHis: 0.402 ± 0.307
2.812PheIle: 2.812 ± 0.804
2.411PheLys: 2.411 ± 0.651
4.821PheLeu: 4.821 ± 1.248
1.205PheMet: 1.205 ± 0.394
2.21PheAsn: 2.21 ± 0.991
1.406PhePro: 1.406 ± 0.426
2.009PheGln: 2.009 ± 0.771
2.009PheArg: 2.009 ± 0.677
4.419PheSer: 4.419 ± 1.282
3.616PheThr: 3.616 ± 0.957
2.411PheVal: 2.411 ± 1.045
0.603PheTrp: 0.603 ± 0.269
4.419PheTyr: 4.419 ± 0.919
0.0PheXaa: 0.0 ± 0.0
Gly
1.808GlyAla: 1.808 ± 0.705
0.0GlyCys: 0.0 ± 0.0
1.406GlyAsp: 1.406 ± 0.457
0.804GlyGlu: 0.804 ± 0.386
5.022GlyPhe: 5.022 ± 1.306
2.812GlyGly: 2.812 ± 0.924
0.201GlyHis: 0.201 ± 0.223
4.821GlyIle: 4.821 ± 0.925
4.821GlyLys: 4.821 ± 0.963
5.826GlyLeu: 5.826 ± 1.175
0.603GlyMet: 0.603 ± 0.375
1.607GlyAsn: 1.607 ± 0.611
4.821GlyPro: 4.821 ± 2.067
2.009GlyGln: 2.009 ± 0.926
2.611GlyArg: 2.611 ± 0.952
5.424GlySer: 5.424 ± 1.286
3.415GlyThr: 3.415 ± 0.917
4.219GlyVal: 4.219 ± 0.854
1.004GlyTrp: 1.004 ± 0.345
3.214GlyTyr: 3.214 ± 0.812
0.0GlyXaa: 0.0 ± 0.0
His
0.402HisAla: 0.402 ± 0.284
0.0HisCys: 0.0 ± 0.0
0.402HisAsp: 0.402 ± 0.277
0.402HisGlu: 0.402 ± 0.26
0.603HisPhe: 0.603 ± 0.331
0.402HisGly: 0.402 ± 0.329
0.201HisHis: 0.201 ± 0.213
0.804HisIle: 0.804 ± 0.446
0.804HisLys: 0.804 ± 0.381
2.009HisLeu: 2.009 ± 0.889
0.201HisMet: 0.201 ± 0.176
0.804HisAsn: 0.804 ± 0.489
0.201HisPro: 0.201 ± 0.173
1.607HisGln: 1.607 ± 0.885
0.804HisArg: 0.804 ± 0.488
1.004HisSer: 1.004 ± 0.435
0.603HisThr: 0.603 ± 0.329
1.004HisVal: 1.004 ± 0.491
0.0HisTrp: 0.0 ± 0.0
1.205HisTyr: 1.205 ± 0.544
0.0HisXaa: 0.0 ± 0.0
Ile
3.817IleAla: 3.817 ± 0.777
0.402IleCys: 0.402 ± 0.272
2.611IleAsp: 2.611 ± 0.635
3.817IleGlu: 3.817 ± 0.915
5.625IlePhe: 5.625 ± 1.217
3.817IleGly: 3.817 ± 0.801
1.406IleHis: 1.406 ± 0.556
6.83IleIle: 6.83 ± 1.14
3.817IleLys: 3.817 ± 1.446
9.241IleLeu: 9.241 ± 1.979
2.009IleMet: 2.009 ± 0.585
4.018IleAsn: 4.018 ± 1.355
4.219IlePro: 4.219 ± 0.836
2.21IleGln: 2.21 ± 0.965
4.821IleArg: 4.821 ± 1.136
7.232IleSer: 7.232 ± 1.252
4.62IleThr: 4.62 ± 1.091
4.018IleVal: 4.018 ± 1.181
0.603IleTrp: 0.603 ± 0.315
4.821IleTyr: 4.821 ± 0.965
0.0IleXaa: 0.0 ± 0.0
Lys
3.415LysAla: 3.415 ± 0.877
0.804LysCys: 0.804 ± 0.528
2.21LysAsp: 2.21 ± 0.736
6.227LysGlu: 6.227 ± 1.668
2.411LysPhe: 2.411 ± 0.755
3.415LysGly: 3.415 ± 0.76
1.205LysHis: 1.205 ± 0.39
7.232LysIle: 7.232 ± 1.774
7.031LysLys: 7.031 ± 1.949
7.433LysLeu: 7.433 ± 1.651
1.607LysMet: 1.607 ± 0.437
3.013LysAsn: 3.013 ± 0.623
1.808LysPro: 1.808 ± 0.661
3.214LysGln: 3.214 ± 0.77
2.411LysArg: 2.411 ± 0.84
2.812LysSer: 2.812 ± 0.927
4.62LysThr: 4.62 ± 0.911
4.219LysVal: 4.219 ± 1.002
0.603LysTrp: 0.603 ± 0.289
3.415LysTyr: 3.415 ± 0.83
0.0LysXaa: 0.0 ± 0.0
Leu
8.035LeuAla: 8.035 ± 1.169
0.201LeuCys: 0.201 ± 0.19
4.419LeuAsp: 4.419 ± 1.336
6.227LeuGlu: 6.227 ± 1.412
5.625LeuPhe: 5.625 ± 1.645
4.821LeuGly: 4.821 ± 1.009
0.402LeuHis: 0.402 ± 0.277
9.241LeuIle: 9.241 ± 1.432
7.031LeuLys: 7.031 ± 1.603
14.062LeuLeu: 14.062 ± 1.549
3.214LeuMet: 3.214 ± 0.813
8.035LeuAsn: 8.035 ± 1.055
4.821LeuPro: 4.821 ± 0.828
4.018LeuGln: 4.018 ± 0.733
4.419LeuArg: 4.419 ± 1.189
10.044LeuSer: 10.044 ± 1.469
8.035LeuThr: 8.035 ± 1.1
7.634LeuVal: 7.634 ± 1.865
2.411LeuTrp: 2.411 ± 1.063
4.821LeuTyr: 4.821 ± 0.994
0.0LeuXaa: 0.0 ± 0.0
Met
1.406MetAla: 1.406 ± 0.523
0.0MetCys: 0.0 ± 0.0
1.004MetAsp: 1.004 ± 0.542
1.205MetGlu: 1.205 ± 0.421
0.804MetPhe: 0.804 ± 0.399
2.411MetGly: 2.411 ± 0.703
0.201MetHis: 0.201 ± 0.176
1.406MetIle: 1.406 ± 0.471
2.21MetLys: 2.21 ± 0.437
2.411MetLeu: 2.411 ± 0.736
1.004MetMet: 1.004 ± 0.438
0.804MetAsn: 0.804 ± 0.477
0.804MetPro: 0.804 ± 0.37
0.201MetGln: 0.201 ± 0.201
1.205MetArg: 1.205 ± 0.558
2.009MetSer: 2.009 ± 0.501
1.205MetThr: 1.205 ± 0.405
1.205MetVal: 1.205 ± 0.426
0.201MetTrp: 0.201 ± 0.19
0.804MetTyr: 0.804 ± 0.487
0.0MetXaa: 0.0 ± 0.0
Asn
4.219AsnAla: 4.219 ± 0.929
0.402AsnCys: 0.402 ± 0.366
2.009AsnAsp: 2.009 ± 0.75
3.214AsnGlu: 3.214 ± 0.87
3.013AsnPhe: 3.013 ± 0.767
4.62AsnGly: 4.62 ± 0.828
0.402AsnHis: 0.402 ± 0.272
4.219AsnIle: 4.219 ± 1.409
2.411AsnLys: 2.411 ± 0.717
3.415AsnLeu: 3.415 ± 0.772
0.804AsnMet: 0.804 ± 0.396
4.219AsnAsn: 4.219 ± 0.93
2.411AsnPro: 2.411 ± 0.623
2.611AsnGln: 2.611 ± 0.979
1.004AsnArg: 1.004 ± 0.487
5.826AsnSer: 5.826 ± 1.255
2.611AsnThr: 2.611 ± 0.739
4.419AsnVal: 4.419 ± 1.509
1.004AsnTrp: 1.004 ± 0.348
2.611AsnTyr: 2.611 ± 0.828
0.0AsnXaa: 0.0 ± 0.0
Pro
1.808ProAla: 1.808 ± 0.446
0.0ProCys: 0.0 ± 0.0
1.205ProAsp: 1.205 ± 0.659
1.406ProGlu: 1.406 ± 0.559
2.611ProPhe: 2.611 ± 0.805
1.808ProGly: 1.808 ± 0.758
0.804ProHis: 0.804 ± 0.386
1.808ProIle: 1.808 ± 0.586
2.812ProLys: 2.812 ± 0.927
5.022ProLeu: 5.022 ± 0.858
0.804ProMet: 0.804 ± 0.419
2.611ProAsn: 2.611 ± 0.806
2.411ProPro: 2.411 ± 0.976
1.808ProGln: 1.808 ± 0.442
1.004ProArg: 1.004 ± 0.535
3.415ProSer: 3.415 ± 0.897
2.009ProThr: 2.009 ± 0.644
3.214ProVal: 3.214 ± 0.696
0.603ProTrp: 0.603 ± 0.518
2.21ProTyr: 2.21 ± 0.87
0.0ProXaa: 0.0 ± 0.0
Gln
0.603GlnAla: 0.603 ± 0.284
0.201GlnCys: 0.201 ± 0.225
1.004GlnAsp: 1.004 ± 0.39
1.406GlnGlu: 1.406 ± 0.488
1.808GlnPhe: 1.808 ± 1.009
1.607GlnGly: 1.607 ± 0.545
0.804GlnHis: 0.804 ± 0.499
3.817GlnIle: 3.817 ± 0.843
3.013GlnLys: 3.013 ± 0.959
3.817GlnLeu: 3.817 ± 0.876
1.004GlnMet: 1.004 ± 0.363
2.009GlnAsn: 2.009 ± 0.683
1.808GlnPro: 1.808 ± 0.989
0.804GlnGln: 0.804 ± 0.403
0.603GlnArg: 0.603 ± 0.33
2.411GlnSer: 2.411 ± 0.763
2.812GlnThr: 2.812 ± 0.611
3.214GlnVal: 3.214 ± 0.81
1.808GlnTrp: 1.808 ± 0.835
1.406GlnTyr: 1.406 ± 0.73
0.0GlnXaa: 0.0 ± 0.0
Arg
1.406ArgAla: 1.406 ± 0.591
0.402ArgCys: 0.402 ± 0.288
2.611ArgAsp: 2.611 ± 0.984
4.018ArgGlu: 4.018 ± 1.075
1.004ArgPhe: 1.004 ± 0.429
1.205ArgGly: 1.205 ± 0.564
1.004ArgHis: 1.004 ± 0.494
2.411ArgIle: 2.411 ± 0.692
4.419ArgLys: 4.419 ± 1.206
4.821ArgLeu: 4.821 ± 1.172
0.804ArgMet: 0.804 ± 0.399
1.406ArgAsn: 1.406 ± 0.475
0.402ArgPro: 0.402 ± 0.288
2.009ArgGln: 2.009 ± 0.7
2.812ArgArg: 2.812 ± 0.987
0.804ArgSer: 0.804 ± 0.373
1.004ArgThr: 1.004 ± 0.472
3.214ArgVal: 3.214 ± 1.072
0.201ArgTrp: 0.201 ± 0.205
1.205ArgTyr: 1.205 ± 0.512
0.0ArgXaa: 0.0 ± 0.0
Ser
5.022SerAla: 5.022 ± 1.019
0.201SerCys: 0.201 ± 0.2
1.607SerAsp: 1.607 ± 0.552
2.411SerGlu: 2.411 ± 0.866
3.214SerPhe: 3.214 ± 0.696
5.625SerGly: 5.625 ± 1.18
1.205SerHis: 1.205 ± 0.49
5.424SerIle: 5.424 ± 1.411
5.625SerLys: 5.625 ± 1.183
7.433SerLeu: 7.433 ± 1.145
1.406SerMet: 1.406 ± 0.409
4.62SerAsn: 4.62 ± 0.854
2.411SerPro: 2.411 ± 0.636
3.013SerGln: 3.013 ± 0.8
2.21SerArg: 2.21 ± 0.7
5.424SerSer: 5.424 ± 1.923
4.821SerThr: 4.821 ± 1.651
6.629SerVal: 6.629 ± 1.543
1.406SerTrp: 1.406 ± 0.551
5.022SerTyr: 5.022 ± 1.084
0.0SerXaa: 0.0 ± 0.0
Thr
2.21ThrAla: 2.21 ± 0.771
0.201ThrCys: 0.201 ± 0.189
1.808ThrAsp: 1.808 ± 0.447
4.62ThrGlu: 4.62 ± 1.376
2.21ThrPhe: 2.21 ± 0.698
3.214ThrGly: 3.214 ± 1.332
0.804ThrHis: 0.804 ± 0.42
5.826ThrIle: 5.826 ± 2.099
4.018ThrLys: 4.018 ± 1.105
9.241ThrLeu: 9.241 ± 1.422
0.804ThrMet: 0.804 ± 0.5
3.616ThrAsn: 3.616 ± 1.218
3.013ThrPro: 3.013 ± 0.72
3.013ThrGln: 3.013 ± 0.809
1.205ThrArg: 1.205 ± 0.446
4.219ThrSer: 4.219 ± 0.793
6.629ThrThr: 6.629 ± 1.689
3.415ThrVal: 3.415 ± 0.901
0.603ThrTrp: 0.603 ± 0.518
3.817ThrTyr: 3.817 ± 1.052
0.0ThrXaa: 0.0 ± 0.0
Val
3.214ValAla: 3.214 ± 0.932
1.205ValCys: 1.205 ± 0.834
2.009ValAsp: 2.009 ± 0.915
3.013ValGlu: 3.013 ± 0.822
1.808ValPhe: 1.808 ± 0.462
5.223ValGly: 5.223 ± 1.14
0.603ValHis: 0.603 ± 0.351
5.022ValIle: 5.022 ± 1.117
4.821ValLys: 4.821 ± 1.159
8.035ValLeu: 8.035 ± 1.641
1.406ValMet: 1.406 ± 0.455
4.419ValAsn: 4.419 ± 1.304
2.21ValPro: 2.21 ± 0.735
2.009ValGln: 2.009 ± 0.638
2.411ValArg: 2.411 ± 0.981
7.031ValSer: 7.031 ± 1.33
6.227ValThr: 6.227 ± 1.378
5.826ValVal: 5.826 ± 1.202
1.808ValTrp: 1.808 ± 0.919
3.415ValTyr: 3.415 ± 0.773
0.0ValXaa: 0.0 ± 0.0
Trp
1.004TrpAla: 1.004 ± 0.369
0.0TrpCys: 0.0 ± 0.0
0.201TrpAsp: 0.201 ± 0.199
0.402TrpGlu: 0.402 ± 0.29
0.402TrpPhe: 0.402 ± 0.272
0.603TrpGly: 0.603 ± 0.432
0.0TrpHis: 0.0 ± 0.0
0.804TrpIle: 0.804 ± 0.476
0.804TrpLys: 0.804 ± 0.327
2.21TrpLeu: 2.21 ± 0.551
0.402TrpMet: 0.402 ± 0.29
0.0TrpAsn: 0.0 ± 0.0
0.201TrpPro: 0.201 ± 0.188
0.201TrpGln: 0.201 ± 0.201
0.603TrpArg: 0.603 ± 0.329
1.406TrpSer: 1.406 ± 0.529
3.415TrpThr: 3.415 ± 1.665
1.607TrpVal: 1.607 ± 0.528
0.0TrpTrp: 0.0 ± 0.0
1.406TrpTyr: 1.406 ± 0.727
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.812TyrAla: 2.812 ± 0.618
0.201TyrCys: 0.201 ± 0.223
1.607TyrAsp: 1.607 ± 0.474
2.812TyrGlu: 2.812 ± 0.713
2.812TyrPhe: 2.812 ± 0.896
3.013TyrGly: 3.013 ± 1.056
0.603TyrHis: 0.603 ± 0.432
4.62TyrIle: 4.62 ± 0.894
3.013TyrLys: 3.013 ± 0.948
8.236TyrLeu: 8.236 ± 0.841
1.004TyrMet: 1.004 ± 0.452
4.62TyrAsn: 4.62 ± 1.057
2.21TyrPro: 2.21 ± 0.927
1.808TyrGln: 1.808 ± 0.481
1.205TyrArg: 1.205 ± 0.422
3.013TyrSer: 3.013 ± 0.828
3.415TyrThr: 3.415 ± 1.004
4.419TyrVal: 4.419 ± 1.03
0.804TyrTrp: 0.804 ± 0.377
4.219TyrTyr: 4.219 ± 1.199
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (4979 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski