Amino acid dipepetide frequency for Sulfolobus spindle-shaped virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
1.435AlaAsp: 1.435 ± 0.643
2.63AlaGlu: 2.63 ± 0.775
4.304AlaPhe: 4.304 ± 0.967
2.869AlaGly: 2.869 ± 1.126
0.239AlaHis: 0.239 ± 0.236
5.5AlaIle: 5.5 ± 1.141
5.022AlaLys: 5.022 ± 1.36
6.934AlaLeu: 6.934 ± 0.926
0.956AlaMet: 0.956 ± 0.428
4.065AlaAsn: 4.065 ± 0.754
2.391AlaPro: 2.391 ± 0.691
1.913AlaGln: 1.913 ± 0.849
0.717AlaArg: 0.717 ± 0.424
4.543AlaSer: 4.543 ± 0.932
2.391AlaThr: 2.391 ± 0.748
2.63AlaVal: 2.63 ± 0.825
1.435AlaTrp: 1.435 ± 1.027
3.348AlaTyr: 3.348 ± 1.097
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.478CysAsp: 0.478 ± 0.328
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.478CysIle: 0.478 ± 0.297
0.478CysLys: 0.478 ± 0.395
0.956CysLeu: 0.956 ± 0.445
0.239CysMet: 0.239 ± 0.235
0.239CysAsn: 0.239 ± 0.305
1.196CysPro: 1.196 ± 0.548
0.239CysGln: 0.239 ± 0.236
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.239CysVal: 0.239 ± 0.197
0.478CysTrp: 0.478 ± 0.347
0.239CysTyr: 0.239 ± 0.237
0.0CysXaa: 0.0 ± 0.0
Asp
1.435AspAla: 1.435 ± 0.579
0.0AspCys: 0.0 ± 0.0
0.717AspAsp: 0.717 ± 0.393
1.913AspGlu: 1.913 ± 0.667
0.956AspPhe: 0.956 ± 0.731
3.348AspGly: 3.348 ± 0.936
0.478AspHis: 0.478 ± 0.294
2.63AspIle: 2.63 ± 1.257
1.913AspLys: 1.913 ± 0.788
1.674AspLeu: 1.674 ± 0.828
1.196AspMet: 1.196 ± 0.493
1.196AspAsn: 1.196 ± 0.614
0.239AspPro: 0.239 ± 0.237
0.478AspGln: 0.478 ± 0.298
1.196AspArg: 1.196 ± 0.517
1.435AspSer: 1.435 ± 0.624
1.435AspThr: 1.435 ± 0.759
1.913AspVal: 1.913 ± 0.698
0.239AspTrp: 0.239 ± 0.235
1.674AspTyr: 1.674 ± 0.688
0.0AspXaa: 0.0 ± 0.0
Glu
0.956GluAla: 0.956 ± 0.493
0.478GluCys: 0.478 ± 0.47
1.196GluAsp: 1.196 ± 0.465
6.695GluGlu: 6.695 ± 2.171
1.913GluPhe: 1.913 ± 0.692
2.869GluGly: 2.869 ± 0.908
0.717GluHis: 0.717 ± 0.375
2.152GluIle: 2.152 ± 0.64
6.456GluLys: 6.456 ± 1.897
4.304GluLeu: 4.304 ± 1.405
1.435GluMet: 1.435 ± 0.738
3.348GluAsn: 3.348 ± 1.146
1.196GluPro: 1.196 ± 0.49
1.674GluGln: 1.674 ± 0.604
3.109GluArg: 3.109 ± 1.23
1.674GluSer: 1.674 ± 0.694
1.196GluThr: 1.196 ± 0.577
3.109GluVal: 3.109 ± 1.047
0.478GluTrp: 0.478 ± 0.315
1.674GluTyr: 1.674 ± 0.925
0.0GluXaa: 0.0 ± 0.0
Phe
2.63PheAla: 2.63 ± 0.701
0.478PheCys: 0.478 ± 0.347
1.913PheAsp: 1.913 ± 0.807
1.674PheGlu: 1.674 ± 0.658
2.869PhePhe: 2.869 ± 1.004
3.826PheGly: 3.826 ± 0.673
0.239PheHis: 0.239 ± 0.237
3.826PheIle: 3.826 ± 0.948
1.674PheLys: 1.674 ± 0.527
5.978PheLeu: 5.978 ± 1.539
1.435PheMet: 1.435 ± 0.461
1.196PheAsn: 1.196 ± 0.478
2.869PhePro: 2.869 ± 0.944
1.196PheGln: 1.196 ± 0.404
1.435PheArg: 1.435 ± 0.555
5.022PheSer: 5.022 ± 1.312
6.695PheThr: 6.695 ± 1.152
4.543PheVal: 4.543 ± 0.8
0.956PheTrp: 0.956 ± 0.431
2.869PheTyr: 2.869 ± 0.867
0.0PheXaa: 0.0 ± 0.0
Gly
3.348GlyAla: 3.348 ± 0.913
0.239GlyCys: 0.239 ± 0.236
0.717GlyAsp: 0.717 ± 0.434
2.152GlyGlu: 2.152 ± 0.776
5.022GlyPhe: 5.022 ± 0.952
4.782GlyGly: 4.782 ± 1.146
0.239GlyHis: 0.239 ± 0.282
7.174GlyIle: 7.174 ± 1.13
4.543GlyLys: 4.543 ± 1.326
7.891GlyLeu: 7.891 ± 2.06
1.196GlyMet: 1.196 ± 0.464
3.348GlyAsn: 3.348 ± 0.621
3.348GlyPro: 3.348 ± 0.872
2.391GlyGln: 2.391 ± 0.731
2.152GlyArg: 2.152 ± 0.727
6.456GlySer: 6.456 ± 1.241
4.304GlyThr: 4.304 ± 1.349
5.261GlyVal: 5.261 ± 1.019
0.717GlyTrp: 0.717 ± 0.381
4.065GlyTyr: 4.065 ± 0.815
0.0GlyXaa: 0.0 ± 0.0
His
0.478HisAla: 0.478 ± 0.331
0.0HisCys: 0.0 ± 0.0
0.478HisAsp: 0.478 ± 0.332
0.239HisGlu: 0.239 ± 0.237
0.478HisPhe: 0.478 ± 0.277
0.239HisGly: 0.239 ± 0.235
0.478HisHis: 0.478 ± 0.315
1.196HisIle: 1.196 ± 0.538
0.478HisLys: 0.478 ± 0.296
1.196HisLeu: 1.196 ± 0.496
0.239HisMet: 0.239 ± 0.235
1.196HisAsn: 1.196 ± 0.571
0.478HisPro: 0.478 ± 0.374
0.0HisGln: 0.0 ± 0.0
0.239HisArg: 0.239 ± 0.236
0.478HisSer: 0.478 ± 0.305
0.478HisThr: 0.478 ± 0.361
0.956HisVal: 0.956 ± 0.483
0.0HisTrp: 0.0 ± 0.0
1.196HisTyr: 1.196 ± 0.457
0.0HisXaa: 0.0 ± 0.0
Ile
7.413IleAla: 7.413 ± 1.361
1.196IleCys: 1.196 ± 0.601
3.587IleAsp: 3.587 ± 0.998
2.152IleGlu: 2.152 ± 0.658
5.5IlePhe: 5.5 ± 1.217
7.891IleGly: 7.891 ± 1.335
1.196IleHis: 1.196 ± 0.554
7.652IleIle: 7.652 ± 1.118
3.348IleLys: 3.348 ± 1.317
10.282IleLeu: 10.282 ± 1.972
1.674IleMet: 1.674 ± 0.585
2.869IleAsn: 2.869 ± 0.714
4.304IlePro: 4.304 ± 1.11
1.913IleGln: 1.913 ± 0.538
2.63IleArg: 2.63 ± 0.868
8.847IleSer: 8.847 ± 1.536
4.304IleThr: 4.304 ± 1.048
7.174IleVal: 7.174 ± 1.415
0.239IleTrp: 0.239 ± 0.197
3.587IleTyr: 3.587 ± 1.193
0.0IleXaa: 0.0 ± 0.0
Lys
4.065LysAla: 4.065 ± 1.129
0.0LysCys: 0.0 ± 0.0
1.913LysAsp: 1.913 ± 0.771
4.065LysGlu: 4.065 ± 1.764
2.869LysPhe: 2.869 ± 0.787
3.348LysGly: 3.348 ± 0.911
0.956LysHis: 0.956 ± 0.44
4.065LysIle: 4.065 ± 1.379
5.739LysLys: 5.739 ± 1.89
5.978LysLeu: 5.978 ± 1.431
1.913LysMet: 1.913 ± 0.638
3.348LysAsn: 3.348 ± 1.142
3.109LysPro: 3.109 ± 1.186
3.109LysGln: 3.109 ± 1.039
3.109LysArg: 3.109 ± 0.906
3.109LysSer: 3.109 ± 0.851
4.065LysThr: 4.065 ± 0.791
3.109LysVal: 3.109 ± 1.14
0.956LysTrp: 0.956 ± 0.368
3.109LysTyr: 3.109 ± 1.174
0.0LysXaa: 0.0 ± 0.0
Leu
6.217LeuAla: 6.217 ± 1.065
0.956LeuCys: 0.956 ± 0.463
2.152LeuAsp: 2.152 ± 0.657
3.826LeuGlu: 3.826 ± 1.213
6.456LeuPhe: 6.456 ± 1.274
7.652LeuGly: 7.652 ± 1.847
0.478LeuHis: 0.478 ± 0.315
12.195LeuIle: 12.195 ± 2.161
6.695LeuLys: 6.695 ± 1.335
10.76LeuLeu: 10.76 ± 2.09
2.869LeuMet: 2.869 ± 0.784
5.022LeuAsn: 5.022 ± 1.453
4.782LeuPro: 4.782 ± 0.789
3.348LeuGln: 3.348 ± 1.053
4.543LeuArg: 4.543 ± 1.352
9.804LeuSer: 9.804 ± 1.74
8.608LeuThr: 8.608 ± 1.096
6.217LeuVal: 6.217 ± 1.811
1.196LeuTrp: 1.196 ± 0.666
4.782LeuTyr: 4.782 ± 1.006
0.0LeuXaa: 0.0 ± 0.0
Met
2.152MetAla: 2.152 ± 0.944
0.0MetCys: 0.0 ± 0.0
0.956MetAsp: 0.956 ± 0.506
1.674MetGlu: 1.674 ± 0.579
0.956MetPhe: 0.956 ± 0.451
2.391MetGly: 2.391 ± 0.606
0.478MetHis: 0.478 ± 0.332
0.956MetIle: 0.956 ± 0.455
1.435MetLys: 1.435 ± 0.761
2.152MetLeu: 2.152 ± 1.154
0.239MetMet: 0.239 ± 0.251
0.717MetAsn: 0.717 ± 0.482
1.196MetPro: 1.196 ± 0.415
0.0MetGln: 0.0 ± 0.0
1.913MetArg: 1.913 ± 0.538
1.196MetSer: 1.196 ± 0.537
0.717MetThr: 0.717 ± 0.5
1.674MetVal: 1.674 ± 0.57
0.0MetTrp: 0.0 ± 0.0
0.717MetTyr: 0.717 ± 0.749
0.0MetXaa: 0.0 ± 0.0
Asn
4.065AsnAla: 4.065 ± 0.95
0.478AsnCys: 0.478 ± 0.365
1.196AsnAsp: 1.196 ± 0.347
2.63AsnGlu: 2.63 ± 1.04
2.152AsnPhe: 2.152 ± 0.798
4.782AsnGly: 4.782 ± 1.265
0.239AsnHis: 0.239 ± 0.197
4.065AsnIle: 4.065 ± 1.009
1.435AsnLys: 1.435 ± 0.582
5.022AsnLeu: 5.022 ± 0.768
0.956AsnMet: 0.956 ± 0.446
4.304AsnAsn: 4.304 ± 1.571
3.109AsnPro: 3.109 ± 1.122
1.913AsnGln: 1.913 ± 0.725
0.478AsnArg: 0.478 ± 0.339
6.934AsnSer: 6.934 ± 1.668
3.826AsnThr: 3.826 ± 1.326
4.543AsnVal: 4.543 ± 0.993
0.478AsnTrp: 0.478 ± 0.302
3.109AsnTyr: 3.109 ± 1.052
0.0AsnXaa: 0.0 ± 0.0
Pro
2.391ProAla: 2.391 ± 0.687
0.239ProCys: 0.239 ± 0.241
1.435ProAsp: 1.435 ± 0.597
1.435ProGlu: 1.435 ± 0.522
4.065ProPhe: 4.065 ± 0.802
1.913ProGly: 1.913 ± 0.661
0.239ProHis: 0.239 ± 0.24
3.587ProIle: 3.587 ± 1.143
2.152ProLys: 2.152 ± 0.89
5.022ProLeu: 5.022 ± 1.835
0.717ProMet: 0.717 ± 0.485
3.348ProAsn: 3.348 ± 1.023
3.587ProPro: 3.587 ± 1.144
1.196ProGln: 1.196 ± 0.622
1.196ProArg: 1.196 ± 0.461
4.304ProSer: 4.304 ± 0.725
3.109ProThr: 3.109 ± 0.772
2.391ProVal: 2.391 ± 0.798
1.196ProTrp: 1.196 ± 0.564
2.152ProTyr: 2.152 ± 0.604
0.0ProXaa: 0.0 ± 0.0
Gln
1.196GlnAla: 1.196 ± 0.393
0.0GlnCys: 0.0 ± 0.0
0.239GlnAsp: 0.239 ± 0.236
1.196GlnGlu: 1.196 ± 0.599
1.196GlnPhe: 1.196 ± 0.501
1.913GlnGly: 1.913 ± 0.59
0.717GlnHis: 0.717 ± 0.384
3.109GlnIle: 3.109 ± 1.025
2.869GlnLys: 2.869 ± 0.784
3.826GlnLeu: 3.826 ± 0.738
0.478GlnMet: 0.478 ± 0.381
1.913GlnAsn: 1.913 ± 0.688
0.478GlnPro: 0.478 ± 0.278
1.196GlnGln: 1.196 ± 0.544
1.196GlnArg: 1.196 ± 0.617
1.674GlnSer: 1.674 ± 0.649
3.826GlnThr: 3.826 ± 0.833
1.674GlnVal: 1.674 ± 0.734
0.478GlnTrp: 0.478 ± 0.256
1.913GlnTyr: 1.913 ± 0.586
0.0GlnXaa: 0.0 ± 0.0
Arg
1.674ArgAla: 1.674 ± 0.564
0.0ArgCys: 0.0 ± 0.0
1.435ArgAsp: 1.435 ± 0.662
3.109ArgGlu: 3.109 ± 1.032
1.196ArgPhe: 1.196 ± 0.593
1.435ArgGly: 1.435 ± 0.598
0.717ArgHis: 0.717 ± 0.399
2.63ArgIle: 2.63 ± 0.691
3.587ArgLys: 3.587 ± 1.447
4.065ArgLeu: 4.065 ± 1.241
0.239ArgMet: 0.239 ± 0.305
2.152ArgAsn: 2.152 ± 0.749
0.717ArgPro: 0.717 ± 0.431
1.435ArgGln: 1.435 ± 0.633
3.587ArgArg: 3.587 ± 1.092
0.956ArgSer: 0.956 ± 0.47
2.152ArgThr: 2.152 ± 0.688
2.391ArgVal: 2.391 ± 0.782
0.239ArgTrp: 0.239 ± 0.218
0.956ArgTyr: 0.956 ± 0.458
0.0ArgXaa: 0.0 ± 0.0
Ser
4.543SerAla: 4.543 ± 0.952
0.239SerCys: 0.239 ± 0.206
2.391SerAsp: 2.391 ± 0.646
3.826SerGlu: 3.826 ± 1.481
5.5SerPhe: 5.5 ± 1.339
6.695SerGly: 6.695 ± 1.443
0.717SerHis: 0.717 ± 0.415
7.891SerIle: 7.891 ± 1.51
5.5SerLys: 5.5 ± 1.177
7.413SerLeu: 7.413 ± 1.398
1.435SerMet: 1.435 ± 0.427
5.022SerAsn: 5.022 ± 1.555
3.826SerPro: 3.826 ± 1.164
2.152SerGln: 2.152 ± 0.738
1.913SerArg: 1.913 ± 0.723
7.174SerSer: 7.174 ± 2.011
3.348SerThr: 3.348 ± 1.004
4.304SerVal: 4.304 ± 1.141
1.435SerTrp: 1.435 ± 0.774
5.739SerTyr: 5.739 ± 1.797
0.0SerXaa: 0.0 ± 0.0
Thr
2.869ThrAla: 2.869 ± 0.629
0.239ThrCys: 0.239 ± 0.197
1.913ThrAsp: 1.913 ± 0.639
2.152ThrGlu: 2.152 ± 0.591
3.348ThrPhe: 3.348 ± 0.933
4.304ThrGly: 4.304 ± 0.956
1.435ThrHis: 1.435 ± 0.534
8.608ThrIle: 8.608 ± 1.909
1.435ThrLys: 1.435 ± 0.668
7.652ThrLeu: 7.652 ± 1.347
1.913ThrMet: 1.913 ± 0.741
4.782ThrAsn: 4.782 ± 1.058
2.869ThrPro: 2.869 ± 0.697
2.869ThrGln: 2.869 ± 1.029
1.913ThrArg: 1.913 ± 0.761
4.782ThrSer: 4.782 ± 0.986
6.456ThrThr: 6.456 ± 1.363
4.543ThrVal: 4.543 ± 1.15
1.435ThrTrp: 1.435 ± 0.422
5.261ThrTyr: 5.261 ± 1.416
0.0ThrXaa: 0.0 ± 0.0
Val
3.109ValAla: 3.109 ± 0.946
0.717ValCys: 0.717 ± 0.44
1.196ValAsp: 1.196 ± 0.379
2.63ValGlu: 2.63 ± 1.034
1.913ValPhe: 1.913 ± 0.856
4.782ValGly: 4.782 ± 0.981
0.478ValHis: 0.478 ± 0.327
4.782ValIle: 4.782 ± 1.293
3.348ValLys: 3.348 ± 0.945
7.652ValLeu: 7.652 ± 1.34
0.956ValMet: 0.956 ± 0.383
4.304ValAsn: 4.304 ± 1.529
2.63ValPro: 2.63 ± 1.051
2.63ValGln: 2.63 ± 0.723
0.717ValArg: 0.717 ± 0.555
8.13ValSer: 8.13 ± 1.7
8.608ValThr: 8.608 ± 2.219
3.826ValVal: 3.826 ± 0.815
0.717ValTrp: 0.717 ± 0.362
4.065ValTyr: 4.065 ± 0.865
0.0ValXaa: 0.0 ± 0.0
Trp
0.478TrpAla: 0.478 ± 0.256
0.0TrpCys: 0.0 ± 0.0
0.239TrpAsp: 0.239 ± 0.235
0.478TrpGlu: 0.478 ± 0.312
0.956TrpPhe: 0.956 ± 0.406
0.956TrpGly: 0.956 ± 0.419
0.0TrpHis: 0.0 ± 0.0
1.196TrpIle: 1.196 ± 0.806
0.717TrpLys: 0.717 ± 0.351
2.391TrpLeu: 2.391 ± 0.607
0.478TrpMet: 0.478 ± 0.308
0.478TrpAsn: 0.478 ± 0.325
0.717TrpPro: 0.717 ± 0.395
0.478TrpGln: 0.478 ± 0.336
0.478TrpArg: 0.478 ± 0.332
0.956TrpSer: 0.956 ± 0.599
0.478TrpThr: 0.478 ± 0.304
0.956TrpVal: 0.956 ± 0.41
0.239TrpTrp: 0.239 ± 0.237
1.196TrpTyr: 1.196 ± 0.775
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.304TyrAla: 4.304 ± 0.971
0.0TyrCys: 0.0 ± 0.0
0.478TyrAsp: 0.478 ± 0.319
2.152TyrGlu: 2.152 ± 0.653
1.674TyrPhe: 1.674 ± 0.688
3.348TyrGly: 3.348 ± 0.968
0.478TyrHis: 0.478 ± 0.253
4.065TyrIle: 4.065 ± 0.817
3.348TyrLys: 3.348 ± 1.037
7.413TyrLeu: 7.413 ± 1.172
0.717TyrMet: 0.717 ± 0.387
2.869TyrAsn: 2.869 ± 1.21
2.63TyrPro: 2.63 ± 0.901
0.956TyrGln: 0.956 ± 0.569
2.152TyrArg: 2.152 ± 0.688
3.587TyrSer: 3.587 ± 1.075
4.782TyrThr: 4.782 ± 1.38
5.739TyrVal: 5.739 ± 0.92
0.956TyrTrp: 0.956 ± 0.425
4.065TyrTyr: 4.065 ± 1.149
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (4183 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski