Amino acid dipepetide frequency for Sulfolobus spindle-shaped virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
0.994AlaAsp: 0.994 ± 0.342
4.372AlaGlu: 4.372 ± 1.061
2.782AlaPhe: 2.782 ± 0.688
0.994AlaGly: 0.994 ± 0.356
0.199AlaHis: 0.199 ± 0.215
3.577AlaIle: 3.577 ± 0.726
5.763AlaLys: 5.763 ± 1.416
6.359AlaLeu: 6.359 ± 1.115
0.199AlaMet: 0.199 ± 0.207
2.385AlaAsn: 2.385 ± 0.761
2.186AlaPro: 2.186 ± 0.936
2.186AlaGln: 2.186 ± 1.146
1.987AlaArg: 1.987 ± 0.68
3.18AlaSer: 3.18 ± 0.81
3.18AlaThr: 3.18 ± 0.85
6.161AlaVal: 6.161 ± 0.939
1.192AlaTrp: 1.192 ± 0.434
2.385AlaTyr: 2.385 ± 0.599
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.199CysAsp: 0.199 ± 0.174
0.199CysGlu: 0.199 ± 0.2
0.397CysPhe: 0.397 ± 0.296
0.596CysGly: 0.596 ± 0.385
0.0CysHis: 0.0 ± 0.0
0.397CysIle: 0.397 ± 0.293
0.397CysLys: 0.397 ± 0.273
0.795CysLeu: 0.795 ± 0.425
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.994CysPro: 0.994 ± 0.577
0.397CysGln: 0.397 ± 0.253
0.199CysArg: 0.199 ± 0.204
0.397CysSer: 0.397 ± 0.294
0.0CysThr: 0.0 ± 0.0
0.596CysVal: 0.596 ± 0.353
0.199CysTrp: 0.199 ± 0.214
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.385AspAla: 2.385 ± 1.051
0.199AspCys: 0.199 ± 0.204
1.192AspAsp: 1.192 ± 0.526
2.385AspGlu: 2.385 ± 0.969
1.789AspPhe: 1.789 ± 0.535
1.987AspGly: 1.987 ± 0.758
0.596AspHis: 0.596 ± 0.371
4.173AspIle: 4.173 ± 0.924
1.987AspLys: 1.987 ± 0.879
3.776AspLeu: 3.776 ± 0.876
0.596AspMet: 0.596 ± 0.334
1.789AspAsn: 1.789 ± 0.657
1.391AspPro: 1.391 ± 0.523
0.199AspGln: 0.199 ± 0.204
0.397AspArg: 0.397 ± 0.289
1.987AspSer: 1.987 ± 0.777
1.59AspThr: 1.59 ± 0.603
3.577AspVal: 3.577 ± 0.826
0.596AspTrp: 0.596 ± 0.294
2.186AspTyr: 2.186 ± 0.586
0.0AspXaa: 0.0 ± 0.0
Glu
4.173GluAla: 4.173 ± 1.28
0.397GluCys: 0.397 ± 0.275
3.577GluAsp: 3.577 ± 0.776
5.962GluGlu: 5.962 ± 1.63
1.987GluPhe: 1.987 ± 0.491
1.987GluGly: 1.987 ± 0.671
1.192GluHis: 1.192 ± 0.557
4.173GluIle: 4.173 ± 1.021
4.372GluLys: 4.372 ± 1.17
9.539GluLeu: 9.539 ± 2.421
1.987GluMet: 1.987 ± 0.701
3.378GluAsn: 3.378 ± 1.065
0.994GluPro: 0.994 ± 0.578
2.186GluGln: 2.186 ± 0.72
2.583GluArg: 2.583 ± 1.064
1.391GluSer: 1.391 ± 0.499
2.385GluThr: 2.385 ± 0.619
4.571GluVal: 4.571 ± 0.932
0.596GluTrp: 0.596 ± 0.468
3.18GluTyr: 3.18 ± 0.763
0.0GluXaa: 0.0 ± 0.0
Phe
2.385PheAla: 2.385 ± 0.867
0.0PheCys: 0.0 ± 0.0
1.192PheAsp: 1.192 ± 0.546
1.987PheGlu: 1.987 ± 0.899
2.981PhePhe: 2.981 ± 0.988
3.975PheGly: 3.975 ± 0.797
1.192PheHis: 1.192 ± 0.552
3.577PheIle: 3.577 ± 1.141
3.378PheLys: 3.378 ± 0.949
5.366PheLeu: 5.366 ± 1.002
0.994PheMet: 0.994 ± 0.449
2.981PheAsn: 2.981 ± 1.06
1.59PhePro: 1.59 ± 0.457
1.192PheGln: 1.192 ± 0.466
1.391PheArg: 1.391 ± 0.593
3.378PheSer: 3.378 ± 1.08
2.782PheThr: 2.782 ± 0.621
3.776PheVal: 3.776 ± 0.997
0.795PheTrp: 0.795 ± 0.323
5.167PheTyr: 5.167 ± 0.681
0.0PheXaa: 0.0 ± 0.0
Gly
1.789GlyAla: 1.789 ± 0.45
0.0GlyCys: 0.0 ± 0.0
3.378GlyAsp: 3.378 ± 0.832
2.385GlyGlu: 2.385 ± 0.546
3.577GlyPhe: 3.577 ± 0.804
3.577GlyGly: 3.577 ± 1.404
0.0GlyHis: 0.0 ± 0.0
4.968GlyIle: 4.968 ± 0.841
4.372GlyLys: 4.372 ± 1.118
4.769GlyLeu: 4.769 ± 1.134
1.391GlyMet: 1.391 ± 0.54
4.571GlyAsn: 4.571 ± 1.539
1.59GlyPro: 1.59 ± 0.501
1.59GlyGln: 1.59 ± 0.809
2.583GlyArg: 2.583 ± 0.918
4.173GlySer: 4.173 ± 1.05
5.366GlyThr: 5.366 ± 1.56
3.378GlyVal: 3.378 ± 0.638
0.795GlyTrp: 0.795 ± 0.386
3.378GlyTyr: 3.378 ± 1.014
0.0GlyXaa: 0.0 ± 0.0
His
0.795HisAla: 0.795 ± 0.382
0.0HisCys: 0.0 ± 0.0
0.199HisAsp: 0.199 ± 0.223
0.795HisGlu: 0.795 ± 0.333
0.795HisPhe: 0.795 ± 0.417
0.596HisGly: 0.596 ± 0.363
0.199HisHis: 0.199 ± 0.204
1.192HisIle: 1.192 ± 0.434
0.795HisLys: 0.795 ± 0.335
1.391HisLeu: 1.391 ± 0.588
0.0HisMet: 0.0 ± 0.0
1.789HisAsn: 1.789 ± 0.659
0.0HisPro: 0.0 ± 0.0
0.397HisGln: 0.397 ± 0.21
0.596HisArg: 0.596 ± 0.371
1.192HisSer: 1.192 ± 0.402
0.397HisThr: 0.397 ± 0.285
0.994HisVal: 0.994 ± 0.456
0.0HisTrp: 0.0 ± 0.0
1.391HisTyr: 1.391 ± 0.47
0.0HisXaa: 0.0 ± 0.0
Ile
3.975IleAla: 3.975 ± 0.986
0.397IleCys: 0.397 ± 0.274
3.577IleAsp: 3.577 ± 0.615
4.173IleGlu: 4.173 ± 1.002
5.564IlePhe: 5.564 ± 1.653
4.372IleGly: 4.372 ± 0.895
1.391IleHis: 1.391 ± 0.485
6.955IleIle: 6.955 ± 1.506
6.161IleLys: 6.161 ± 1.466
7.154IleLeu: 7.154 ± 1.187
0.994IleMet: 0.994 ± 0.385
3.975IleAsn: 3.975 ± 0.881
2.583IlePro: 2.583 ± 0.654
2.583IleGln: 2.583 ± 0.604
3.975IleArg: 3.975 ± 1.043
4.173IleSer: 4.173 ± 0.686
4.968IleThr: 4.968 ± 0.962
4.571IleVal: 4.571 ± 0.852
1.391IleTrp: 1.391 ± 0.694
4.372IleTyr: 4.372 ± 0.725
0.0IleXaa: 0.0 ± 0.0
Lys
4.173LysAla: 4.173 ± 1.234
0.994LysCys: 0.994 ± 0.496
3.18LysAsp: 3.18 ± 0.867
6.757LysGlu: 6.757 ± 2.071
2.782LysPhe: 2.782 ± 0.949
4.372LysGly: 4.372 ± 1.41
1.59LysHis: 1.59 ± 0.653
6.757LysIle: 6.757 ± 1.412
9.539LysLys: 9.539 ± 3.17
8.943LysLeu: 8.943 ± 2.294
1.987LysMet: 1.987 ± 0.659
2.981LysAsn: 2.981 ± 0.836
0.795LysPro: 0.795 ± 0.477
3.18LysGln: 3.18 ± 0.939
2.981LysArg: 2.981 ± 1.055
2.385LysSer: 2.385 ± 0.74
4.571LysThr: 4.571 ± 1.082
4.372LysVal: 4.372 ± 1.669
0.795LysTrp: 0.795 ± 0.449
4.769LysTyr: 4.769 ± 0.901
0.0LysXaa: 0.0 ± 0.0
Leu
6.955LeuAla: 6.955 ± 0.998
0.795LeuCys: 0.795 ± 0.537
3.577LeuAsp: 3.577 ± 0.711
6.558LeuGlu: 6.558 ± 1.415
6.757LeuPhe: 6.757 ± 1.104
5.564LeuGly: 5.564 ± 1.47
1.391LeuHis: 1.391 ± 0.711
8.148LeuIle: 8.148 ± 1.433
6.955LeuLys: 6.955 ± 2.03
12.122LeuLeu: 12.122 ± 1.807
2.981LeuMet: 2.981 ± 0.84
5.962LeuAsn: 5.962 ± 0.828
4.173LeuPro: 4.173 ± 0.573
3.378LeuGln: 3.378 ± 0.721
3.776LeuArg: 3.776 ± 1.279
6.757LeuSer: 6.757 ± 1.189
6.558LeuThr: 6.558 ± 1.342
6.955LeuVal: 6.955 ± 1.217
0.994LeuTrp: 0.994 ± 0.413
4.571LeuTyr: 4.571 ± 0.884
0.0LeuXaa: 0.0 ± 0.0
Met
1.59MetAla: 1.59 ± 0.563
0.199MetCys: 0.199 ± 0.174
0.596MetAsp: 0.596 ± 0.293
1.987MetGlu: 1.987 ± 0.566
1.391MetPhe: 1.391 ± 0.457
2.782MetGly: 2.782 ± 0.941
0.199MetHis: 0.199 ± 0.223
0.596MetIle: 0.596 ± 0.281
1.987MetLys: 1.987 ± 0.776
1.987MetLeu: 1.987 ± 0.749
1.391MetMet: 1.391 ± 0.434
1.391MetAsn: 1.391 ± 0.461
0.596MetPro: 0.596 ± 0.352
0.795MetGln: 0.795 ± 0.486
0.795MetArg: 0.795 ± 0.44
2.385MetSer: 2.385 ± 0.629
0.795MetThr: 0.795 ± 0.5
0.994MetVal: 0.994 ± 0.382
0.596MetTrp: 0.596 ± 0.279
0.397MetTyr: 0.397 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
2.583AsnAla: 2.583 ± 0.741
0.0AsnCys: 0.0 ± 0.0
2.186AsnAsp: 2.186 ± 0.61
3.975AsnGlu: 3.975 ± 1.148
2.385AsnPhe: 2.385 ± 0.755
4.968AsnGly: 4.968 ± 2.126
0.596AsnHis: 0.596 ± 0.317
3.776AsnIle: 3.776 ± 0.71
3.577AsnLys: 3.577 ± 1.077
4.372AsnLeu: 4.372 ± 0.884
1.391AsnMet: 1.391 ± 0.435
4.968AsnAsn: 4.968 ± 1.194
4.173AsnPro: 4.173 ± 1.082
2.385AsnGln: 2.385 ± 0.616
1.987AsnArg: 1.987 ± 0.651
3.776AsnSer: 3.776 ± 0.945
3.18AsnThr: 3.18 ± 1.16
4.769AsnVal: 4.769 ± 0.824
1.391AsnTrp: 1.391 ± 0.524
4.173AsnTyr: 4.173 ± 1.224
0.0AsnXaa: 0.0 ± 0.0
Pro
1.987ProAla: 1.987 ± 0.737
0.199ProCys: 0.199 ± 0.204
0.994ProAsp: 0.994 ± 0.632
0.596ProGlu: 0.596 ± 0.349
1.391ProPhe: 1.391 ± 0.464
1.192ProGly: 1.192 ± 0.478
0.199ProHis: 0.199 ± 0.143
1.59ProIle: 1.59 ± 0.584
1.987ProLys: 1.987 ± 0.624
4.372ProLeu: 4.372 ± 0.814
0.795ProMet: 0.795 ± 0.447
2.385ProAsn: 2.385 ± 0.886
2.782ProPro: 2.782 ± 0.694
2.385ProGln: 2.385 ± 0.656
0.397ProArg: 0.397 ± 0.449
3.378ProSer: 3.378 ± 1.128
3.577ProThr: 3.577 ± 1.339
2.782ProVal: 2.782 ± 0.763
0.994ProTrp: 0.994 ± 0.404
2.385ProTyr: 2.385 ± 0.597
0.0ProXaa: 0.0 ± 0.0
Gln
1.987GlnAla: 1.987 ± 0.82
0.199GlnCys: 0.199 ± 0.214
1.192GlnAsp: 1.192 ± 0.448
1.59GlnGlu: 1.59 ± 0.547
1.192GlnPhe: 1.192 ± 0.509
1.987GlnGly: 1.987 ± 0.49
0.795GlnHis: 0.795 ± 0.487
4.571GlnIle: 4.571 ± 0.793
3.776GlnLys: 3.776 ± 1.107
2.782GlnLeu: 2.782 ± 0.879
1.59GlnMet: 1.59 ± 0.534
1.789GlnAsn: 1.789 ± 0.518
1.192GlnPro: 1.192 ± 0.454
1.391GlnGln: 1.391 ± 0.424
0.795GlnArg: 0.795 ± 0.313
1.987GlnSer: 1.987 ± 0.612
2.981GlnThr: 2.981 ± 1.044
1.59GlnVal: 1.59 ± 0.586
0.795GlnTrp: 0.795 ± 0.419
1.192GlnTyr: 1.192 ± 0.528
0.0GlnXaa: 0.0 ± 0.0
Arg
1.192ArgAla: 1.192 ± 0.567
0.397ArgCys: 0.397 ± 0.29
1.391ArgAsp: 1.391 ± 0.747
3.378ArgGlu: 3.378 ± 0.916
1.789ArgPhe: 1.789 ± 0.617
1.192ArgGly: 1.192 ± 0.609
0.795ArgHis: 0.795 ± 0.417
1.59ArgIle: 1.59 ± 0.731
4.769ArgLys: 4.769 ± 1.587
2.583ArgLeu: 2.583 ± 0.82
1.192ArgMet: 1.192 ± 0.724
1.59ArgAsn: 1.59 ± 0.569
0.199ArgPro: 0.199 ± 0.22
1.192ArgGln: 1.192 ± 0.499
2.186ArgArg: 2.186 ± 0.715
1.59ArgSer: 1.59 ± 0.63
1.192ArgThr: 1.192 ± 0.414
3.577ArgVal: 3.577 ± 0.938
0.994ArgTrp: 0.994 ± 0.396
1.391ArgTyr: 1.391 ± 0.525
0.0ArgXaa: 0.0 ± 0.0
Ser
3.378SerAla: 3.378 ± 0.761
0.199SerCys: 0.199 ± 0.204
1.391SerAsp: 1.391 ± 0.391
2.981SerGlu: 2.981 ± 0.649
2.186SerPhe: 2.186 ± 0.614
4.571SerGly: 4.571 ± 0.995
0.596SerHis: 0.596 ± 0.392
4.372SerIle: 4.372 ± 0.86
5.366SerLys: 5.366 ± 1.594
5.763SerLeu: 5.763 ± 0.851
0.795SerMet: 0.795 ± 0.331
2.981SerAsn: 2.981 ± 0.687
2.782SerPro: 2.782 ± 0.858
3.18SerGln: 3.18 ± 0.846
2.186SerArg: 2.186 ± 0.721
4.769SerSer: 4.769 ± 1.241
4.173SerThr: 4.173 ± 0.897
4.173SerVal: 4.173 ± 1.144
1.987SerTrp: 1.987 ± 0.629
5.564SerTyr: 5.564 ± 1.148
0.0SerXaa: 0.0 ± 0.0
Thr
2.583ThrAla: 2.583 ± 0.729
0.397ThrCys: 0.397 ± 0.288
0.596ThrAsp: 0.596 ± 0.396
3.378ThrGlu: 3.378 ± 1.18
2.981ThrPhe: 2.981 ± 0.601
3.975ThrGly: 3.975 ± 1.125
0.994ThrHis: 0.994 ± 0.455
5.366ThrIle: 5.366 ± 1.648
3.18ThrLys: 3.18 ± 1.245
7.552ThrLeu: 7.552 ± 1.023
1.789ThrMet: 1.789 ± 0.672
4.769ThrAsn: 4.769 ± 1.261
2.583ThrPro: 2.583 ± 0.713
2.186ThrGln: 2.186 ± 0.552
1.789ThrArg: 1.789 ± 0.736
4.173ThrSer: 4.173 ± 1.136
4.968ThrThr: 4.968 ± 1.397
4.571ThrVal: 4.571 ± 1.532
0.795ThrTrp: 0.795 ± 0.368
3.18ThrTyr: 3.18 ± 1.247
0.0ThrXaa: 0.0 ± 0.0
Val
3.577ValAla: 3.577 ± 0.808
0.994ValCys: 0.994 ± 0.856
2.782ValAsp: 2.782 ± 0.633
3.975ValGlu: 3.975 ± 0.881
3.378ValPhe: 3.378 ± 0.977
4.968ValGly: 4.968 ± 0.935
0.795ValHis: 0.795 ± 0.36
5.366ValIle: 5.366 ± 1.117
5.167ValLys: 5.167 ± 1.26
5.167ValLeu: 5.167 ± 1.085
1.192ValMet: 1.192 ± 0.541
6.757ValAsn: 6.757 ± 1.374
3.18ValPro: 3.18 ± 0.723
1.987ValGln: 1.987 ± 0.844
1.789ValArg: 1.789 ± 0.574
8.545ValSer: 8.545 ± 1.298
4.571ValThr: 4.571 ± 0.824
6.955ValVal: 6.955 ± 1.424
1.789ValTrp: 1.789 ± 0.571
3.18ValTyr: 3.18 ± 0.8
0.0ValXaa: 0.0 ± 0.0
Trp
0.994TrpAla: 0.994 ± 0.544
0.0TrpCys: 0.0 ± 0.0
0.397TrpAsp: 0.397 ± 0.25
0.795TrpGlu: 0.795 ± 0.329
0.596TrpPhe: 0.596 ± 0.271
1.192TrpGly: 1.192 ± 0.44
0.397TrpHis: 0.397 ± 0.225
1.987TrpIle: 1.987 ± 0.628
1.59TrpLys: 1.59 ± 0.676
2.583TrpLeu: 2.583 ± 0.703
0.199TrpMet: 0.199 ± 0.225
1.192TrpAsn: 1.192 ± 0.576
0.596TrpPro: 0.596 ± 0.296
0.994TrpGln: 0.994 ± 0.547
0.596TrpArg: 0.596 ± 0.414
1.192TrpSer: 1.192 ± 0.447
0.397TrpThr: 0.397 ± 0.302
1.391TrpVal: 1.391 ± 0.649
0.596TrpTrp: 0.596 ± 0.239
0.795TrpTyr: 0.795 ± 0.461
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.577TyrAla: 3.577 ± 0.825
0.397TyrCys: 0.397 ± 0.294
2.186TyrAsp: 2.186 ± 0.579
2.385TyrGlu: 2.385 ± 0.698
3.378TyrPhe: 3.378 ± 0.845
2.782TyrGly: 2.782 ± 1.312
0.397TyrHis: 0.397 ± 0.215
4.173TyrIle: 4.173 ± 0.817
2.782TyrLys: 2.782 ± 0.733
7.552TyrLeu: 7.552 ± 1.131
1.789TyrMet: 1.789 ± 0.585
2.981TyrAsn: 2.981 ± 0.584
1.987TyrPro: 1.987 ± 0.778
1.59TyrGln: 1.59 ± 0.681
1.391TyrArg: 1.391 ± 0.536
2.583TyrSer: 2.583 ± 0.804
4.173TyrThr: 4.173 ± 1.243
6.359TyrVal: 6.359 ± 0.965
1.192TyrTrp: 1.192 ± 0.522
3.975TyrTyr: 3.975 ± 1.059
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (5033 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski