Amino acid dipepetide frequency for Sulfolobus polyhedral virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.499AlaAla: 0.499 ± 0.272
0.333AlaCys: 0.333 ± 0.216
2.997AlaAsp: 2.997 ± 0.889
3.829AlaGlu: 3.829 ± 0.755
2.664AlaPhe: 2.664 ± 0.77
3.329AlaGly: 3.329 ± 0.879
0.499AlaHis: 0.499 ± 0.304
4.495AlaIle: 4.495 ± 0.79
5.327AlaLys: 5.327 ± 1.11
6.659AlaLeu: 6.659 ± 1.408
1.665AlaMet: 1.665 ± 0.473
4.828AlaAsn: 4.828 ± 0.936
0.832AlaPro: 0.832 ± 0.331
3.163AlaGln: 3.163 ± 0.715
1.665AlaArg: 1.665 ± 0.576
4.162AlaSer: 4.162 ± 1.123
2.497AlaThr: 2.497 ± 0.525
4.495AlaVal: 4.495 ± 0.922
0.333AlaTrp: 0.333 ± 0.257
4.994AlaTyr: 4.994 ± 0.81
0.0AlaXaa: 0.0 ± 0.0
Cys
0.666CysAla: 0.666 ± 0.31
0.0CysCys: 0.0 ± 0.0
1.165CysAsp: 1.165 ± 0.537
0.999CysGlu: 0.999 ± 0.47
0.666CysPhe: 0.666 ± 0.297
1.165CysGly: 1.165 ± 0.564
0.166CysHis: 0.166 ± 0.181
0.333CysIle: 0.333 ± 0.22
0.999CysLys: 0.999 ± 0.422
0.666CysLeu: 0.666 ± 0.27
0.333CysMet: 0.333 ± 0.279
0.832CysAsn: 0.832 ± 0.339
0.666CysPro: 0.666 ± 0.286
0.0CysGln: 0.0 ± 0.0
1.165CysArg: 1.165 ± 0.43
0.832CysSer: 0.832 ± 0.524
0.166CysThr: 0.166 ± 0.177
0.832CysVal: 0.832 ± 0.326
0.0CysTrp: 0.0 ± 0.0
0.499CysTyr: 0.499 ± 0.316
0.0CysXaa: 0.0 ± 0.0
Asp
2.497AspAla: 2.497 ± 0.606
1.332AspCys: 1.332 ± 0.52
1.665AspAsp: 1.665 ± 0.65
4.162AspGlu: 4.162 ± 1.246
1.998AspPhe: 1.998 ± 0.776
1.665AspGly: 1.665 ± 0.553
0.166AspHis: 0.166 ± 0.146
4.828AspIle: 4.828 ± 0.797
2.83AspLys: 2.83 ± 1.081
4.162AspLeu: 4.162 ± 0.94
1.332AspMet: 1.332 ± 0.4
4.828AspAsn: 4.828 ± 0.916
1.498AspPro: 1.498 ± 0.392
0.333AspGln: 0.333 ± 0.216
0.999AspArg: 0.999 ± 0.553
1.665AspSer: 1.665 ± 0.43
1.831AspThr: 1.831 ± 0.513
2.997AspVal: 2.997 ± 0.537
0.333AspTrp: 0.333 ± 0.212
2.497AspTyr: 2.497 ± 0.641
0.0AspXaa: 0.0 ± 0.0
Glu
3.163GluAla: 3.163 ± 0.87
0.666GluCys: 0.666 ± 0.317
3.329GluAsp: 3.329 ± 0.942
4.828GluGlu: 4.828 ± 1.099
2.164GluPhe: 2.164 ± 0.558
2.83GluGly: 2.83 ± 0.587
0.666GluHis: 0.666 ± 0.301
4.495GluIle: 4.495 ± 1.166
3.662GluLys: 3.662 ± 1.043
7.158GluLeu: 7.158 ± 1.403
1.998GluMet: 1.998 ± 0.588
2.664GluAsn: 2.664 ± 0.755
1.165GluPro: 1.165 ± 0.439
2.497GluGln: 2.497 ± 0.656
1.831GluArg: 1.831 ± 0.676
1.831GluSer: 1.831 ± 0.597
2.664GluThr: 2.664 ± 0.761
2.997GluVal: 2.997 ± 0.743
0.499GluTrp: 0.499 ± 0.279
3.662GluTyr: 3.662 ± 0.739
0.0GluXaa: 0.0 ± 0.0
Phe
2.83PheAla: 2.83 ± 0.687
1.165PheCys: 1.165 ± 0.437
1.332PheAsp: 1.332 ± 0.596
1.332PheGlu: 1.332 ± 0.425
0.999PhePhe: 0.999 ± 0.445
2.331PheGly: 2.331 ± 0.849
0.0PheHis: 0.0 ± 0.0
3.662PheIle: 3.662 ± 0.702
1.332PheLys: 1.332 ± 0.49
3.496PheLeu: 3.496 ± 1.005
0.499PheMet: 0.499 ± 0.327
3.496PheAsn: 3.496 ± 0.783
1.998PhePro: 1.998 ± 0.665
0.832PheGln: 0.832 ± 0.331
0.666PheArg: 0.666 ± 0.284
3.995PheSer: 3.995 ± 0.7
2.997PheThr: 2.997 ± 0.719
0.999PheVal: 0.999 ± 0.462
0.333PheTrp: 0.333 ± 0.224
1.831PheTyr: 1.831 ± 0.528
0.0PheXaa: 0.0 ± 0.0
Gly
2.83GlyAla: 2.83 ± 0.557
0.832GlyCys: 0.832 ± 0.533
2.164GlyAsp: 2.164 ± 0.542
2.664GlyGlu: 2.664 ± 0.679
3.496GlyPhe: 3.496 ± 0.646
5.66GlyGly: 5.66 ± 1.216
0.333GlyHis: 0.333 ± 0.208
3.995GlyIle: 3.995 ± 0.927
3.496GlyLys: 3.496 ± 0.949
6.159GlyLeu: 6.159 ± 1.024
1.665GlyMet: 1.665 ± 0.431
3.829GlyAsn: 3.829 ± 1.356
0.499GlyPro: 0.499 ± 0.339
2.997GlyGln: 2.997 ± 0.645
0.333GlyArg: 0.333 ± 0.383
4.661GlySer: 4.661 ± 1.023
5.827GlyThr: 5.827 ± 1.181
4.495GlyVal: 4.495 ± 1.091
0.832GlyTrp: 0.832 ± 0.425
3.329GlyTyr: 3.329 ± 0.928
0.0GlyXaa: 0.0 ± 0.0
His
0.832HisAla: 0.832 ± 0.306
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.999HisGlu: 0.999 ± 0.458
0.166HisPhe: 0.166 ± 0.192
0.166HisGly: 0.166 ± 0.154
0.499HisHis: 0.499 ± 0.403
0.666HisIle: 0.666 ± 0.345
1.165HisLys: 1.165 ± 0.381
1.165HisLeu: 1.165 ± 0.356
0.499HisMet: 0.499 ± 0.336
1.165HisAsn: 1.165 ± 0.532
0.166HisPro: 0.166 ± 0.163
0.333HisGln: 0.333 ± 0.273
0.666HisArg: 0.666 ± 0.277
0.333HisSer: 0.333 ± 0.247
0.666HisThr: 0.666 ± 0.308
0.166HisVal: 0.166 ± 0.159
0.333HisTrp: 0.333 ± 0.25
1.165HisTyr: 1.165 ± 0.597
0.0HisXaa: 0.0 ± 0.0
Ile
5.66IleAla: 5.66 ± 0.905
0.666IleCys: 0.666 ± 0.315
4.661IleAsp: 4.661 ± 0.993
3.829IleGlu: 3.829 ± 0.836
2.83IlePhe: 2.83 ± 0.59
3.496IleGly: 3.496 ± 0.738
1.165IleHis: 1.165 ± 0.587
7.158IleIle: 7.158 ± 1.114
3.829IleLys: 3.829 ± 0.759
6.326IleLeu: 6.326 ± 0.955
1.998IleMet: 1.998 ± 0.636
4.495IleAsn: 4.495 ± 0.707
2.83IlePro: 2.83 ± 0.592
3.662IleGln: 3.662 ± 0.643
1.498IleArg: 1.498 ± 0.421
5.327IleSer: 5.327 ± 1.18
6.492IleThr: 6.492 ± 1.163
4.495IleVal: 4.495 ± 1.076
0.999IleTrp: 0.999 ± 0.335
3.329IleTyr: 3.329 ± 0.671
0.0IleXaa: 0.0 ± 0.0
Lys
4.495LysAla: 4.495 ± 0.968
0.999LysCys: 0.999 ± 0.483
2.664LysAsp: 2.664 ± 0.911
3.829LysGlu: 3.829 ± 0.979
1.332LysPhe: 1.332 ± 0.418
3.329LysGly: 3.329 ± 0.712
1.332LysHis: 1.332 ± 0.796
3.329LysIle: 3.329 ± 0.827
5.827LysLys: 5.827 ± 1.723
7.991LysLeu: 7.991 ± 1.341
2.331LysMet: 2.331 ± 0.735
3.496LysAsn: 3.496 ± 0.918
1.665LysPro: 1.665 ± 0.602
2.497LysGln: 2.497 ± 0.822
2.331LysArg: 2.331 ± 0.655
3.496LysSer: 3.496 ± 0.834
1.998LysThr: 1.998 ± 0.595
2.997LysVal: 2.997 ± 0.916
0.333LysTrp: 0.333 ± 0.208
3.329LysTyr: 3.329 ± 0.945
0.0LysXaa: 0.0 ± 0.0
Leu
6.825LeuAla: 6.825 ± 0.955
0.666LeuCys: 0.666 ± 0.339
3.163LeuAsp: 3.163 ± 0.764
4.994LeuGlu: 4.994 ± 0.974
3.329LeuPhe: 3.329 ± 0.651
6.825LeuGly: 6.825 ± 1.127
0.999LeuHis: 0.999 ± 0.531
8.157LeuIle: 8.157 ± 1.222
5.327LeuLys: 5.327 ± 1.134
9.156LeuLeu: 9.156 ± 1.222
1.998LeuMet: 1.998 ± 0.611
7.158LeuAsn: 7.158 ± 1.245
5.827LeuPro: 5.827 ± 1.103
3.995LeuGln: 3.995 ± 0.699
3.829LeuArg: 3.829 ± 1.065
7.325LeuSer: 7.325 ± 1.179
6.326LeuThr: 6.326 ± 1.156
4.328LeuVal: 4.328 ± 0.776
1.165LeuTrp: 1.165 ± 0.396
4.495LeuTyr: 4.495 ± 0.87
0.0LeuXaa: 0.0 ± 0.0
Met
2.164MetAla: 2.164 ± 0.649
0.166MetCys: 0.166 ± 0.181
0.832MetAsp: 0.832 ± 0.351
1.165MetGlu: 1.165 ± 0.503
0.333MetPhe: 0.333 ± 0.208
1.498MetGly: 1.498 ± 0.538
0.166MetHis: 0.166 ± 0.169
1.831MetIle: 1.831 ± 0.647
2.164MetLys: 2.164 ± 0.661
3.496MetLeu: 3.496 ± 0.78
0.666MetMet: 0.666 ± 0.353
1.165MetAsn: 1.165 ± 0.547
0.999MetPro: 0.999 ± 0.48
1.498MetGln: 1.498 ± 0.561
0.999MetArg: 0.999 ± 0.482
2.83MetSer: 2.83 ± 0.786
0.999MetThr: 0.999 ± 0.527
1.498MetVal: 1.498 ± 0.531
0.499MetTrp: 0.499 ± 0.342
0.832MetTyr: 0.832 ± 0.325
0.0MetXaa: 0.0 ± 0.0
Asn
5.827AsnAla: 5.827 ± 1.04
1.165AsnCys: 1.165 ± 0.413
1.998AsnAsp: 1.998 ± 0.55
4.661AsnGlu: 4.661 ± 1.024
1.665AsnPhe: 1.665 ± 0.509
4.328AsnGly: 4.328 ± 0.799
0.666AsnHis: 0.666 ± 0.339
5.66AsnIle: 5.66 ± 1.085
3.662AsnLys: 3.662 ± 1.073
4.994AsnLeu: 4.994 ± 0.782
1.665AsnMet: 1.665 ± 0.587
6.492AsnAsn: 6.492 ± 1.148
3.662AsnPro: 3.662 ± 0.754
2.164AsnGln: 2.164 ± 0.564
2.497AsnArg: 2.497 ± 0.558
6.159AsnSer: 6.159 ± 0.99
3.662AsnThr: 3.662 ± 0.968
4.162AsnVal: 4.162 ± 0.744
0.499AsnTrp: 0.499 ± 0.238
4.828AsnTyr: 4.828 ± 0.768
0.0AsnXaa: 0.0 ± 0.0
Pro
2.497ProAla: 2.497 ± 0.863
0.0ProCys: 0.0 ± 0.0
1.998ProAsp: 1.998 ± 0.453
1.498ProGlu: 1.498 ± 0.562
1.165ProPhe: 1.165 ± 0.409
1.831ProGly: 1.831 ± 0.572
0.166ProHis: 0.166 ± 0.188
2.83ProIle: 2.83 ± 0.709
1.498ProLys: 1.498 ± 0.49
3.329ProLeu: 3.329 ± 0.861
0.666ProMet: 0.666 ± 0.331
2.664ProAsn: 2.664 ± 0.609
4.661ProPro: 4.661 ± 1.326
1.831ProGln: 1.831 ± 0.565
0.666ProArg: 0.666 ± 0.425
3.496ProSer: 3.496 ± 0.982
3.496ProThr: 3.496 ± 0.774
1.498ProVal: 1.498 ± 0.428
0.166ProTrp: 0.166 ± 0.141
3.163ProTyr: 3.163 ± 0.785
0.0ProXaa: 0.0 ± 0.0
Gln
1.498GlnAla: 1.498 ± 0.375
0.499GlnCys: 0.499 ± 0.316
0.832GlnAsp: 0.832 ± 0.358
1.498GlnGlu: 1.498 ± 0.509
1.498GlnPhe: 1.498 ± 0.52
3.163GlnGly: 3.163 ± 1.053
0.0GlnHis: 0.0 ± 0.0
3.163GlnIle: 3.163 ± 0.872
1.998GlnLys: 1.998 ± 0.601
3.496GlnLeu: 3.496 ± 0.628
1.498GlnMet: 1.498 ± 0.677
3.329GlnAsn: 3.329 ± 0.722
2.331GlnPro: 2.331 ± 0.796
2.664GlnGln: 2.664 ± 0.58
1.165GlnArg: 1.165 ± 0.492
3.496GlnSer: 3.496 ± 0.773
8.157GlnThr: 8.157 ± 1.84
2.164GlnVal: 2.164 ± 0.617
0.666GlnTrp: 0.666 ± 0.323
3.163GlnTyr: 3.163 ± 0.722
0.0GlnXaa: 0.0 ± 0.0
Arg
1.665ArgAla: 1.665 ± 0.524
0.166ArgCys: 0.166 ± 0.159
1.998ArgAsp: 1.998 ± 0.609
1.831ArgGlu: 1.831 ± 0.64
0.832ArgPhe: 0.832 ± 0.344
1.998ArgGly: 1.998 ± 0.666
0.499ArgHis: 0.499 ± 0.353
1.665ArgIle: 1.665 ± 0.639
2.497ArgLys: 2.497 ± 0.71
3.329ArgLeu: 3.329 ± 0.81
1.498ArgMet: 1.498 ± 0.554
0.666ArgAsn: 0.666 ± 0.429
0.166ArgPro: 0.166 ± 0.168
2.164ArgGln: 2.164 ± 0.912
2.664ArgArg: 2.664 ± 0.842
1.498ArgSer: 1.498 ± 0.587
0.333ArgThr: 0.333 ± 0.243
0.999ArgVal: 0.999 ± 0.486
0.0ArgTrp: 0.0 ± 0.0
1.831ArgTyr: 1.831 ± 0.552
0.0ArgXaa: 0.0 ± 0.0
Ser
4.162SerAla: 4.162 ± 0.838
0.499SerCys: 0.499 ± 0.299
2.83SerAsp: 2.83 ± 0.77
3.995SerGlu: 3.995 ± 0.863
3.496SerPhe: 3.496 ± 0.809
6.825SerGly: 6.825 ± 1.465
0.999SerHis: 0.999 ± 0.349
4.162SerIle: 4.162 ± 0.869
3.995SerLys: 3.995 ± 0.938
5.827SerLeu: 5.827 ± 0.671
1.998SerMet: 1.998 ± 0.576
4.328SerAsn: 4.328 ± 0.938
3.496SerPro: 3.496 ± 1.52
5.827SerGln: 5.827 ± 1.163
0.999SerArg: 0.999 ± 0.424
9.655SerSer: 9.655 ± 2.316
6.825SerThr: 6.825 ± 1.284
4.828SerVal: 4.828 ± 0.802
0.666SerTrp: 0.666 ± 0.372
4.328SerTyr: 4.328 ± 0.909
0.0SerXaa: 0.0 ± 0.0
Thr
3.496ThrAla: 3.496 ± 0.661
0.999ThrCys: 0.999 ± 0.51
2.997ThrAsp: 2.997 ± 0.784
2.664ThrGlu: 2.664 ± 0.723
1.831ThrPhe: 1.831 ± 0.574
3.329ThrGly: 3.329 ± 0.874
0.832ThrHis: 0.832 ± 0.468
4.495ThrIle: 4.495 ± 0.76
3.329ThrLys: 3.329 ± 0.722
6.492ThrLeu: 6.492 ± 1.063
0.832ThrMet: 0.832 ± 0.335
6.326ThrAsn: 6.326 ± 1.529
3.829ThrPro: 3.829 ± 0.665
5.66ThrGln: 5.66 ± 1.449
0.666ThrArg: 0.666 ± 0.311
8.157ThrSer: 8.157 ± 1.757
8.49ThrThr: 8.49 ± 2.99
5.494ThrVal: 5.494 ± 1.061
0.499ThrTrp: 0.499 ± 0.278
3.496ThrTyr: 3.496 ± 0.899
0.0ThrXaa: 0.0 ± 0.0
Val
4.162ValAla: 4.162 ± 1.099
1.498ValCys: 1.498 ± 0.487
3.329ValAsp: 3.329 ± 0.712
2.497ValGlu: 2.497 ± 0.655
2.331ValPhe: 2.331 ± 0.697
3.329ValGly: 3.329 ± 0.816
0.832ValHis: 0.832 ± 0.306
5.327ValIle: 5.327 ± 0.764
3.995ValLys: 3.995 ± 0.704
4.994ValLeu: 4.994 ± 0.842
0.666ValMet: 0.666 ± 0.319
4.328ValAsn: 4.328 ± 0.906
1.831ValPro: 1.831 ± 0.63
2.164ValGln: 2.164 ± 0.63
1.665ValArg: 1.665 ± 0.513
4.661ValSer: 4.661 ± 0.85
4.828ValThr: 4.828 ± 1.068
3.995ValVal: 3.995 ± 0.94
0.0ValTrp: 0.0 ± 0.0
3.662ValTyr: 3.662 ± 0.742
0.0ValXaa: 0.0 ± 0.0
Trp
0.333TrpAla: 0.333 ± 0.235
0.166TrpCys: 0.166 ± 0.188
0.499TrpAsp: 0.499 ± 0.292
0.666TrpGlu: 0.666 ± 0.275
0.499TrpPhe: 0.499 ± 0.262
0.499TrpGly: 0.499 ± 0.285
0.166TrpHis: 0.166 ± 0.168
0.166TrpIle: 0.166 ± 0.147
0.166TrpLys: 0.166 ± 0.164
1.165TrpLeu: 1.165 ± 0.403
0.166TrpMet: 0.166 ± 0.156
0.333TrpAsn: 0.333 ± 0.22
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.832TrpSer: 0.832 ± 0.401
0.832TrpThr: 0.832 ± 0.33
1.332TrpVal: 1.332 ± 0.4
0.0TrpTrp: 0.0 ± 0.0
0.666TrpTyr: 0.666 ± 0.322
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.163TyrAla: 3.163 ± 0.68
0.333TyrCys: 0.333 ± 0.224
3.163TyrAsp: 3.163 ± 0.919
2.83TyrGlu: 2.83 ± 0.563
2.83TyrPhe: 2.83 ± 0.579
2.497TyrGly: 2.497 ± 0.751
0.999TyrHis: 0.999 ± 0.391
4.328TyrIle: 4.328 ± 0.724
2.497TyrLys: 2.497 ± 0.636
5.827TyrLeu: 5.827 ± 0.894
1.665TyrMet: 1.665 ± 0.478
3.995TyrAsn: 3.995 ± 0.889
0.999TyrPro: 0.999 ± 0.428
1.998TyrGln: 1.998 ± 0.606
1.998TyrArg: 1.998 ± 0.656
5.494TyrSer: 5.494 ± 1.122
4.828TyrThr: 4.828 ± 1.055
5.327TyrVal: 5.327 ± 0.746
0.333TyrTrp: 0.333 ± 0.205
5.327TyrTyr: 5.327 ± 1.264
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (6008 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski