Amino acid dipeptide frequency for Acidianus bottle-shaped virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
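The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the use of overlapping windows within each protein, and the normalization by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else becomes 'X' (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counts overlapping residue pairs inside each
    protein (pairs never span two proteins) and normalizes by the total
    number of pairs. The database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Collapse non-standard or unknown residues into the 21st symbol 'X'.
        seq = "".join(c if c in alphabet else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    letters = alphabet + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(letters, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["ACAC", "AAX"])
    uniform = 1000.0 / 400  # 2.5 per mille, the uniform expectation
    over = sorted(p for p, f in freqs.items() if f > uniform)
    print(len(freqs), over)
```

Comparing each value with the 2.5‰ uniform expectation, as in the last lines, mirrors the red/blue marking of over- and underrepresented dipeptides.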

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
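As a worked conversion, taking the LeuLeu entry from the table (10.463‰) and the uniform expectation of 0.25% stated above:

```latex
\[
f_{\mathrm{LeuLeu}} = 10.463 \times 10^{-3} = 0.010463 \approx 1.05\%,
\qquad
f_{\mathrm{uniform}} = \frac{1}{400} = 0.0025 = 0.25\% = 2.5 \times 10^{-3}
\]
\[
\frac{f_{\mathrm{LeuLeu}}}{f_{\mathrm{uniform}}} = \frac{0.010463}{0.0025} \approx 4.2
\]
```

In other words, LeuLeu occurs roughly four times more often in this proteome than it would under a uniform distribution.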

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.147 ± 0.128
AlaAsp: 1.326 ± 0.469
AlaGlu: 1.768 ± 0.368
AlaPhe: 3.095 ± 0.734
AlaGly: 1.621 ± 0.42
AlaHis: 1.032 ± 0.482
AlaIle: 6.926 ± 0.874
AlaLys: 3.831 ± 0.647
AlaLeu: 5.452 ± 0.939
AlaMet: 0.884 ± 0.341
AlaAsn: 3.389 ± 0.727
AlaPro: 2.947 ± 1.285
AlaGln: 1.032 ± 0.464
AlaArg: 1.474 ± 0.471
AlaSer: 2.505 ± 0.586
AlaThr: 1.474 ± 0.522
AlaVal: 3.979 ± 0.87
AlaTrp: 0.295 ± 0.176
AlaTyr: 3.831 ± 0.688
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.737 ± 0.394
CysCys: 0.0 ± 0.0
CysAsp: 0.589 ± 0.275
CysGlu: 0.295 ± 0.2
CysPhe: 0.442 ± 0.234
CysGly: 0.589 ± 0.415
CysHis: 0.147 ± 0.156
CysIle: 1.179 ± 0.44
CysLys: 0.295 ± 0.159
CysLeu: 0.884 ± 0.387
CysMet: 0.295 ± 0.255
CysAsn: 0.147 ± 0.151
CysPro: 1.032 ± 0.442
CysGln: 0.589 ± 0.39
CysArg: 0.442 ± 0.244
CysSer: 1.032 ± 0.531
CysThr: 0.589 ± 0.292
CysVal: 0.147 ± 0.14
CysTrp: 0.147 ± 0.15
CysTyr: 0.589 ± 0.275
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.768 ± 0.589
AspCys: 0.589 ± 0.346
AspAsp: 3.537 ± 0.98
AspGlu: 3.831 ± 1.08
AspPhe: 3.684 ± 0.795
AspGly: 1.326 ± 0.43
AspHis: 0.589 ± 0.236
AspIle: 2.8 ± 0.661
AspLys: 5.01 ± 0.729
AspLeu: 6.042 ± 0.95
AspMet: 0.737 ± 0.382
AspAsn: 4.126 ± 0.76
AspPro: 1.916 ± 0.521
AspGln: 1.474 ± 0.563
AspArg: 2.358 ± 0.659
AspSer: 3.684 ± 0.693
AspThr: 1.474 ± 0.41
AspVal: 4.568 ± 0.883
AspTrp: 0.147 ± 0.13
AspTyr: 3.684 ± 0.832
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.916 ± 0.565
GluCys: 1.032 ± 0.421
GluAsp: 3.242 ± 0.847
GluGlu: 5.452 ± 1.063
GluPhe: 3.389 ± 0.716
GluGly: 2.063 ± 0.534
GluHis: 0.884 ± 0.281
GluIle: 7.073 ± 1.043
GluLys: 7.663 ± 1.624
GluLeu: 5.305 ± 1.427
GluMet: 2.358 ± 0.569
GluAsn: 3.095 ± 0.778
GluPro: 2.8 ± 0.521
GluGln: 2.21 ± 0.495
GluArg: 1.474 ± 0.448
GluSer: 4.421 ± 0.657
GluThr: 3.389 ± 0.699
GluVal: 5.158 ± 0.871
GluTrp: 0.737 ± 0.288
GluTyr: 3.095 ± 0.815
GluXaa: 0.147 ± 0.135
Phe
PheAla: 4.863 ± 0.98
PheCys: 0.589 ± 0.374
PheAsp: 3.537 ± 0.831
PheGlu: 2.947 ± 0.644
PhePhe: 2.505 ± 0.693
PheGly: 2.063 ± 0.556
PheHis: 0.295 ± 0.259
PheIle: 5.6 ± 0.977
PheLys: 4.716 ± 0.678
PheLeu: 5.452 ± 1.06
PheMet: 0.884 ± 0.359
PheAsn: 3.684 ± 0.736
PhePro: 1.179 ± 0.445
PheGln: 1.474 ± 0.394
PheArg: 1.916 ± 0.489
PheSer: 5.01 ± 0.647
PheThr: 3.242 ± 0.678
PheVal: 2.505 ± 0.505
PheTrp: 0.295 ± 0.188
PheTyr: 2.947 ± 0.676
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.947 ± 0.727
GlyCys: 0.884 ± 0.446
GlyAsp: 1.916 ± 0.503
GlyGlu: 4.421 ± 0.682
GlyPhe: 2.21 ± 0.719
GlyGly: 2.8 ± 0.695
GlyHis: 0.589 ± 0.246
GlyIle: 5.452 ± 0.934
GlyLys: 4.568 ± 0.83
GlyLeu: 5.452 ± 0.974
GlyMet: 1.474 ± 0.43
GlyAsn: 2.653 ± 0.568
GlyPro: 0.442 ± 0.218
GlyGln: 1.621 ± 0.471
GlyArg: 1.032 ± 0.413
GlySer: 1.916 ± 0.709
GlyThr: 1.474 ± 0.385
GlyVal: 3.389 ± 0.909
GlyTrp: 0.442 ± 0.255
GlyTyr: 1.768 ± 0.421
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.884 ± 0.31
HisGlu: 0.737 ± 0.39
HisPhe: 0.884 ± 0.364
HisGly: 0.589 ± 0.284
HisHis: 0.295 ± 0.223
HisIle: 0.442 ± 0.224
HisLys: 1.621 ± 0.634
HisLeu: 0.589 ± 0.265
HisMet: 0.147 ± 0.126
HisAsn: 0.442 ± 0.293
HisPro: 0.295 ± 0.181
HisGln: 0.0 ± 0.0
HisArg: 1.032 ± 0.386
HisSer: 0.442 ± 0.232
HisThr: 0.884 ± 0.319
HisVal: 0.884 ± 0.338
HisTrp: 0.147 ± 0.131
HisTyr: 0.589 ± 0.293
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.421 ± 0.624
IleCys: 1.916 ± 0.685
IleAsp: 6.484 ± 1.119
IleGlu: 5.305 ± 1.03
IlePhe: 3.684 ± 0.594
IleGly: 5.6 ± 0.837
IleHis: 1.326 ± 0.481
IleIle: 10.315 ± 1.1
IleLys: 6.042 ± 1.036
IleLeu: 10.021 ± 1.264
IleMet: 1.916 ± 0.469
IleAsn: 5.894 ± 0.894
IlePro: 2.653 ± 0.516
IleGln: 2.653 ± 0.502
IleArg: 3.242 ± 0.65
IleSer: 5.452 ± 0.996
IleThr: 5.747 ± 1.063
IleVal: 5.01 ± 0.829
IleTrp: 0.0 ± 0.0
IleTyr: 4.716 ± 0.785
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.831 ± 0.942
LysCys: 0.442 ± 0.249
LysAsp: 3.095 ± 0.586
LysGlu: 6.337 ± 1.15
LysPhe: 4.274 ± 0.864
LysGly: 3.684 ± 0.704
LysHis: 0.884 ± 0.323
LysIle: 5.894 ± 1.112
LysLys: 4.863 ± 0.871
LysLeu: 6.926 ± 1.077
LysMet: 2.505 ± 0.645
LysAsn: 3.095 ± 0.824
LysPro: 2.505 ± 0.842
LysGln: 4.126 ± 0.767
LysArg: 1.621 ± 0.54
LysSer: 4.568 ± 0.821
LysThr: 3.537 ± 0.63
LysVal: 7.958 ± 1.279
LysTrp: 1.474 ± 0.438
LysTyr: 3.537 ± 0.683
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.716 ± 0.905
LeuCys: 1.032 ± 0.454
LeuAsp: 6.189 ± 1.046
LeuGlu: 6.631 ± 0.887
LeuPhe: 6.779 ± 1.108
LeuGly: 6.042 ± 1.19
LeuHis: 0.589 ± 0.286
LeuIle: 10.021 ± 1.732
LeuLys: 6.779 ± 1.191
LeuLeu: 10.463 ± 1.12
LeuMet: 2.653 ± 0.682
LeuAsn: 7.368 ± 1.042
LeuPro: 3.389 ± 0.739
LeuGln: 3.242 ± 0.567
LeuArg: 3.979 ± 0.674
LeuSer: 7.368 ± 1.288
LeuThr: 6.631 ± 0.701
LeuVal: 5.452 ± 0.85
LeuTrp: 0.295 ± 0.176
LeuTyr: 3.389 ± 0.688
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.358 ± 0.483
MetCys: 0.147 ± 0.151
MetAsp: 1.032 ± 0.439
MetGlu: 1.326 ± 0.402
MetPhe: 0.737 ± 0.287
MetGly: 0.589 ± 0.279
MetHis: 0.147 ± 0.121
MetIle: 2.358 ± 0.503
MetLys: 2.063 ± 0.58
MetLeu: 2.21 ± 0.614
MetMet: 0.737 ± 0.296
MetAsn: 0.737 ± 0.338
MetPro: 0.884 ± 0.375
MetGln: 0.442 ± 0.286
MetArg: 1.179 ± 0.418
MetSer: 1.621 ± 0.485
MetThr: 1.474 ± 0.408
MetVal: 1.621 ± 0.458
MetTrp: 0.0 ± 0.0
MetTyr: 2.063 ± 0.547
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.653 ± 0.427
AsnCys: 0.737 ± 0.357
AsnAsp: 3.389 ± 0.81
AsnGlu: 3.684 ± 0.761
AsnPhe: 4.126 ± 0.727
AsnGly: 3.095 ± 0.631
AsnHis: 0.589 ± 0.371
AsnIle: 2.358 ± 0.633
AsnLys: 3.831 ± 0.907
AsnLeu: 6.042 ± 0.837
AsnMet: 0.884 ± 0.341
AsnAsn: 5.305 ± 1.186
AsnPro: 2.653 ± 0.632
AsnGln: 2.653 ± 0.493
AsnArg: 1.916 ± 0.625
AsnSer: 3.242 ± 0.638
AsnThr: 3.537 ± 0.665
AsnVal: 2.358 ± 0.729
AsnTrp: 0.295 ± 0.188
AsnTyr: 5.01 ± 0.901
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.21 ± 0.436
ProCys: 0.295 ± 0.183
ProAsp: 1.474 ± 0.472
ProGlu: 3.979 ± 1.131
ProPhe: 1.179 ± 0.481
ProGly: 0.589 ± 0.26
ProHis: 0.295 ± 0.188
ProIle: 4.421 ± 0.709
ProLys: 2.947 ± 0.706
ProLeu: 3.537 ± 0.529
ProMet: 0.147 ± 0.137
ProAsn: 1.621 ± 0.5
ProPro: 2.8 ± 0.922
ProGln: 0.737 ± 0.306
ProArg: 1.179 ± 0.52
ProSer: 4.421 ± 1.198
ProThr: 2.358 ± 0.488
ProVal: 2.358 ± 0.457
ProTrp: 0.0 ± 0.0
ProTyr: 1.768 ± 0.633
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.032 ± 0.412
GlnCys: 0.442 ± 0.249
GlnAsp: 1.768 ± 0.484
GlnGlu: 2.063 ± 0.599
GlnPhe: 2.21 ± 0.435
GlnGly: 1.032 ± 0.366
GlnHis: 0.442 ± 0.259
GlnIle: 2.653 ± 0.499
GlnLys: 2.505 ± 0.495
GlnLeu: 3.979 ± 0.86
GlnMet: 1.474 ± 0.533
GlnAsn: 1.621 ± 0.412
GlnPro: 0.295 ± 0.184
GlnGln: 1.179 ± 0.35
GlnArg: 0.589 ± 0.281
GlnSer: 2.8 ± 0.621
GlnThr: 2.063 ± 0.467
GlnVal: 2.063 ± 0.424
GlnTrp: 0.295 ± 0.185
GlnTyr: 2.063 ± 0.36
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.884 ± 0.358
ArgCys: 0.0 ± 0.0
ArgAsp: 1.916 ± 0.452
ArgGlu: 1.474 ± 0.541
ArgPhe: 1.474 ± 0.524
ArgGly: 1.916 ± 0.545
ArgHis: 0.147 ± 0.131
ArgIle: 3.095 ± 0.717
ArgLys: 2.947 ± 0.579
ArgLeu: 3.242 ± 0.649
ArgMet: 1.179 ± 0.342
ArgAsn: 1.621 ± 0.583
ArgPro: 0.589 ± 0.375
ArgGln: 1.768 ± 0.712
ArgArg: 1.474 ± 0.516
ArgSer: 3.389 ± 0.751
ArgThr: 2.063 ± 0.526
ArgVal: 2.21 ± 0.605
ArgTrp: 0.147 ± 0.138
ArgTyr: 1.474 ± 0.428
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.242 ± 0.57
SerCys: 0.589 ± 0.315
SerAsp: 3.242 ± 0.771
SerGlu: 6.484 ± 1.672
SerPhe: 4.421 ± 0.694
SerGly: 3.684 ± 0.752
SerHis: 0.737 ± 0.341
SerIle: 5.894 ± 1.134
SerLys: 3.831 ± 0.795
SerLeu: 7.515 ± 1.078
SerMet: 1.768 ± 0.52
SerAsn: 4.274 ± 0.645
SerPro: 3.831 ± 1.247
SerGln: 2.063 ± 0.488
SerArg: 1.916 ± 0.548
SerSer: 5.452 ± 1.223
SerThr: 4.126 ± 0.863
SerVal: 3.831 ± 0.751
SerTrp: 0.295 ± 0.178
SerTyr: 4.274 ± 0.753
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.8 ± 0.58
ThrCys: 0.442 ± 0.272
ThrAsp: 2.505 ± 0.648
ThrGlu: 2.358 ± 0.492
ThrPhe: 4.274 ± 0.814
ThrGly: 2.947 ± 0.738
ThrHis: 0.589 ± 0.272
ThrIle: 4.568 ± 0.807
ThrLys: 2.358 ± 0.745
ThrLeu: 6.337 ± 0.99
ThrMet: 1.326 ± 0.377
ThrAsn: 3.389 ± 0.691
ThrPro: 2.947 ± 0.516
ThrGln: 1.621 ± 0.416
ThrArg: 1.621 ± 0.46
ThrSer: 4.421 ± 0.884
ThrThr: 2.8 ± 0.744
ThrVal: 2.8 ± 0.712
ThrTrp: 0.442 ± 0.201
ThrTyr: 3.242 ± 0.744
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.8 ± 0.575
ValCys: 0.147 ± 0.154
ValAsp: 3.537 ± 0.508
ValGlu: 5.01 ± 1.129
ValPhe: 3.389 ± 0.537
ValGly: 4.274 ± 0.823
ValHis: 1.032 ± 0.339
ValIle: 7.073 ± 0.858
ValLys: 5.158 ± 0.942
ValLeu: 6.484 ± 1.286
ValMet: 1.621 ± 0.48
ValAsn: 2.358 ± 0.524
ValPro: 2.505 ± 0.676
ValGln: 1.179 ± 0.405
ValArg: 1.916 ± 0.529
ValSer: 4.716 ± 0.855
ValThr: 3.389 ± 0.753
ValVal: 4.126 ± 0.786
ValTrp: 0.147 ± 0.167
ValTyr: 3.831 ± 0.614
ValXaa: 0.147 ± 0.135
Trp
TrpAla: 0.147 ± 0.149
TrpCys: 0.0 ± 0.0
TrpAsp: 0.295 ± 0.197
TrpGlu: 0.147 ± 0.151
TrpPhe: 0.295 ± 0.213
TrpGly: 0.295 ± 0.176
TrpHis: 0.0 ± 0.0
TrpIle: 0.589 ± 0.259
TrpLys: 0.589 ± 0.261
TrpLeu: 0.884 ± 0.302
TrpMet: 0.0 ± 0.0
TrpAsn: 0.589 ± 0.34
TrpPro: 0.0 ± 0.0
TrpGln: 0.295 ± 0.191
TrpArg: 0.442 ± 0.201
TrpSer: 0.147 ± 0.13
TrpThr: 0.295 ± 0.202
TrpVal: 0.295 ± 0.176
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.884 ± 0.305
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.684 ± 0.697
TyrCys: 0.737 ± 0.299
TyrAsp: 3.389 ± 0.782
TyrGlu: 2.653 ± 0.631
TyrPhe: 2.653 ± 0.592
TyrGly: 2.653 ± 0.529
TyrHis: 0.295 ± 0.186
TyrIle: 3.684 ± 0.734
TyrLys: 3.389 ± 0.633
TyrLeu: 6.337 ± 0.881
TyrMet: 0.589 ± 0.324
TyrAsn: 2.947 ± 0.69
TyrPro: 2.653 ± 0.855
TyrGln: 2.21 ± 0.511
TyrArg: 2.21 ± 0.606
TyrSer: 4.863 ± 1.143
TyrThr: 3.242 ± 0.588
TyrVal: 3.979 ± 0.68
TyrTrp: 0.589 ± 0.245
TyrTyr: 2.505 ± 0.597
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.147 ± 0.135
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.147 ± 0.135
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.147 ± 0.135
Statistics based on 53 proteins (6787 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
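A protein-level bootstrap can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's code: the helper names `pair_per_mille` and `bootstrap_error` are hypothetical, whole proteins are resampled with replacement, and the error is taken to be the standard deviation of the statistic across the resamples.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide, counted over overlapping pairs.

    Hypothetical helper; normalizing by the number of pairs is an assumption.
    """
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, statistic, n_rounds=100, seed=0):
    """Estimate the spread of `statistic` by resampling whole proteins.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(statistic(sample))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MLLKV", "AALLG", "KVKVA", "GGSTA"]  # made-up sequences
    value = pair_per_mille(toy_proteome, "LL")
    error = bootstrap_error(toy_proteome, lambda s: pair_per_mille(s, "LL"))
    print(f"LL: {value:.3f} ± {error:.3f}")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual positions.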

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski