Amino acid dipeptide frequency for Sulfolobus spindle-shaped virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). Because this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and underrepresented ones are marked in blue in the table.

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
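As an illustration of how such frequencies can be obtained, the following is a minimal sketch that counts overlapping residue pairs within each protein and reports them in per mille, alongside the uniform expectation of 2.5‰. The function name `dipeptide_permille`, the one-letter toy sequences, and the normalization by the total number of pairs are assumptions made for this example; the database may count or normalize differently.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides, in per mille (assumes equal usage).
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are overlapping windows of length 2 and
    never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (not the real proteome).
toy = ["MKLLE", "LLK"]
freqs = dipeptide_permille(toy)

for pair, value in sorted(freqs.items()):
    status = "over" if value > EXPECTED_UNIFORM else "under"
    print(f"{pair}: {value:.3f} per mille ({status}represented)")
```

With only a handful of pairs, every observed dipeptide in the toy input lies far above 2.5‰; the comparison becomes meaningful only for a full proteome.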

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.2 ± 0.185
AlaCys: 0.2 ± 0.185
AlaAsp: 0.999 ± 0.43
AlaGlu: 2.998 ± 0.835
AlaPhe: 2.198 ± 0.582
AlaGly: 2.398 ± 0.815
AlaHis: 0.6 ± 0.345
AlaIle: 3.597 ± 0.931
AlaLys: 5.396 ± 1.464
AlaLeu: 5.596 ± 0.885
AlaMet: 0.6 ± 0.358
AlaAsn: 2.998 ± 0.728
AlaPro: 1.199 ± 0.488
AlaGln: 1.399 ± 0.502
AlaArg: 2.398 ± 0.839
AlaSer: 4.197 ± 1.178
AlaThr: 3.597 ± 0.906
AlaVal: 3.397 ± 0.801
AlaTrp: 0.999 ± 0.339
AlaTyr: 2.798 ± 0.765
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.2 ± 0.243
CysCys: 0.4 ± 0.37
CysAsp: 0.4 ± 0.321
CysGlu: 0.0 ± 0.0
CysPhe: 0.2 ± 0.213
CysGly: 0.6 ± 0.382
CysHis: 0.0 ± 0.0
CysIle: 1.199 ± 0.547
CysLys: 0.2 ± 0.224
CysLeu: 0.6 ± 0.351
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.999 ± 0.713
CysGln: 0.4 ± 0.343
CysArg: 0.2 ± 0.182
CysSer: 0.6 ± 0.556
CysThr: 0.0 ± 0.0
CysVal: 0.6 ± 0.351
CysTrp: 0.2 ± 0.243
CysTyr: 0.4 ± 0.279
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.799 ± 1.051
AspCys: 0.0 ± 0.0
AspAsp: 1.799 ± 0.686
AspGlu: 2.198 ± 0.892
AspPhe: 1.199 ± 0.497
AspGly: 2.998 ± 0.672
AspHis: 0.799 ± 0.362
AspIle: 2.998 ± 0.794
AspLys: 2.398 ± 0.982
AspLeu: 3.197 ± 1.101
AspMet: 0.799 ± 0.404
AspAsn: 0.6 ± 0.379
AspPro: 0.799 ± 0.405
AspGln: 0.2 ± 0.183
AspArg: 0.799 ± 0.429
AspSer: 1.199 ± 0.454
AspThr: 1.599 ± 0.488
AspVal: 2.398 ± 0.74
AspTrp: 0.799 ± 0.392
AspTyr: 1.998 ± 0.69
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.197 ± 0.995
GluCys: 0.999 ± 0.606
GluAsp: 3.597 ± 1.424
GluGlu: 7.794 ± 2.599
GluPhe: 1.799 ± 0.575
GluGly: 1.399 ± 0.57
GluHis: 0.999 ± 0.551
GluIle: 3.597 ± 0.891
GluLys: 4.596 ± 1.64
GluLeu: 9.392 ± 2.541
GluMet: 1.998 ± 0.587
GluAsn: 2.398 ± 1.036
GluPro: 1.599 ± 0.648
GluGln: 1.199 ± 0.484
GluArg: 2.598 ± 1.104
GluSer: 3.397 ± 0.999
GluThr: 1.399 ± 0.698
GluVal: 4.197 ± 1.238
GluTrp: 0.4 ± 0.26
GluTyr: 2.598 ± 0.917
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.198 ± 0.618
PheCys: 0.0 ± 0.0
PheAsp: 2.398 ± 0.599
PheGlu: 1.399 ± 0.535
PhePhe: 2.198 ± 0.653
PheGly: 2.398 ± 0.6
PheHis: 0.6 ± 0.323
PheIle: 2.798 ± 0.779
PheLys: 1.998 ± 0.69
PheLeu: 4.197 ± 1.123
PheMet: 0.999 ± 0.413
PheAsn: 1.599 ± 0.609
PhePro: 1.399 ± 0.539
PheGln: 1.399 ± 0.529
PheArg: 1.599 ± 0.612
PheSer: 4.396 ± 0.994
PheThr: 3.797 ± 0.882
PheVal: 4.796 ± 1.058
PheTrp: 0.999 ± 0.386
PheTyr: 4.396 ± 0.97
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.799 ± 0.719
GlyCys: 0.0 ± 0.0
GlyAsp: 1.799 ± 0.553
GlyGlu: 1.799 ± 0.588
GlyPhe: 3.997 ± 0.802
GlyGly: 3.197 ± 1.07
GlyHis: 0.4 ± 0.273
GlyIle: 4.596 ± 0.913
GlyLys: 4.396 ± 0.76
GlyLeu: 7.794 ± 1.561
GlyMet: 0.4 ± 0.276
GlyAsn: 1.998 ± 0.507
GlyPro: 2.198 ± 0.896
GlyGln: 2.398 ± 1.05
GlyArg: 2.998 ± 0.99
GlySer: 4.596 ± 1.385
GlyThr: 4.197 ± 1.427
GlyVal: 3.797 ± 0.667
GlyTrp: 0.999 ± 0.366
GlyTyr: 3.597 ± 0.932
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.799 ± 0.474
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.6 ± 0.33
HisPhe: 0.799 ± 0.435
HisGly: 0.999 ± 0.42
HisHis: 0.0 ± 0.0
HisIle: 0.6 ± 0.541
HisLys: 0.999 ± 0.459
HisLeu: 0.999 ± 0.537
HisMet: 0.0 ± 0.0
HisAsn: 1.199 ± 0.68
HisPro: 0.4 ± 0.238
HisGln: 0.2 ± 0.164
HisArg: 0.799 ± 0.524
HisSer: 0.799 ± 0.494
HisThr: 0.999 ± 0.384
HisVal: 1.199 ± 0.602
HisTrp: 0.0 ± 0.0
HisTyr: 1.399 ± 0.558
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.396 ± 0.892
IleCys: 0.4 ± 0.337
IleAsp: 1.799 ± 0.539
IleGlu: 3.997 ± 1.06
IlePhe: 5.596 ± 1.028
IleGly: 5.196 ± 1.209
IleHis: 1.199 ± 0.59
IleIle: 6.994 ± 1.462
IleLys: 4.796 ± 1.377
IleLeu: 8.393 ± 1.529
IleMet: 1.399 ± 0.424
IleAsn: 4.596 ± 0.927
IlePro: 4.396 ± 0.971
IleGln: 2.198 ± 0.52
IleArg: 4.796 ± 1.267
IleSer: 6.395 ± 1.093
IleThr: 3.997 ± 1.303
IleVal: 5.396 ± 1.143
IleTrp: 0.999 ± 0.402
IleTyr: 3.997 ± 0.853
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.598 ± 0.711
LysCys: 0.799 ± 0.355
LysAsp: 1.998 ± 0.771
LysGlu: 5.795 ± 1.236
LysPhe: 2.598 ± 0.647
LysGly: 3.597 ± 0.77
LysHis: 1.199 ± 0.532
LysIle: 7.794 ± 1.853
LysLys: 8.393 ± 2.091
LysLeu: 8.993 ± 2.114
LysMet: 2.198 ± 0.796
LysAsn: 2.798 ± 0.836
LysPro: 1.399 ± 0.486
LysGln: 3.197 ± 0.904
LysArg: 2.798 ± 0.826
LysSer: 4.197 ± 1.508
LysThr: 4.596 ± 0.987
LysVal: 3.197 ± 0.69
LysTrp: 0.999 ± 0.356
LysTyr: 4.197 ± 1.15
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.396 ± 0.83
LeuCys: 0.999 ± 0.593
LeuAsp: 2.798 ± 1.08
LeuGlu: 5.596 ± 1.573
LeuPhe: 5.596 ± 1.266
LeuGly: 5.596 ± 1.184
LeuHis: 0.799 ± 0.424
LeuIle: 9.592 ± 1.656
LeuLys: 6.395 ± 1.547
LeuLeu: 13.989 ± 1.906
LeuMet: 3.197 ± 0.77
LeuAsn: 9.193 ± 1.418
LeuPro: 5.196 ± 0.809
LeuGln: 3.597 ± 1.017
LeuArg: 4.996 ± 1.326
LeuSer: 8.393 ± 1.339
LeuThr: 9.392 ± 1.108
LeuVal: 7.994 ± 1.545
LeuTrp: 1.399 ± 0.551
LeuTyr: 4.996 ± 0.865
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.999 ± 0.474
MetCys: 0.0 ± 0.0
MetAsp: 0.999 ± 0.667
MetGlu: 1.599 ± 0.478
MetPhe: 0.2 ± 0.212
MetGly: 1.599 ± 0.536
MetHis: 0.2 ± 0.243
MetIle: 0.799 ± 0.502
MetLys: 2.598 ± 0.716
MetLeu: 2.598 ± 0.857
MetMet: 0.4 ± 0.302
MetAsn: 0.6 ± 0.27
MetPro: 0.4 ± 0.296
MetGln: 0.999 ± 0.378
MetArg: 1.199 ± 0.582
MetSer: 1.599 ± 0.47
MetThr: 2.198 ± 0.564
MetVal: 0.999 ± 0.417
MetTrp: 0.799 ± 0.395
MetTyr: 0.999 ± 0.44
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.397 ± 0.706
AsnCys: 0.6 ± 0.323
AsnAsp: 1.599 ± 0.514
AsnGlu: 2.798 ± 1.081
AsnPhe: 2.398 ± 0.796
AsnGly: 3.397 ± 1.018
AsnHis: 0.6 ± 0.414
AsnIle: 4.396 ± 1.08
AsnLys: 2.398 ± 0.666
AsnLeu: 4.197 ± 1.016
AsnMet: 0.4 ± 0.299
AsnAsn: 4.996 ± 1.033
AsnPro: 2.398 ± 0.776
AsnGln: 1.599 ± 0.604
AsnArg: 1.199 ± 0.434
AsnSer: 3.597 ± 1.095
AsnThr: 3.797 ± 0.845
AsnVal: 4.396 ± 1.487
AsnTrp: 0.6 ± 0.27
AsnTyr: 4.396 ± 0.887
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.598 ± 0.768
ProCys: 0.0 ± 0.0
ProAsp: 1.599 ± 0.664
ProGlu: 1.399 ± 0.498
ProPhe: 2.598 ± 0.655
ProGly: 1.599 ± 0.705
ProHis: 0.6 ± 0.359
ProIle: 1.399 ± 0.561
ProLys: 1.799 ± 0.714
ProLeu: 3.397 ± 0.749
ProMet: 0.799 ± 0.393
ProAsn: 1.799 ± 0.632
ProPro: 3.997 ± 1.397
ProGln: 1.799 ± 0.473
ProArg: 1.199 ± 0.579
ProSer: 3.197 ± 1.261
ProThr: 2.598 ± 0.724
ProVal: 2.798 ± 0.893
ProTrp: 0.6 ± 0.364
ProTyr: 3.397 ± 0.84
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.6 ± 0.344
GlnCys: 0.2 ± 0.195
GlnAsp: 0.6 ± 0.33
GlnGlu: 0.999 ± 0.408
GlnPhe: 1.599 ± 0.743
GlnGly: 1.599 ± 0.553
GlnHis: 1.399 ± 0.591
GlnIle: 4.396 ± 0.814
GlnLys: 2.798 ± 0.951
GlnLeu: 3.397 ± 0.856
GlnMet: 0.6 ± 0.297
GlnAsn: 1.199 ± 0.403
GlnPro: 0.999 ± 0.64
GlnGln: 1.599 ± 0.477
GlnArg: 0.6 ± 0.332
GlnSer: 2.798 ± 0.838
GlnThr: 2.198 ± 0.734
GlnVal: 2.798 ± 1.013
GlnTrp: 0.4 ± 0.236
GlnTyr: 1.399 ± 0.637
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.799 ± 0.714
ArgCys: 0.6 ± 0.39
ArgAsp: 1.599 ± 0.63
ArgGlu: 3.797 ± 1.071
ArgPhe: 0.6 ± 0.382
ArgGly: 1.799 ± 0.646
ArgHis: 0.799 ± 0.387
ArgIle: 2.798 ± 0.779
ArgLys: 3.997 ± 0.905
ArgLeu: 2.798 ± 0.852
ArgMet: 0.799 ± 0.429
ArgAsn: 1.998 ± 0.605
ArgPro: 0.799 ± 0.399
ArgGln: 1.799 ± 0.768
ArgArg: 2.998 ± 1.2
ArgSer: 1.399 ± 0.554
ArgThr: 1.199 ± 0.606
ArgVal: 4.396 ± 1.446
ArgTrp: 0.2 ± 0.213
ArgTyr: 2.198 ± 1.077
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.997 ± 0.859
SerCys: 0.2 ± 0.182
SerAsp: 1.799 ± 0.474
SerGlu: 4.796 ± 1.273
SerPhe: 2.798 ± 0.643
SerGly: 5.995 ± 0.971
SerHis: 0.6 ± 0.338
SerIle: 5.596 ± 1.632
SerLys: 7.194 ± 1.67
SerLeu: 7.394 ± 1.586
SerMet: 1.199 ± 0.502
SerAsn: 4.396 ± 1.064
SerPro: 2.798 ± 0.738
SerGln: 2.598 ± 0.689
SerArg: 1.799 ± 0.72
SerSer: 6.395 ± 2.093
SerThr: 4.396 ± 1.259
SerVal: 5.795 ± 1.524
SerTrp: 0.799 ± 0.361
SerTyr: 3.197 ± 0.936
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.397 ± 1.135
ThrCys: 0.0 ± 0.0
ThrAsp: 1.199 ± 0.457
ThrGlu: 4.197 ± 1.025
ThrPhe: 2.598 ± 0.691
ThrGly: 3.597 ± 1.256
ThrHis: 0.999 ± 0.496
ThrIle: 6.195 ± 1.266
ThrLys: 4.197 ± 0.805
ThrLeu: 10.392 ± 1.337
ThrMet: 1.599 ± 0.549
ThrAsn: 2.598 ± 0.684
ThrPro: 2.798 ± 0.786
ThrGln: 2.598 ± 0.647
ThrArg: 0.999 ± 0.445
ThrSer: 4.396 ± 0.959
ThrThr: 5.596 ± 1.856
ThrVal: 4.197 ± 1.096
ThrTrp: 1.199 ± 0.488
ThrTyr: 3.197 ± 0.975
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.797 ± 0.863
ValCys: 1.199 ± 1.046
ValAsp: 1.599 ± 0.443
ValGlu: 4.796 ± 1.645
ValPhe: 2.198 ± 0.65
ValGly: 4.996 ± 1.032
ValHis: 0.4 ± 0.276
ValIle: 5.196 ± 1.376
ValLys: 4.996 ± 0.895
ValLeu: 7.194 ± 1.206
ValMet: 1.799 ± 0.541
ValAsn: 3.997 ± 1.004
ValPro: 2.798 ± 0.89
ValGln: 1.199 ± 0.616
ValArg: 1.799 ± 0.662
ValSer: 7.794 ± 1.501
ValThr: 6.195 ± 1.367
ValVal: 5.596 ± 1.554
ValTrp: 1.399 ± 0.628
ValTyr: 3.997 ± 0.982
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.6 ± 0.477
TrpCys: 0.2 ± 0.204
TrpAsp: 0.6 ± 0.322
TrpGlu: 0.4 ± 0.281
TrpPhe: 1.199 ± 0.37
TrpGly: 0.999 ± 0.446
TrpHis: 0.0 ± 0.0
TrpIle: 0.799 ± 0.5
TrpLys: 0.799 ± 0.305
TrpLeu: 2.598 ± 0.676
TrpMet: 0.6 ± 0.363
TrpAsn: 0.2 ± 0.195
TrpPro: 0.2 ± 0.177
TrpGln: 0.2 ± 0.207
TrpArg: 0.4 ± 0.245
TrpSer: 0.799 ± 0.508
TrpThr: 1.199 ± 0.454
TrpVal: 0.999 ± 0.477
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.998 ± 0.724
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.997 ± 0.99
TyrCys: 0.4 ± 0.343
TyrAsp: 1.399 ± 0.556
TyrGlu: 2.198 ± 0.574
TyrPhe: 3.197 ± 0.776
TyrGly: 2.998 ± 0.906
TyrHis: 0.6 ± 0.4
TyrIle: 5.596 ± 1.036
TyrLys: 3.197 ± 0.862
TyrLeu: 7.994 ± 0.946
TyrMet: 1.799 ± 0.705
TyrAsn: 3.997 ± 1.089
TyrPro: 2.398 ± 0.859
TyrGln: 1.599 ± 0.434
TyrArg: 2.198 ± 0.717
TyrSer: 3.597 ± 0.991
TyrThr: 2.998 ± 0.746
TyrVal: 3.997 ± 0.911
TyrTrp: 0.999 ± 0.336
TyrTyr: 3.397 ± 0.952
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 34 proteins (5005 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
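Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. This is a minimal sketch, not the database's actual code; the helper names, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the input, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


# Toy input in one-letter code (not the real proteome).
toy = ["MKLLE", "LLK", "AAKL"]
print(f"LL: {pair_permille(toy, 'LL'):.3f} ± {bootstrap_error(toy, 'LL'):.3f}")
```

Resampling whole proteins rather than individual residues keeps each dipeptide's sequence context intact, which is presumably why the error is quoted "at the protein level".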

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski