Amino acid dipeptide frequency for Sulfolobus spindle-shaped virus 1 (SSV1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are under-represented are marked in blue in the table.
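
As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_per_mille`, the one-letter alphabet, the rule that pairs never span two proteins, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table; "X" stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping pairs within one protein (assumption: pairs are not
        # counted across the boundary between two proteins).
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

toy = ["MKLLA", "LLKA"]  # made-up sequences, not SSV1 proteins
freqs = dipeptide_per_mille(toy)
print(len(freqs))             # 441 possible dipeptides including Xaa
print(round(freqs["LL"], 1))  # LL occurs 2 times out of 7 pairs -> 285.7
print(1000 / 400)             # uniform expectation over 400 dipeptides -> 2.5
```

Values above 2.5‰ would then correspond to the over-represented dipeptides, and values below it to the under-represented ones.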

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
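
A short worked conversion may help here. The Python snippet below takes the LeuLeu value from the table (10.907‰), converts it to a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical and only serves the example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

UNIFORM_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%

leu_leu = 10.907  # LeuLeu value from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)
print(f"{fraction:.6f}")         # 0.010907
print(f"{fraction * 100:.4f}%")  # 1.0907%
print(f"{leu_leu / UNIFORM_PER_MILLE:.2f}x the uniform expectation")  # 4.36x
```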

For more information, see the sequence space article on Wikipedia.

Residue order (rows and columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.0 ± 0.0
AlaAsp: 1.818 ± 1.206
AlaGlu: 4.646 ± 0.998
AlaPhe: 3.03 ± 0.673
AlaGly: 2.222 ± 0.958
AlaHis: 0.808 ± 0.333
AlaIle: 6.867 ± 1.204
AlaLys: 6.261 ± 1.326
AlaLeu: 7.675 ± 0.986
AlaMet: 0.606 ± 0.295
AlaAsn: 3.636 ± 1.019
AlaPro: 1.01 ± 0.631
AlaGln: 2.828 ± 0.814
AlaArg: 2.222 ± 0.726
AlaSer: 5.251 ± 1.243
AlaThr: 2.02 ± 0.703
AlaVal: 2.828 ± 0.579
AlaTrp: 0.606 ± 0.303
AlaTyr: 3.838 ± 1.056
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.202 ± 0.19
CysCys: 0.202 ± 0.207
CysAsp: 0.202 ± 0.207
CysGlu: 0.606 ± 0.546
CysPhe: 0.0 ± 0.0
CysGly: 1.212 ± 0.668
CysHis: 0.0 ± 0.0
CysIle: 0.404 ± 0.309
CysLys: 1.212 ± 0.619
CysLeu: 0.808 ± 0.442
CysMet: 0.0 ± 0.0
CysAsn: 0.404 ± 0.422
CysPro: 0.808 ± 0.586
CysGln: 0.0 ± 0.0
CysArg: 0.202 ± 0.19
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.404 ± 0.264
CysTrp: 0.0 ± 0.0
CysTyr: 0.808 ± 0.444
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.626 ± 0.775
AspCys: 0.202 ± 0.211
AspAsp: 1.616 ± 0.769
AspGlu: 3.636 ± 1.002
AspPhe: 1.616 ± 0.598
AspGly: 2.828 ± 0.793
AspHis: 0.606 ± 0.328
AspIle: 4.242 ± 1.078
AspLys: 2.02 ± 0.88
AspLeu: 2.424 ± 0.88
AspMet: 0.404 ± 0.319
AspAsn: 1.414 ± 0.429
AspPro: 0.808 ± 0.4
AspGln: 1.212 ± 0.65
AspArg: 1.414 ± 0.521
AspSer: 2.222 ± 0.754
AspThr: 0.606 ± 0.357
AspVal: 2.626 ± 0.685
AspTrp: 0.404 ± 0.257
AspTyr: 2.424 ± 0.896
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.636 ± 0.993
GluCys: 0.808 ± 0.477
GluAsp: 2.424 ± 0.829
GluGlu: 7.675 ± 1.79
GluPhe: 1.212 ± 0.461
GluGly: 3.03 ± 0.832
GluHis: 1.01 ± 0.446
GluIle: 3.838 ± 0.956
GluLys: 5.655 ± 1.406
GluLeu: 8.483 ± 2.23
GluMet: 1.616 ± 0.502
GluAsn: 2.828 ± 0.981
GluPro: 1.01 ± 0.534
GluGln: 2.828 ± 1.049
GluArg: 3.838 ± 1.503
GluSer: 1.616 ± 0.692
GluThr: 2.222 ± 0.665
GluVal: 3.232 ± 0.897
GluTrp: 0.404 ± 0.272
GluTyr: 3.03 ± 0.835
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.222 ± 0.67
PheCys: 0.0 ± 0.0
PheAsp: 2.02 ± 0.699
PheGlu: 2.424 ± 0.92
PhePhe: 1.414 ± 0.472
PheGly: 3.03 ± 0.56
PheHis: 0.202 ± 0.23
PheIle: 3.838 ± 0.847
PheLys: 2.626 ± 0.723
PheLeu: 3.434 ± 0.701
PheMet: 0.808 ± 0.343
PheAsn: 2.424 ± 0.634
PhePro: 1.01 ± 0.419
PheGln: 1.212 ± 0.486
PheArg: 1.414 ± 0.618
PheSer: 3.636 ± 1.23
PheThr: 3.03 ± 0.953
PheVal: 4.242 ± 0.742
PheTrp: 0.606 ± 0.379
PheTyr: 4.444 ± 0.722
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.424 ± 1.112
GlyCys: 0.202 ± 0.19
GlyAsp: 1.414 ± 0.629
GlyGlu: 1.414 ± 0.485
GlyPhe: 4.646 ± 1.041
GlyGly: 3.434 ± 1.168
GlyHis: 0.404 ± 0.272
GlyIle: 6.059 ± 0.934
GlyLys: 5.453 ± 1.115
GlyLeu: 7.069 ± 1.665
GlyMet: 1.212 ± 0.616
GlyAsn: 2.626 ± 1.003
GlyPro: 2.02 ± 0.6
GlyGln: 2.02 ± 0.874
GlyArg: 3.232 ± 0.836
GlySer: 4.242 ± 1.214
GlyThr: 4.242 ± 0.99
GlyVal: 5.453 ± 0.85
GlyTrp: 0.808 ± 0.354
GlyTyr: 3.03 ± 1.283
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.202 ± 0.172
HisCys: 0.0 ± 0.0
HisAsp: 0.202 ± 0.23
HisGlu: 0.808 ± 0.381
HisPhe: 0.808 ± 0.373
HisGly: 0.404 ± 0.267
HisHis: 0.0 ± 0.0
HisIle: 0.808 ± 0.388
HisLys: 1.414 ± 0.62
HisLeu: 1.818 ± 0.798
HisMet: 0.0 ± 0.0
HisAsn: 0.404 ± 0.421
HisPro: 0.202 ± 0.242
HisGln: 0.202 ± 0.211
HisArg: 0.404 ± 0.354
HisSer: 0.202 ± 0.205
HisThr: 1.01 ± 0.433
HisVal: 1.616 ± 0.573
HisTrp: 0.0 ± 0.0
HisTyr: 1.01 ± 0.437
HisXaa: 0.0 ± 0.0

Ile
IleAla: 7.877 ± 1.354
IleCys: 1.414 ± 0.611
IleAsp: 3.636 ± 0.616
IleGlu: 3.434 ± 0.704
IlePhe: 4.04 ± 1.01
IleGly: 4.848 ± 1.355
IleHis: 1.414 ± 0.522
IleIle: 8.685 ± 1.691
IleLys: 5.049 ± 1.368
IleLeu: 7.877 ± 1.557
IleMet: 1.414 ± 0.545
IleAsn: 4.646 ± 0.86
IlePro: 3.636 ± 0.703
IleGln: 2.222 ± 0.831
IleArg: 3.434 ± 0.985
IleSer: 6.463 ± 1.011
IleThr: 6.261 ± 1.623
IleVal: 7.271 ± 1.751
IleTrp: 1.01 ± 0.448
IleTyr: 3.434 ± 0.768
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.059 ± 1.542
LysCys: 0.606 ± 0.5
LysAsp: 3.838 ± 1.248
LysGlu: 6.261 ± 1.715
LysPhe: 2.222 ± 0.664
LysGly: 3.232 ± 0.9
LysHis: 1.414 ± 0.519
LysIle: 6.867 ± 1.9
LysLys: 6.261 ± 1.73
LysLeu: 8.281 ± 1.748
LysMet: 2.02 ± 0.594
LysAsn: 1.616 ± 0.481
LysPro: 1.818 ± 0.496
LysGln: 3.434 ± 1.113
LysArg: 4.444 ± 1.537
LysSer: 2.02 ± 0.762
LysThr: 3.434 ± 0.8
LysVal: 4.242 ± 0.947
LysTrp: 1.818 ± 0.389
LysTyr: 2.626 ± 0.893
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.261 ± 0.759
LeuCys: 0.606 ± 0.313
LeuAsp: 2.828 ± 0.923
LeuGlu: 7.271 ± 2.183
LeuPhe: 6.261 ± 1.084
LeuGly: 6.867 ± 1.477
LeuHis: 0.808 ± 0.359
LeuIle: 10.503 ± 1.49
LeuLys: 8.281 ± 1.715
LeuLeu: 10.907 ± 1.747
LeuMet: 2.222 ± 0.679
LeuAsn: 5.857 ± 1.591
LeuPro: 4.242 ± 0.832
LeuGln: 3.838 ± 0.827
LeuArg: 5.655 ± 1.565
LeuSer: 7.675 ± 1.014
LeuThr: 8.685 ± 1.251
LeuVal: 4.646 ± 1.197
LeuTrp: 0.808 ± 0.46
LeuTyr: 5.251 ± 1.018
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.818 ± 0.589
MetCys: 0.0 ± 0.0
MetAsp: 0.808 ± 0.456
MetGlu: 0.202 ± 0.186
MetPhe: 1.01 ± 0.414
MetGly: 1.818 ± 0.491
MetHis: 0.202 ± 0.177
MetIle: 1.414 ± 0.535
MetLys: 2.424 ± 0.766
MetLeu: 1.414 ± 0.641
MetMet: 0.606 ± 0.37
MetAsn: 0.808 ± 0.466
MetPro: 0.606 ± 0.405
MetGln: 0.202 ± 0.226
MetArg: 1.01 ± 0.449
MetSer: 1.414 ± 0.588
MetThr: 1.01 ± 0.554
MetVal: 1.414 ± 0.48
MetTrp: 0.0 ± 0.0
MetTyr: 0.404 ± 0.295
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.636 ± 0.961
AsnCys: 0.202 ± 0.211
AsnAsp: 2.424 ± 0.651
AsnGlu: 3.232 ± 0.965
AsnPhe: 2.424 ± 0.726
AsnGly: 4.848 ± 1.224
AsnHis: 0.202 ± 0.211
AsnIle: 4.242 ± 0.894
AsnLys: 1.414 ± 0.675
AsnLeu: 2.828 ± 0.669
AsnMet: 0.808 ± 0.339
AsnAsn: 4.04 ± 1.401
AsnPro: 2.424 ± 0.733
AsnGln: 1.212 ± 0.497
AsnArg: 1.414 ± 0.777
AsnSer: 4.04 ± 0.957
AsnThr: 3.434 ± 1.427
AsnVal: 3.232 ± 1.169
AsnTrp: 1.01 ± 0.477
AsnTyr: 4.242 ± 1.456
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.02 ± 0.572
ProCys: 0.0 ± 0.0
ProAsp: 1.414 ± 0.757
ProGlu: 1.212 ± 0.542
ProPhe: 1.818 ± 0.614
ProGly: 1.414 ± 0.562
ProHis: 0.0 ± 0.0
ProIle: 3.434 ± 0.867
ProLys: 1.212 ± 0.494
ProLeu: 3.232 ± 1.349
ProMet: 0.606 ± 0.413
ProAsn: 2.626 ± 1.129
ProPro: 1.818 ± 0.838
ProGln: 1.616 ± 0.482
ProArg: 0.808 ± 0.363
ProSer: 4.04 ± 0.936
ProThr: 1.414 ± 0.514
ProVal: 2.828 ± 0.853
ProTrp: 0.808 ± 0.521
ProTyr: 2.424 ± 0.617
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.808 ± 0.357
GlnCys: 0.202 ± 0.196
GlnAsp: 0.808 ± 0.445
GlnGlu: 1.818 ± 0.656
GlnPhe: 1.414 ± 0.576
GlnGly: 2.222 ± 0.479
GlnHis: 1.01 ± 0.56
GlnIle: 3.838 ± 0.805
GlnLys: 3.434 ± 0.968
GlnLeu: 3.838 ± 0.757
GlnMet: 1.01 ± 0.473
GlnAsn: 1.01 ± 0.432
GlnPro: 1.212 ± 0.491
GlnGln: 0.606 ± 0.274
GlnArg: 1.414 ± 0.586
GlnSer: 2.02 ± 0.789
GlnThr: 3.232 ± 1.019
GlnVal: 1.414 ± 0.574
GlnTrp: 0.202 ± 0.172
GlnTyr: 1.818 ± 0.573
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.818 ± 0.574
ArgCys: 1.01 ± 0.502
ArgAsp: 1.616 ± 0.804
ArgGlu: 2.626 ± 0.687
ArgPhe: 1.212 ± 0.382
ArgGly: 2.222 ± 0.87
ArgHis: 0.808 ± 0.451
ArgIle: 3.03 ± 0.731
ArgLys: 4.444 ± 1.386
ArgLeu: 6.059 ± 1.594
ArgMet: 0.808 ± 0.322
ArgAsn: 1.818 ± 0.761
ArgPro: 0.404 ± 0.279
ArgGln: 1.616 ± 0.683
ArgArg: 2.828 ± 0.852
ArgSer: 1.212 ± 0.664
ArgThr: 1.212 ± 0.492
ArgVal: 4.04 ± 1.11
ArgTrp: 0.202 ± 0.214
ArgTyr: 2.424 ± 0.835
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.049 ± 0.989
SerCys: 0.202 ± 0.19
SerAsp: 1.616 ± 0.478
SerGlu: 3.03 ± 0.918
SerPhe: 3.434 ± 0.958
SerGly: 5.049 ± 0.888
SerHis: 1.01 ± 0.387
SerIle: 4.848 ± 1.031
SerLys: 2.626 ± 0.822
SerLeu: 7.877 ± 1.218
SerMet: 1.01 ± 0.417
SerAsn: 4.646 ± 1.209
SerPro: 3.03 ± 0.706
SerGln: 1.818 ± 0.563
SerArg: 1.212 ± 0.636
SerSer: 5.655 ± 1.527
SerThr: 2.828 ± 0.822
SerVal: 6.261 ± 1.532
SerTrp: 0.808 ± 0.434
SerTyr: 4.242 ± 1.01
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.232 ± 1.007
ThrCys: 0.0 ± 0.0
ThrAsp: 2.02 ± 0.723
ThrGlu: 3.03 ± 0.819
ThrPhe: 1.616 ± 0.529
ThrGly: 4.242 ± 1.054
ThrHis: 0.404 ± 0.411
ThrIle: 5.655 ± 1.422
ThrLys: 3.03 ± 0.665
ThrLeu: 7.877 ± 1.345
ThrMet: 0.808 ± 0.399
ThrAsn: 3.232 ± 0.766
ThrPro: 3.232 ± 0.931
ThrGln: 1.818 ± 0.619
ThrArg: 2.02 ± 1.036
ThrSer: 3.232 ± 0.667
ThrThr: 5.453 ± 1.583
ThrVal: 5.453 ± 1.626
ThrTrp: 1.616 ± 0.626
ThrTyr: 3.838 ± 0.929
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.04 ± 0.924
ValCys: 1.01 ± 0.866
ValAsp: 2.626 ± 0.772
ValGlu: 4.242 ± 0.826
ValPhe: 3.03 ± 0.71
ValGly: 4.242 ± 0.823
ValHis: 0.606 ± 0.35
ValIle: 4.646 ± 1.47
ValLys: 4.242 ± 1.253
ValLeu: 7.877 ± 1.467
ValMet: 1.616 ± 0.622
ValAsn: 4.242 ± 1.584
ValPro: 2.626 ± 0.758
ValGln: 2.222 ± 0.608
ValArg: 2.02 ± 0.68
ValSer: 7.877 ± 1.165
ValThr: 5.655 ± 1.564
ValVal: 5.655 ± 1.083
ValTrp: 0.808 ± 0.449
ValTyr: 3.636 ± 0.887
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.404 ± 0.235
TrpCys: 0.0 ± 0.0
TrpAsp: 0.202 ± 0.195
TrpGlu: 0.404 ± 0.255
TrpPhe: 0.606 ± 0.282
TrpGly: 0.808 ± 0.383
TrpHis: 0.0 ± 0.0
TrpIle: 1.414 ± 0.566
TrpLys: 0.808 ± 0.338
TrpLeu: 2.222 ± 0.707
TrpMet: 0.202 ± 0.172
TrpAsn: 0.0 ± 0.0
TrpPro: 0.606 ± 0.46
TrpGln: 0.404 ± 0.264
TrpArg: 0.404 ± 0.281
TrpSer: 0.606 ± 0.329
TrpThr: 1.01 ± 0.579
TrpVal: 1.616 ± 0.471
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.01 ± 0.557
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.838 ± 0.848
TyrCys: 0.808 ± 0.473
TyrAsp: 1.818 ± 0.661
TyrGlu: 2.626 ± 0.785
TyrPhe: 2.222 ± 0.598
TyrGly: 3.434 ± 1.064
TyrHis: 0.606 ± 0.331
TyrIle: 3.434 ± 0.799
TyrLys: 4.646 ± 1.646
TyrLeu: 7.473 ± 1.347
TyrMet: 0.606 ± 0.313
TyrAsn: 3.03 ± 0.966
TyrPro: 2.222 ± 0.674
TyrGln: 2.02 ± 0.526
TyrArg: 2.02 ± 0.583
TyrSer: 2.828 ± 0.962
TyrThr: 5.049 ± 1.633
TyrVal: 4.444 ± 0.858
TyrTrp: 0.808 ± 0.438
TyrTyr: 4.646 ± 1.478
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 34 proteins (4952 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
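
To make the note on error estimation concrete, the following Python sketch shows one way a protein-level bootstrap could work: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. How the database actually performs this step is not specified on this page, so the use of the standard deviation, the function names `dipeptide_per_mille` and `bootstrap_error`, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)

toy = ["MKLLA", "LLKA", "MAKLLLE", "MKKA"]  # made-up, not SSV1 proteins
print(dipeptide_per_mille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the result depends on which proteins happen to be in the proteome.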

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski