Amino acid dipeptide frequency for Toros virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
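A minimal sketch of how such frequencies could be computed is given here. It assumes overlapping pairs of adjacent residues are counted within each protein (never across protein boundaries) and normalized by the total number of counted pairs; the page does not state the exact procedure, so treat both choices as assumptions. The function name and the toy sequences are hypothetical, and one-letter codes are used for brevity.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted inside each protein only, never across boundaries.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the Toros virus proteome.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Because pairs are counted per protein, the "AA" that would appear if the two toy sequences were concatenated is not counted.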

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
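The expected value can be reproduced with a few lines of Python. This is only a sketch: the page does not say exactly which threshold decides the red and blue marking, so a plain comparison against the uniform expectation is assumed here, and the function names are hypothetical.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Frequency of each dipeptide if all were equally likely, in per mille."""
    return 1000.0 / (alphabet_size ** 2)

def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"         # would be marked red
    if freq_per_mille < expected:
        return "under"        # would be marked blue
    return "as expected"

expected = expected_uniform_per_mille()   # 20 * 20 = 400 dipeptides
print(expected)                           # 2.5 (per mille, i.e. 0.25%)
print(classify(9.358, expected))          # LeuSer value from the table -> over
print(classify(0.253, expected))          # CysCys value from the table -> under
```

With Xaa included, `expected_uniform_per_mille(21)` gives 1000/441, roughly 2.27‰.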

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.541 ± 1.226
AlaCys: 1.012 ± 0.532
AlaAsp: 1.265 ± 0.411
AlaGlu: 3.541 ± 0.148
AlaPhe: 1.77 ± 0.683
AlaGly: 4.047 ± 0.824
AlaHis: 1.517 ± 0.667
AlaIle: 4.805 ± 1.657
AlaLys: 3.794 ± 1.183
AlaLeu: 7.081 ± 2.868
AlaMet: 2.023 ± 0.457
AlaAsn: 1.265 ± 0.426
AlaPro: 2.276 ± 0.755
AlaGln: 1.77 ± 0.609
AlaArg: 3.288 ± 0.495
AlaSer: 6.829 ± 0.929
AlaThr: 3.794 ± 0.998
AlaVal: 3.794 ± 0.707
AlaTrp: 0.759 ± 0.213
AlaTyr: 1.012 ± 0.502
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.759 ± 0.333
CysCys: 0.253 ± 0.154
CysAsp: 1.265 ± 0.952
CysGlu: 2.023 ± 0.35
CysPhe: 1.517 ± 0.667
CysGly: 1.012 ± 0.854
CysHis: 0.506 ± 0.427
CysIle: 0.759 ± 1.123
CysLys: 2.276 ± 0.639
CysLeu: 1.517 ± 0.963
CysMet: 0.506 ± 0.148
CysAsn: 0.759 ± 0.213
CysPro: 0.759 ± 0.333
CysGln: 0.506 ± 0.148
CysArg: 2.023 ± 0.863
CysSer: 3.035 ± 0.725
CysThr: 2.023 ± 0.895
CysVal: 2.529 ± 1.586
CysTrp: 0.0 ± 0.0
CysTyr: 1.265 ± 0.469
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.288 ± 1.286
AspCys: 0.506 ± 0.308
AspAsp: 3.794 ± 1.452
AspGlu: 3.541 ± 0.805
AspPhe: 1.77 ± 0.576
AspGly: 4.047 ± 1.18
AspHis: 0.506 ± 0.308
AspIle: 2.529 ± 0.377
AspLys: 3.288 ± 0.869
AspLeu: 5.058 ± 1.424
AspMet: 2.276 ± 1.103
AspAsn: 3.794 ± 0.985
AspPro: 3.288 ± 1.237
AspGln: 1.012 ± 0.615
AspArg: 2.023 ± 0.457
AspSer: 5.817 ± 2.231
AspThr: 3.541 ± 1.091
AspVal: 3.794 ± 0.998
AspTrp: 1.012 ± 0.295
AspTyr: 1.012 ± 0.9
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.552 ± 1.22
GluCys: 2.276 ± 1.172
GluAsp: 6.829 ± 1.527
GluGlu: 5.564 ± 0.958
GluPhe: 4.299 ± 0.706
GluGly: 4.552 ± 0.608
GluHis: 1.012 ± 1.036
GluIle: 5.817 ± 1.593
GluLys: 5.817 ± 1.306
GluLeu: 7.587 ± 1.945
GluMet: 2.529 ± 0.379
GluAsn: 2.023 ± 0.81
GluPro: 2.276 ± 1.439
GluGln: 1.265 ± 0.413
GluArg: 3.288 ± 0.813
GluSer: 4.552 ± 1.543
GluThr: 2.529 ± 0.396
GluVal: 5.817 ± 1.219
GluTrp: 0.506 ± 0.538
GluTyr: 1.265 ± 0.484
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.035 ± 0.737
PheCys: 1.265 ± 0.333
PheAsp: 3.794 ± 0.817
PheGlu: 2.529 ± 0.394
PhePhe: 2.023 ± 0.934
PheGly: 3.035 ± 1.146
PheHis: 1.265 ± 0.411
PheIle: 1.77 ± 0.399
PheLys: 1.77 ± 0.547
PheLeu: 3.541 ± 0.771
PheMet: 1.517 ± 0.662
PheAsn: 2.529 ± 0.966
PhePro: 1.517 ± 1.023
PheGln: 0.506 ± 0.148
PheArg: 2.529 ± 1.25
PheSer: 4.299 ± 1.34
PheThr: 2.276 ± 0.823
PheVal: 3.541 ± 1.565
PheTrp: 0.506 ± 0.518
PheTyr: 0.506 ± 0.518
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.805 ± 0.803
GlyCys: 1.77 ± 0.611
GlyAsp: 3.541 ± 0.96
GlyGlu: 2.276 ± 0.37
GlyPhe: 4.552 ± 1.326
GlyGly: 4.805 ± 0.941
GlyHis: 0.506 ± 0.308
GlyIle: 2.276 ± 0.755
GlyLys: 4.552 ± 0.827
GlyLeu: 5.058 ± 2.902
GlyMet: 2.023 ± 0.591
GlyAsn: 1.517 ± 0.514
GlyPro: 2.529 ± 0.632
GlyGln: 1.517 ± 0.368
GlyArg: 4.299 ± 1.006
GlySer: 5.311 ± 1.288
GlyThr: 2.276 ± 1.29
GlyVal: 4.299 ± 0.699
GlyTrp: 1.012 ± 0.74
GlyTyr: 2.276 ± 0.951
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.265 ± 0.654
HisAsp: 1.517 ± 0.443
HisGlu: 1.265 ± 0.333
HisPhe: 1.265 ± 0.411
HisGly: 1.517 ± 0.426
HisHis: 0.253 ± 0.214
HisIle: 1.517 ± 0.632
HisLys: 1.012 ± 0.783
HisLeu: 1.265 ± 0.333
HisMet: 0.0 ± 0.0
HisAsn: 1.012 ± 0.295
HisPro: 1.012 ± 0.602
HisGln: 0.759 ± 0.461
HisArg: 1.77 ± 0.347
HisSer: 1.517 ± 0.426
HisThr: 0.506 ± 0.148
HisVal: 0.759 ± 0.213
HisTrp: 0.0 ± 0.0
HisTyr: 2.276 ± 0.339
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.288 ± 0.882
IleCys: 1.77 ± 0.683
IleAsp: 2.023 ± 1.459
IleGlu: 6.829 ± 1.502
IlePhe: 2.276 ± 0.339
IleGly: 3.288 ± 1.046
IleHis: 1.265 ± 0.751
IleIle: 4.552 ± 1.261
IleLys: 4.047 ± 0.505
IleLeu: 4.299 ± 1.357
IleMet: 2.023 ± 0.646
IleAsn: 3.541 ± 0.733
IlePro: 1.77 ± 0.611
IleGln: 1.77 ± 0.609
IleArg: 3.794 ± 0.734
IleSer: 4.047 ± 1.875
IleThr: 1.265 ± 0.469
IleVal: 7.84 ± 0.905
IleTrp: 0.253 ± 0.154
IleTyr: 0.759 ± 0.461
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.552 ± 0.589
LysCys: 1.77 ± 0.995
LysAsp: 2.023 ± 0.539
LysGlu: 5.311 ± 1.23
LysPhe: 3.035 ± 0.757
LysGly: 2.023 ± 0.539
LysHis: 0.506 ± 0.148
LysIle: 4.299 ± 0.95
LysLys: 5.564 ± 1.126
LysLeu: 6.829 ± 1.864
LysMet: 3.288 ± 0.952
LysAsn: 2.023 ± 0.591
LysPro: 3.288 ± 0.866
LysGln: 1.517 ± 0.443
LysArg: 2.782 ± 0.907
LysSer: 5.311 ± 1.632
LysThr: 4.299 ± 0.87
LysVal: 4.805 ± 1.392
LysTrp: 0.759 ± 0.461
LysTyr: 1.77 ± 1.146
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.805 ± 0.519
LeuCys: 1.012 ± 0.615
LeuAsp: 3.541 ± 1.425
LeuGlu: 7.587 ± 0.953
LeuPhe: 3.541 ± 1.014
LeuGly: 3.794 ± 1.067
LeuHis: 1.77 ± 0.783
LeuIle: 5.817 ± 1.422
LeuLys: 4.047 ± 1.078
LeuLeu: 5.564 ± 1.986
LeuMet: 2.529 ± 0.394
LeuAsn: 2.529 ± 0.665
LeuPro: 3.035 ± 0.245
LeuGln: 3.794 ± 1.082
LeuArg: 6.323 ± 0.683
LeuSer: 9.358 ± 1.97
LeuThr: 5.058 ± 1.782
LeuVal: 6.323 ± 0.563
LeuTrp: 1.012 ± 0.54
LeuTyr: 3.288 ± 0.48
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.276 ± 0.951
MetCys: 1.012 ± 0.295
MetAsp: 1.517 ± 0.368
MetGlu: 3.541 ± 0.796
MetPhe: 0.759 ± 0.55
MetGly: 2.023 ± 0.682
MetHis: 1.517 ± 0.368
MetIle: 2.529 ± 0.396
MetLys: 1.265 ± 0.333
MetLeu: 2.023 ± 0.682
MetMet: 2.782 ± 1.129
MetAsn: 2.276 ± 0.785
MetPro: 0.253 ± 0.214
MetGln: 2.276 ± 0.5
MetArg: 1.77 ± 0.783
MetSer: 2.023 ± 1.152
MetThr: 2.782 ± 0.9
MetVal: 0.506 ± 0.308
MetTrp: 0.253 ± 0.154
MetTyr: 0.253 ± 0.214
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.782 ± 0.903
AsnCys: 0.506 ± 0.427
AsnAsp: 3.541 ± 0.789
AsnGlu: 3.541 ± 0.607
AsnPhe: 2.529 ± 0.686
AsnGly: 2.023 ± 0.565
AsnHis: 1.012 ± 0.54
AsnIle: 1.77 ± 0.347
AsnLys: 3.288 ± 0.205
AsnLeu: 4.047 ± 1.862
AsnMet: 0.759 ± 0.213
AsnAsn: 1.012 ± 0.54
AsnPro: 2.782 ± 0.886
AsnGln: 0.506 ± 0.518
AsnArg: 1.77 ± 0.611
AsnSer: 2.782 ± 0.6
AsnThr: 2.276 ± 1.0
AsnVal: 1.77 ± 0.385
AsnTrp: 0.253 ± 0.214
AsnTyr: 1.265 ± 0.426
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.782 ± 0.771
ProCys: 0.253 ± 0.214
ProAsp: 2.782 ± 1.178
ProGlu: 3.541 ± 1.348
ProPhe: 3.035 ± 0.714
ProGly: 3.794 ± 0.797
ProHis: 0.506 ± 0.148
ProIle: 1.517 ± 0.527
ProLys: 1.77 ± 0.385
ProLeu: 2.782 ± 0.392
ProMet: 1.012 ± 0.74
ProAsn: 1.265 ± 0.484
ProPro: 0.506 ± 0.427
ProGln: 0.759 ± 0.641
ProArg: 2.023 ± 0.81
ProSer: 3.288 ± 1.021
ProThr: 2.023 ± 0.591
ProVal: 2.023 ± 1.104
ProTrp: 0.506 ± 0.308
ProTyr: 0.759 ± 0.48
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.77 ± 1.262
GlnCys: 1.012 ± 0.295
GlnAsp: 1.012 ± 0.532
GlnGlu: 2.023 ± 1.204
GlnPhe: 0.759 ± 0.48
GlnGly: 2.529 ± 0.686
GlnHis: 1.012 ± 0.615
GlnIle: 1.517 ± 0.632
GlnLys: 1.517 ± 0.514
GlnLeu: 2.782 ± 0.392
GlnMet: 0.506 ± 0.538
GlnAsn: 1.77 ± 1.162
GlnPro: 1.265 ± 0.451
GlnGln: 1.265 ± 0.484
GlnArg: 2.023 ± 0.457
GlnSer: 2.023 ± 0.401
GlnThr: 1.77 ± 0.385
GlnVal: 2.023 ± 0.682
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.012 ± 0.532
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.035 ± 1.333
ArgCys: 1.517 ± 0.443
ArgAsp: 5.058 ± 2.149
ArgGlu: 5.058 ± 0.89
ArgPhe: 1.012 ± 0.295
ArgGly: 3.794 ± 2.22
ArgHis: 0.506 ± 0.308
ArgIle: 4.047 ± 0.752
ArgLys: 3.541 ± 0.148
ArgLeu: 3.035 ± 0.493
ArgMet: 2.529 ± 0.529
ArgAsn: 2.529 ± 0.377
ArgPro: 2.782 ± 0.754
ArgGln: 2.276 ± 0.562
ArgArg: 4.552 ± 0.276
ArgSer: 3.794 ± 0.385
ArgThr: 3.541 ± 1.307
ArgVal: 1.77 ± 0.647
ArgTrp: 1.012 ± 0.615
ArgTyr: 2.023 ± 0.682
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.564 ± 1.035
SerCys: 3.288 ± 1.873
SerAsp: 5.058 ± 0.718
SerGlu: 6.07 ± 1.049
SerPhe: 4.552 ± 1.903
SerGly: 4.552 ± 1.981
SerHis: 2.529 ± 0.665
SerIle: 6.829 ± 1.341
SerLys: 5.058 ± 0.439
SerLeu: 7.081 ± 0.847
SerMet: 2.276 ± 1.0
SerAsn: 2.276 ± 1.384
SerPro: 2.276 ± 0.562
SerGln: 2.782 ± 0.963
SerArg: 4.299 ± 0.964
SerSer: 9.105 ± 1.947
SerThr: 5.058 ± 1.704
SerVal: 3.794 ± 0.817
SerTrp: 2.023 ± 1.104
SerTyr: 2.782 ± 0.857
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.529 ± 0.665
ThrCys: 2.276 ± 1.29
ThrAsp: 2.529 ± 0.665
ThrGlu: 3.035 ± 1.079
ThrPhe: 2.782 ± 0.886
ThrGly: 4.805 ± 1.267
ThrHis: 0.759 ± 0.213
ThrIle: 2.529 ± 0.753
ThrLys: 4.299 ± 1.105
ThrLeu: 6.576 ± 1.406
ThrMet: 1.012 ± 0.295
ThrAsn: 4.047 ± 1.181
ThrPro: 0.506 ± 0.148
ThrGln: 1.517 ± 0.923
ThrArg: 3.288 ± 1.299
ThrSer: 4.047 ± 0.505
ThrThr: 2.782 ± 0.9
ThrVal: 3.035 ± 0.983
ThrTrp: 0.253 ± 0.214
ThrTyr: 1.265 ± 1.281
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.552 ± 1.107
ValCys: 1.517 ± 0.814
ValAsp: 3.288 ± 0.813
ValGlu: 3.288 ± 1.642
ValPhe: 1.517 ± 0.632
ValGly: 2.782 ± 0.352
ValHis: 2.276 ± 0.339
ValIle: 2.529 ± 0.444
ValLys: 5.817 ± 0.589
ValLeu: 4.299 ± 1.171
ValMet: 1.517 ± 0.443
ValAsn: 2.276 ± 1.441
ValPro: 2.529 ± 0.487
ValGln: 2.782 ± 0.979
ValArg: 3.541 ± 0.937
ValSer: 6.576 ± 1.739
ValThr: 3.541 ± 0.7
ValVal: 7.84 ± 1.029
ValTrp: 1.265 ± 0.484
ValTyr: 4.299 ± 0.964
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.759 ± 0.461
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.77 ± 0.647
TrpPhe: 0.253 ± 0.154
TrpGly: 1.012 ± 0.532
TrpHis: 0.253 ± 0.154
TrpIle: 1.265 ± 0.469
TrpLys: 0.0 ± 0.0
TrpLeu: 0.759 ± 0.475
TrpMet: 0.759 ± 0.213
TrpAsn: 0.759 ± 0.333
TrpPro: 0.253 ± 0.568
TrpGln: 0.253 ± 0.214
TrpArg: 0.506 ± 0.308
TrpSer: 1.517 ± 0.443
TrpThr: 1.012 ± 0.437
TrpVal: 0.759 ± 0.461
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.253 ± 0.154
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.506 ± 0.148
TyrCys: 1.012 ± 0.532
TyrAsp: 1.517 ± 1.023
TyrGlu: 2.023 ± 0.839
TyrPhe: 0.759 ± 0.461
TyrGly: 1.77 ± 0.42
TyrHis: 1.012 ± 0.341
TyrIle: 2.023 ± 0.563
TyrLys: 3.035 ± 1.504
TyrLeu: 3.541 ± 1.354
TyrMet: 1.517 ± 0.661
TyrAsn: 1.012 ± 0.341
TyrPro: 2.023 ± 1.167
TyrGln: 0.759 ± 1.103
TyrArg: 1.517 ± 0.791
TyrSer: 2.023 ± 0.35
TyrThr: 1.517 ± 0.632
TyrVal: 1.012 ± 0.341
TyrTrp: 0.759 ± 0.333
TyrTyr: 0.253 ± 0.154
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3955 amino acids)
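
For readers who want to work with the table entries programmatically, a small parser might look like the sketch below. It assumes each entry contains a label of two three-letter codes followed by `: mean ± error`, as in `LeuSer: 9.358 ± 1.97`; the pattern searches within the text, so anything printed before the label is ignored. The names `ENTRY` and `parse_entries` are hypothetical.

```python
import re

# Two three-letter residue codes, then "mean ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)

def parse_entries(text):
    """Return {(first, second): (mean, error)} with values in per mille."""
    table = {}
    for first, second, mean, err in ENTRY.findall(text):
        table[(first, second)] = (float(mean), float(err))
    return table

# Three entries copied from the table above.
sample = """AlaAla: 3.541 ± 1.226
LeuSer: 9.358 ± 1.97
SerSer: 9.105 ± 1.947"""

table = parse_entries(sample)
top = max(table, key=lambda key: table[key][0])
print(top, table[top])  # ('Leu', 'Ser') (9.358, 1.97)
```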

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
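
One way such a protein-level bootstrap could work is sketched here. The assumptions are that whole proteins are resampled with replacement, that the proteome size is kept fixed, and that the reported error is the standard deviation of the resampled estimates; the page does not spell out these details. The function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, same number as the original.
        resampled = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(resampled, pair))
    return statistics.pstdev(estimates)

# Hypothetical toy proteome, not the Toros virus sequences.
toy = ["MKKLA", "AKKKG", "GLLSA", "MSSKK"]
print(round(pair_frequency(toy, "KK"), 3), "±", round(bootstrap_error(toy, "KK"), 3))
```

Resampling at the protein level rather than the residue level keeps each protein's internal composition intact, which is why proteomes with only a few proteins can show large errors.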

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski