Amino acid dipeptide frequency for Asikkala orthohantavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of them would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
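The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, sequences use one-letter codes, pairs are counted with overlapping windows and pooled over all proteins, and the toy sequences are invented. The database itself may count or normalise differently.

```python
from collections import Counter

# Uniform expectation for the 20 standard residues: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, one-letter codes), not the real proteome.
toy_proteins = ["MKKLLA", "MKLA"]
toy_freqs = dipeptide_per_mille(toy_proteins)

# Converting a per mille value back to a fraction: multiply by 10^-3.
mk_fraction = toy_freqs["MK"] * 1e-3
```

With the table values, `representation(3.359)` would label AlaAla as overrepresented and `representation(1.12)` would label AlaCys as underrepresented relative to the 2.5‰ uniform expectation.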

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰). Rows: first residue; columns: second residue.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.359 ± 2.113 | 1.12 ± 0.403 | 2.8 ± 1.421 | 3.359 ± 1.341 | 3.359 ± 0.783 | 3.919 ± 1.022 | 3.359 ± 2.531 | 3.359 ± 1.302 | 4.479 ± 1.047 | 6.719 ± 1.622 | 0.0 ± 0.0 | 0.56 ± 0.542 | 1.12 ± 0.482 | 5.039 ± 2.072 | 1.12 ± 0.482 | 4.479 ± 0.197 | 6.159 ± 1.087 | 3.919 ± 0.522 | 1.12 ± 0.482 | 2.24 ± 0.798 | 0.0 ± 0.0
Cys | 1.12 ± 0.711 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.24 ± 2.209 | 2.8 ± 1.28 | 1.12 ± 0.646 | 0.0 ± 0.0 | 1.12 ± 0.403 | 2.24 ± 1.437 | 0.56 ± 0.356 | 1.12 ± 1.104 | 0.56 ± 0.552 | 3.359 ± 2.145 | 2.24 ± 0.806 | 0.56 ± 0.356 | 0.56 ± 0.356 | 1.68 ± 1.657 | 2.24 ± 0.806 | 0.0 ± 0.0 | 1.68 ± 1.657 | 0.0 ± 0.0
Asp | 3.919 ± 0.75 | 2.24 ± 0.743 | 4.479 ± 2.378 | 2.8 ± 1.421 | 2.24 ± 0.743 | 2.24 ± 0.798 | 1.68 ± 0.295 | 2.8 ± 0.454 | 3.359 ± 0.567 | 4.479 ± 0.827 | 1.68 ± 0.961 | 2.24 ± 0.099 | 2.8 ± 0.939 | 3.919 ± 0.75 | 3.359 ± 1.923 | 2.8 ± 0.454 | 3.359 ± 1.115 | 1.12 ± 0.482 | 0.56 ± 0.542 | 2.24 ± 0.773 | 0.0 ± 0.0
Glu | 5.039 ± 0.514 | 1.12 ± 0.403 | 3.359 ± 0.335 | 5.599 ± 0.286 | 0.56 ± 0.356 | 0.56 ± 0.542 | 0.0 ± 0.0 | 2.8 ± 1.421 | 5.039 ± 1.162 | 8.399 ± 1.763 | 0.56 ± 0.483 | 2.8 ± 0.863 | 2.8 ± 1.682 | 1.68 ± 0.651 | 2.8 ± 0.577 | 3.919 ± 0.633 | 2.8 ± 0.577 | 4.479 ± 1.576 | 1.12 ± 0.403 | 3.919 ± 2.333 | 0.0 ± 0.0
Phe | 3.359 ± 1.006 | 0.56 ± 0.552 | 2.8 ± 1.119 | 4.479 ± 2.149 | 3.359 ± 1.586 | 1.12 ± 1.104 | 2.24 ± 0.743 | 2.24 ± 0.099 | 3.359 ± 1.209 | 5.599 ± 0.286 | 0.56 ± 0.356 | 1.12 ± 1.104 | 3.359 ± 1.302 | 1.68 ± 0.522 | 2.8 ± 0.863 | 5.039 ± 2.013 | 2.24 ± 1.437 | 2.8 ± 0.939 | 0.56 ± 0.552 | 1.68 ± 0.651 | 0.0 ± 0.0
Gly | 2.24 ± 0.964 | 1.68 ± 1.657 | 4.479 ± 0.65 | 1.68 ± 1.067 | 3.359 ± 1.006 | 1.12 ± 0.403 | 1.68 ± 0.522 | 1.68 ± 0.899 | 2.8 ± 1.089 | 6.159 ± 0.539 | 1.68 ± 0.651 | 4.479 ± 1.493 | 1.68 ± 0.522 | 2.8 ± 1.302 | 3.359 ± 1.839 | 3.359 ± 1.045 | 4.479 ± 1.486 | 2.24 ± 0.773 | 1.12 ± 1.104 | 1.12 ± 0.403 | 0.0 ± 0.0
His | 1.68 ± 1.067 | 1.12 ± 1.104 | 0.0 ± 0.0 | 2.24 ± 0.798 | 2.24 ± 0.099 | 2.24 ± 0.806 | 0.56 ± 0.356 | 1.12 ± 0.403 | 1.12 ± 0.711 | 3.359 ± 1.302 | 0.56 ± 0.309 | 2.24 ± 0.798 | 0.56 ± 0.356 | 0.0 ± 0.0 | 0.56 ± 0.552 | 1.12 ± 0.592 | 1.68 ± 0.295 | 0.56 ± 0.374 | 1.12 ± 0.403 | 1.12 ± 0.403 | 0.0 ± 0.0
Ile | 5.599 ± 1.982 | 1.12 ± 1.104 | 4.479 ± 1.047 | 3.359 ± 0.783 | 2.8 ± 1.28 | 3.359 ± 0.335 | 1.68 ± 0.522 | 3.919 ± 1.553 | 2.24 ± 0.743 | 7.839 ± 0.808 | 0.56 ± 0.552 | 2.24 ± 0.806 | 1.68 ± 0.522 | 2.24 ± 0.932 | 3.919 ± 1.928 | 6.719 ± 1.55 | 6.159 ± 0.902 | 2.24 ± 0.099 | 0.0 ± 0.0 | 3.359 ± 1.045 | 0.0 ± 0.0
Lys | 3.919 ± 0.522 | 1.12 ± 0.403 | 1.68 ± 1.056 | 4.479 ± 1.534 | 4.479 ± 1.613 | 3.919 ± 0.75 | 3.359 ± 1.456 | 6.159 ± 0.539 | 6.719 ± 1.133 | 6.719 ± 1.393 | 1.12 ± 0.482 | 3.359 ± 1.115 | 1.12 ± 0.646 | 1.68 ± 1.073 | 2.24 ± 0.798 | 5.039 ± 1.953 | 4.479 ± 1.34 | 3.359 ± 0.335 | 1.12 ± 0.711 | 3.359 ± 1.456 | 0.0 ± 0.0
Leu | 3.359 ± 0.59 | 1.68 ± 0.899 | 7.279 ± 0.687 | 11.198 ± 1.722 | 4.479 ± 0.827 | 5.599 ± 0.546 | 2.24 ± 0.798 | 7.839 ± 1.822 | 7.839 ± 1.882 | 9.518 ± 1.986 | 1.12 ± 0.314 | 5.039 ± 1.911 | 2.24 ± 1.422 | 5.599 ± 0.655 | 6.719 ± 0.296 | 6.159 ± 3.397 | 3.359 ± 0.59 | 5.599 ± 1.473 | 1.12 ± 0.403 | 4.479 ± 1.927 | 0.0 ± 0.0
Met | 2.24 ± 0.773 | 0.56 ± 0.552 | 0.56 ± 0.542 | 1.68 ± 0.295 | 1.12 ± 1.104 | 1.68 ± 1.625 | 0.0 ± 0.0 | 1.68 ± 0.961 | 1.68 ± 0.651 | 1.68 ± 0.961 | 0.0 ± 0.0 | 1.12 ± 0.403 | 0.56 ± 0.542 | 0.0 ± 0.0 | 1.68 ± 1.056 | 2.24 ± 1.422 | 0.56 ± 0.542 | 1.12 ± 1.104 | 0.0 ± 0.0 | 0.56 ± 0.356 | 0.0 ± 0.0
Asn | 2.8 ± 0.577 | 0.56 ± 0.356 | 2.8 ± 1.28 | 2.24 ± 0.806 | 1.68 ± 0.522 | 1.12 ± 0.711 | 0.0 ± 0.0 | 3.919 ± 1.801 | 3.359 ± 0.567 | 5.039 ± 1.342 | 2.24 ± 0.099 | 3.359 ± 1.456 | 2.24 ± 0.806 | 0.56 ± 0.552 | 1.68 ± 0.295 | 2.24 ± 1.422 | 3.359 ± 2.145 | 2.8 ± 1.251 | 1.68 ± 0.295 | 0.56 ± 0.356 | 0.0 ± 0.0
Pro | 2.24 ± 0.932 | 0.0 ± 0.0 | 3.359 ± 0.59 | 2.24 ± 0.099 | 1.68 ± 0.961 | 3.919 ± 0.806 | 0.56 ± 0.356 | 0.56 ± 0.552 | 0.56 ± 0.552 | 2.24 ± 1.437 | 1.68 ± 0.295 | 0.56 ± 0.542 | 0.56 ± 0.542 | 1.12 ± 1.104 | 2.24 ± 0.099 | 3.359 ± 0.783 | 4.479 ± 1.047 | 1.12 ± 1.083 | 0.56 ± 0.552 | 1.12 ± 0.403 | 0.0 ± 0.0
Gln | 2.8 ± 0.577 | 1.68 ± 0.899 | 1.68 ± 0.961 | 0.56 ± 0.356 | 2.8 ± 0.454 | 3.359 ± 1.798 | 0.56 ± 0.356 | 0.0 ± 0.0 | 1.68 ± 0.961 | 5.599 ± 1.154 | 0.56 ± 0.542 | 2.24 ± 0.099 | 1.12 ± 0.403 | 1.68 ± 0.522 | 2.8 ± 1.421 | 5.039 ± 1.629 | 3.359 ± 1.115 | 3.359 ± 1.341 | 0.56 ± 0.552 | 0.56 ± 0.356 | 0.0 ± 0.0
Arg | 1.68 ± 1.625 | 1.68 ± 0.295 | 2.8 ± 0.431 | 0.56 ± 0.542 | 2.8 ± 0.863 | 1.68 ± 0.295 | 1.12 ± 0.482 | 3.919 ± 1.022 | 6.719 ± 1.244 | 3.919 ± 1.138 | 0.0 ± 0.0 | 4.479 ± 0.827 | 1.12 ± 0.711 | 2.8 ± 2.708 | 2.24 ± 0.773 | 3.919 ± 1.138 | 2.8 ± 1.302 | 1.12 ± 0.403 | 0.56 ± 0.356 | 5.599 ± 0.861 | 0.0 ± 0.0
Ser | 3.359 ± 1.045 | 2.24 ± 2.209 | 3.919 ± 1.655 | 2.8 ± 1.28 | 4.479 ± 0.197 | 5.599 ± 1.039 | 2.24 ± 0.932 | 6.719 ± 0.671 | 6.159 ± 1.315 | 9.518 ± 3.599 | 3.919 ± 2.596 | 2.24 ± 0.743 | 3.359 ± 1.115 | 2.24 ± 0.932 | 2.24 ± 0.798 | 5.599 ± 0.861 | 1.12 ± 0.711 | 3.359 ± 0.783 | 0.56 ± 0.356 | 1.12 ± 0.482 | 0.0 ± 0.0
Thr | 6.719 ± 1.55 | 0.56 ± 0.356 | 2.8 ± 0.577 | 2.8 ± 0.577 | 2.24 ± 0.743 | 2.8 ± 0.577 | 1.12 ± 0.403 | 5.599 ± 2.058 | 3.359 ± 1.209 | 5.039 ± 2.662 | 0.56 ± 0.552 | 2.24 ± 0.932 | 1.68 ± 0.295 | 2.24 ± 0.743 | 5.039 ± 0.514 | 5.039 ± 2.23 | 2.8 ± 0.454 | 5.599 ± 1.315 | 1.12 ± 0.403 | 2.8 ± 0.939 | 0.0 ± 0.0
Val | 2.8 ± 0.939 | 3.359 ± 1.006 | 3.919 ± 1.022 | 3.919 ± 1.138 | 2.24 ± 0.099 | 3.919 ± 0.522 | 1.68 ± 0.522 | 4.479 ± 0.65 | 2.24 ± 0.964 | 4.479 ± 1.863 | 1.12 ± 0.482 | 1.12 ± 0.482 | 1.68 ± 0.295 | 1.68 ± 1.073 | 1.68 ± 0.651 | 2.8 ± 0.454 | 4.479 ± 1.613 | 1.68 ± 0.651 | 1.12 ± 0.646 | 0.56 ± 0.356 | 0.0 ± 0.0
Trp | 2.8 ± 0.863 | 0.56 ± 0.552 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.68 ± 0.899 | 1.12 ± 0.646 | 0.56 ± 0.356 | 1.68 ± 1.073 | 0.56 ± 0.356 | 1.12 ± 0.711 | 0.0 ± 0.0 | 1.12 ± 0.403 | 0.0 ± 0.0 | 0.56 ± 0.552 | 0.0 ± 0.0 | 1.68 ± 0.522 | 0.56 ± 0.542 | 0.56 ± 0.542 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.56 ± 0.356 | 2.24 ± 0.806 | 1.12 ± 0.482 | 1.68 ± 0.899 | 0.56 ± 0.356 | 2.8 ± 0.431 | 0.56 ± 0.356 | 3.919 ± 0.75 | 3.919 ± 1.242 | 4.479 ± 1.027 | 1.12 ± 1.083 | 1.12 ± 0.711 | 1.12 ± 0.403 | 2.24 ± 0.773 | 4.479 ± 1.34 | 1.68 ± 0.522 | 2.24 ± 1.437 | 1.68 ± 0.651 | 0.56 ± 0.356 | 1.68 ± 1.067 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 3 proteins (1787 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
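As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of the resulting frequency estimates. Several points are assumptions rather than details confirmed by the source: the error is taken to be the sample standard deviation of the resampled estimates, the function names and toy sequences are hypothetical, and the fixed seed is there only to make the example reproducible.

```python
import random
from collections import Counter
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency estimate under protein-level resampling.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    return stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes), not the real proteome.
toy_proteins = ["MKKLLA", "MKLA", "AKKA"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

With only 3 proteins, as in this proteome, each resample draws from a very small pool, so error estimates of this kind are coarse; this may be one reason some errors in the table are nearly as large as the frequencies themselves.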

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski