Amino acid dipepetide frequency for Apore mammarenavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.724AlaAla: 2.724 ± 0.348
0.908AlaCys: 0.908 ± 0.315
2.119AlaAsp: 2.119 ± 2.226
3.329AlaGlu: 3.329 ± 0.697
1.211AlaPhe: 1.211 ± 0.331
1.513AlaGly: 1.513 ± 0.803
0.908AlaHis: 0.908 ± 0.504
2.421AlaIle: 2.421 ± 0.661
1.513AlaLys: 1.513 ± 0.803
8.777AlaLeu: 8.777 ± 0.817
1.211AlaMet: 1.211 ± 0.393
1.816AlaAsn: 1.816 ± 0.746
0.908AlaPro: 0.908 ± 0.721
2.421AlaGln: 2.421 ± 0.787
1.211AlaArg: 1.211 ± 0.393
1.211AlaSer: 1.211 ± 0.331
0.908AlaThr: 0.908 ± 0.345
2.421AlaVal: 2.421 ± 0.331
0.0AlaTrp: 0.0 ± 0.0
0.908AlaTyr: 0.908 ± 0.504
0.0AlaXaa: 0.0 ± 0.0
Cys
0.605CysAla: 0.605 ± 0.315
0.303CysCys: 0.303 ± 0.795
1.816CysAsp: 1.816 ± 0.746
0.908CysGlu: 0.908 ± 0.504
2.119CysPhe: 2.119 ± 0.689
0.908CysGly: 0.908 ± 0.925
1.211CysHis: 1.211 ± 0.671
0.908CysIle: 0.908 ± 0.504
1.816CysLys: 1.816 ± 0.66
2.421CysLeu: 2.421 ± 1.949
0.908CysMet: 0.908 ± 0.488
1.513CysAsn: 1.513 ± 0.732
0.303CysPro: 0.303 ± 0.512
1.513CysGln: 1.513 ± 0.732
1.513CysArg: 1.513 ± 1.594
2.421CysSer: 2.421 ± 0.8
0.303CysThr: 0.303 ± 0.168
2.421CysVal: 2.421 ± 0.864
0.908CysTrp: 0.908 ± 0.925
0.908CysTyr: 0.908 ± 0.315
0.0CysXaa: 0.0 ± 0.0
Asp
1.211AspAla: 1.211 ± 0.791
1.513AspCys: 1.513 ± 0.96
2.119AspAsp: 2.119 ± 0.812
3.329AspGlu: 3.329 ± 1.081
4.237AspPhe: 4.237 ± 0.728
3.632AspGly: 3.632 ± 0.647
1.513AspHis: 1.513 ± 0.732
3.935AspIle: 3.935 ± 0.707
3.027AspLys: 3.027 ± 0.881
9.08AspLeu: 9.08 ± 2.741
2.724AspMet: 2.724 ± 0.426
2.119AspAsn: 2.119 ± 0.548
2.724AspPro: 2.724 ± 1.131
3.632AspGln: 3.632 ± 1.622
1.816AspArg: 1.816 ± 0.572
3.329AspSer: 3.329 ± 0.56
2.421AspThr: 2.421 ± 0.398
2.119AspVal: 2.119 ± 0.915
0.605AspTrp: 0.605 ± 0.403
1.513AspTyr: 1.513 ± 0.517
0.0AspXaa: 0.0 ± 0.0
Glu
2.119GluAla: 2.119 ± 0.249
1.513GluCys: 1.513 ± 0.839
4.54GluAsp: 4.54 ± 2.141
3.027GluGlu: 3.027 ± 1.293
2.724GluPhe: 2.724 ± 1.035
2.119GluGly: 2.119 ± 0.915
1.513GluHis: 1.513 ± 0.193
3.935GluIle: 3.935 ± 0.922
3.027GluLys: 3.027 ± 1.577
5.145GluLeu: 5.145 ± 0.955
2.119GluMet: 2.119 ± 0.713
2.724GluAsn: 2.724 ± 0.348
1.816GluPro: 1.816 ± 0.712
2.119GluGln: 2.119 ± 0.713
3.632GluArg: 3.632 ± 0.653
4.843GluSer: 4.843 ± 1.288
4.237GluThr: 4.237 ± 0.553
6.053GluVal: 6.053 ± 1.444
0.908GluTrp: 0.908 ± 0.345
1.513GluTyr: 1.513 ± 0.449
0.0GluXaa: 0.0 ± 0.0
Phe
0.908PheAla: 0.908 ± 0.823
1.816PheCys: 1.816 ± 0.746
1.211PheAsp: 1.211 ± 0.671
3.632PheGlu: 3.632 ± 1.053
3.027PhePhe: 3.027 ± 0.897
2.724PheGly: 2.724 ± 0.943
0.303PheHis: 0.303 ± 0.168
2.119PheIle: 2.119 ± 1.175
3.632PheLys: 3.632 ± 0.653
6.356PheLeu: 6.356 ± 2.522
0.908PheMet: 0.908 ± 0.504
3.027PheAsn: 3.027 ± 1.027
0.605PhePro: 0.605 ± 0.65
1.513PheGln: 1.513 ± 0.642
2.119PheArg: 2.119 ± 0.548
5.448PheSer: 5.448 ± 0.696
3.329PheThr: 3.329 ± 0.537
2.724PheVal: 2.724 ± 0.558
1.211PheTrp: 1.211 ± 0.806
1.513PheTyr: 1.513 ± 0.449
0.0PheXaa: 0.0 ± 0.0
Gly
1.816GlyAla: 1.816 ± 0.146
0.605GlyCys: 0.605 ± 0.403
3.935GlyAsp: 3.935 ± 1.471
2.724GlyGlu: 2.724 ± 0.59
4.237GlyPhe: 4.237 ± 0.497
3.329GlyGly: 3.329 ± 0.654
0.908GlyHis: 0.908 ± 0.696
2.119GlyIle: 2.119 ± 0.713
2.724GlyLys: 2.724 ± 0.943
6.053GlyLeu: 6.053 ± 1.586
1.211GlyMet: 1.211 ± 1.3
4.843GlyAsn: 4.843 ± 2.308
1.513GlyPro: 1.513 ± 0.607
1.816GlyGln: 1.816 ± 0.975
3.632GlyArg: 3.632 ± 0.629
4.237GlySer: 4.237 ± 0.785
1.816GlyThr: 1.816 ± 0.746
4.54GlyVal: 4.54 ± 0.936
0.303GlyTrp: 0.303 ± 0.396
1.513GlyTyr: 1.513 ± 0.193
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.605HisCys: 0.605 ± 0.315
1.816HisAsp: 1.816 ± 0.975
2.119HisGlu: 2.119 ± 0.713
0.0HisPhe: 0.0 ± 0.0
1.816HisGly: 1.816 ± 0.975
0.605HisHis: 0.605 ± 0.336
2.119HisIle: 2.119 ± 0.689
1.513HisLys: 1.513 ± 0.449
3.027HisLeu: 3.027 ± 0.972
0.303HisMet: 0.303 ± 0.32
1.816HisAsn: 1.816 ± 1.457
0.605HisPro: 0.605 ± 1.024
0.0HisGln: 0.0 ± 0.0
1.211HisArg: 1.211 ± 0.806
2.421HisSer: 2.421 ± 0.461
0.303HisThr: 0.303 ± 0.512
0.605HisVal: 0.605 ± 0.336
0.0HisTrp: 0.0 ± 0.0
0.605HisTyr: 0.605 ± 0.92
0.0HisXaa: 0.0 ± 0.0
Ile
2.119IleAla: 2.119 ± 0.816
1.211IleCys: 1.211 ± 0.741
2.421IleAsp: 2.421 ± 0.97
2.421IleGlu: 2.421 ± 0.398
1.816IlePhe: 1.816 ± 0.572
3.935IleGly: 3.935 ± 0.371
1.211IleHis: 1.211 ± 0.363
2.724IleIle: 2.724 ± 0.942
4.843IleLys: 4.843 ± 1.573
6.053IleLeu: 6.053 ± 1.496
2.724IleMet: 2.724 ± 0.91
3.935IleAsn: 3.935 ± 1.594
1.816IlePro: 1.816 ± 0.75
1.211IleGln: 1.211 ± 0.363
2.724IleArg: 2.724 ± 0.426
5.448IleSer: 5.448 ± 1.561
3.632IleThr: 3.632 ± 1.584
1.816IleVal: 1.816 ± 0.146
0.908IleTrp: 0.908 ± 0.315
0.908IleTyr: 0.908 ± 0.504
0.0IleXaa: 0.0 ± 0.0
Lys
2.421LysAla: 2.421 ± 0.791
2.421LysCys: 2.421 ± 1.441
3.329LysAsp: 3.329 ± 0.993
5.448LysGlu: 5.448 ± 0.772
5.448LysPhe: 5.448 ± 1.561
3.027LysGly: 3.027 ± 0.435
1.211LysHis: 1.211 ± 0.393
3.935LysIle: 3.935 ± 1.22
4.54LysLys: 4.54 ± 1.656
9.988LysLeu: 9.988 ± 2.117
0.908LysMet: 0.908 ± 0.315
4.843LysAsn: 4.843 ± 0.657
2.119LysPro: 2.119 ± 1.399
1.513LysGln: 1.513 ± 0.796
3.632LysArg: 3.632 ± 1.01
7.264LysSer: 7.264 ± 1.652
5.448LysThr: 5.448 ± 1.662
2.724LysVal: 2.724 ± 0.942
1.211LysTrp: 1.211 ± 0.671
1.513LysTyr: 1.513 ± 0.732
0.0LysXaa: 0.0 ± 0.0
Leu
4.843LeuAla: 4.843 ± 0.657
4.843LeuCys: 4.843 ± 0.464
7.869LeuAsp: 7.869 ± 2.096
8.172LeuGlu: 8.172 ± 2.118
4.54LeuPhe: 4.54 ± 1.579
5.145LeuGly: 5.145 ± 1.832
1.816LeuHis: 1.816 ± 0.69
8.777LeuIle: 8.777 ± 2.206
10.291LeuLys: 10.291 ± 0.59
13.317LeuLeu: 13.317 ± 2.163
3.935LeuMet: 3.935 ± 1.437
6.659LeuAsn: 6.659 ± 1.008
3.027LeuPro: 3.027 ± 0.893
4.54LeuGln: 4.54 ± 0.551
6.053LeuArg: 6.053 ± 1.402
11.501LeuSer: 11.501 ± 2.596
7.869LeuThr: 7.869 ± 1.731
6.961LeuVal: 6.961 ± 0.565
0.605LeuTrp: 0.605 ± 0.403
3.935LeuTyr: 3.935 ± 1.09
0.0LeuXaa: 0.0 ± 0.0
Met
1.816MetAla: 1.816 ± 0.69
0.605MetCys: 0.605 ± 0.336
0.605MetAsp: 0.605 ± 0.315
0.908MetGlu: 0.908 ± 0.504
1.211MetPhe: 1.211 ± 0.791
4.237MetGly: 4.237 ± 0.669
0.303MetHis: 0.303 ± 0.168
0.908MetIle: 0.908 ± 0.504
1.816MetLys: 1.816 ± 0.146
3.632MetLeu: 3.632 ± 2.413
1.513MetMet: 1.513 ± 0.642
0.605MetAsn: 0.605 ± 0.336
0.908MetPro: 0.908 ± 0.925
0.605MetGln: 0.605 ± 0.315
2.119MetArg: 2.119 ± 0.393
2.421MetSer: 2.421 ± 0.97
1.816MetThr: 1.816 ± 0.146
3.027MetVal: 3.027 ± 0.854
0.605MetTrp: 0.605 ± 0.336
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.211AsnAla: 1.211 ± 0.806
1.211AsnCys: 1.211 ± 0.836
1.816AsnAsp: 1.816 ± 1.305
2.724AsnGlu: 2.724 ± 1.035
3.027AsnPhe: 3.027 ± 1.001
2.119AsnGly: 2.119 ± 0.692
1.513AsnHis: 1.513 ± 2.434
1.816AsnIle: 1.816 ± 1.305
3.027AsnLys: 3.027 ± 1.027
9.988AsnLeu: 9.988 ± 0.292
1.211AsnMet: 1.211 ± 0.949
2.119AsnAsn: 2.119 ± 0.548
2.724AsnPro: 2.724 ± 0.832
3.027AsnGln: 3.027 ± 0.722
2.119AsnArg: 2.119 ± 0.816
3.329AsnSer: 3.329 ± 1.342
3.935AsnThr: 3.935 ± 1.594
3.935AsnVal: 3.935 ± 0.922
0.303AsnTrp: 0.303 ± 0.168
1.513AsnTyr: 1.513 ± 0.732
0.0AsnXaa: 0.0 ± 0.0
Pro
2.119ProAla: 2.119 ± 0.915
0.0ProCys: 0.0 ± 0.0
2.119ProAsp: 2.119 ± 0.393
1.816ProGlu: 1.816 ± 1.457
1.513ProPhe: 1.513 ± 0.839
1.513ProGly: 1.513 ± 0.642
1.211ProHis: 1.211 ± 0.791
1.211ProIle: 1.211 ± 0.363
3.329ProLys: 3.329 ± 0.958
3.027ProLeu: 3.027 ± 0.897
0.605ProMet: 0.605 ± 0.92
1.211ProAsn: 1.211 ± 0.791
0.908ProPro: 0.908 ± 0.823
1.211ProGln: 1.211 ± 1.087
1.211ProArg: 1.211 ± 0.806
3.632ProSer: 3.632 ± 1.312
3.329ProThr: 3.329 ± 1.814
1.211ProVal: 1.211 ± 0.791
0.0ProTrp: 0.0 ± 0.0
1.513ProTyr: 1.513 ± 0.642
0.0ProXaa: 0.0 ± 0.0
Gln
2.421GlnAla: 2.421 ± 0.816
0.605GlnCys: 0.605 ± 0.403
1.211GlnAsp: 1.211 ± 0.393
1.816GlnGlu: 1.816 ± 0.66
1.211GlnPhe: 1.211 ± 0.393
3.935GlnGly: 3.935 ± 1.437
0.303GlnHis: 0.303 ± 0.168
3.027GlnIle: 3.027 ± 0.722
2.724GlnLys: 2.724 ± 0.59
2.119GlnLeu: 2.119 ± 1.639
1.816GlnMet: 1.816 ± 0.66
0.605GlnAsn: 0.605 ± 0.403
1.513GlnPro: 1.513 ± 1.369
1.211GlnGln: 1.211 ± 1.087
1.513GlnArg: 1.513 ± 0.193
3.329GlnSer: 3.329 ± 0.993
1.816GlnThr: 1.816 ± 0.629
2.724GlnVal: 2.724 ± 0.944
0.605GlnTrp: 0.605 ± 0.336
0.303GlnTyr: 0.303 ± 0.168
0.0GlnXaa: 0.0 ± 0.0
Arg
2.119ArgAla: 2.119 ± 0.713
1.816ArgCys: 1.816 ± 1.622
2.421ArgAsp: 2.421 ± 0.787
2.119ArgGlu: 2.119 ± 0.393
2.421ArgPhe: 2.421 ± 1.343
3.027ArgGly: 3.027 ± 0.854
2.119ArgHis: 2.119 ± 1.639
0.605ArgIle: 0.605 ± 0.336
2.724ArgLys: 2.724 ± 1.227
7.567ArgLeu: 7.567 ± 0.316
0.303ArgMet: 0.303 ± 0.168
2.119ArgAsn: 2.119 ± 0.548
1.211ArgPro: 1.211 ± 0.806
1.816ArgGln: 1.816 ± 1.084
3.027ArgArg: 3.027 ± 1.033
5.751ArgSer: 5.751 ± 1.521
2.119ArgThr: 2.119 ± 0.833
3.027ArgVal: 3.027 ± 0.435
1.211ArgTrp: 1.211 ± 0.331
2.119ArgTyr: 2.119 ± 0.713
0.0ArgXaa: 0.0 ± 0.0
Ser
3.329SerAla: 3.329 ± 1.29
1.816SerCys: 1.816 ± 1.21
5.751SerAsp: 5.751 ± 1.391
6.961SerGlu: 6.961 ± 1.659
4.237SerPhe: 4.237 ± 0.811
2.724SerGly: 2.724 ± 1.431
1.816SerHis: 1.816 ± 0.505
5.145SerIle: 5.145 ± 1.498
7.869SerLys: 7.869 ± 2.941
10.291SerLeu: 10.291 ± 0.916
1.816SerMet: 1.816 ± 0.66
4.237SerAsn: 4.237 ± 1.118
1.816SerPro: 1.816 ± 0.66
2.119SerGln: 2.119 ± 1.604
5.448SerArg: 5.448 ± 1.117
7.264SerSer: 7.264 ± 2.383
4.54SerThr: 4.54 ± 0.536
5.145SerVal: 5.145 ± 1.401
0.605SerTrp: 0.605 ± 0.315
3.632SerTyr: 3.632 ± 1.053
0.0SerXaa: 0.0 ± 0.0
Thr
1.816ThrAla: 1.816 ± 0.146
0.605ThrCys: 0.605 ± 0.336
3.935ThrAsp: 3.935 ± 0.843
3.632ThrGlu: 3.632 ± 0.703
1.816ThrPhe: 1.816 ± 0.572
2.421ThrGly: 2.421 ± 1.499
1.513ThrHis: 1.513 ± 1.445
3.329ThrIle: 3.329 ± 1.885
4.843ThrLys: 4.843 ± 0.868
5.751ThrLeu: 5.751 ± 1.691
2.119ThrMet: 2.119 ± 1.418
2.119ThrAsn: 2.119 ± 1.175
3.632ThrPro: 3.632 ± 0.532
1.211ThrGln: 1.211 ± 0.631
3.935ThrArg: 3.935 ± 0.636
4.54ThrSer: 4.54 ± 1.086
3.027ThrThr: 3.027 ± 1.768
4.237ThrVal: 4.237 ± 0.669
1.513ThrTrp: 1.513 ± 2.09
0.605ThrTyr: 0.605 ± 0.336
0.0ThrXaa: 0.0 ± 0.0
Val
3.329ValAla: 3.329 ± 0.56
1.211ValCys: 1.211 ± 0.671
4.843ValAsp: 4.843 ± 1.019
3.027ValGlu: 3.027 ± 1.293
1.816ValPhe: 1.816 ± 0.69
3.632ValGly: 3.632 ± 0.703
0.908ValHis: 0.908 ± 0.345
2.421ValIle: 2.421 ± 0.816
7.264ValLys: 7.264 ± 1.507
6.053ValLeu: 6.053 ± 1.26
1.816ValMet: 1.816 ± 0.572
4.237ValAsn: 4.237 ± 2.26
3.027ValPro: 3.027 ± 0.881
2.421ValGln: 2.421 ± 0.774
2.119ValArg: 2.119 ± 0.249
5.751ValSer: 5.751 ± 1.853
3.329ValThr: 3.329 ± 0.792
5.448ValVal: 5.448 ± 1.772
0.303ValTrp: 0.303 ± 0.396
0.908ValTyr: 0.908 ± 0.315
0.0ValXaa: 0.0 ± 0.0
Trp
0.605TrpAla: 0.605 ± 0.336
0.303TrpCys: 0.303 ± 0.168
0.605TrpAsp: 0.605 ± 0.315
0.303TrpGlu: 0.303 ± 0.396
0.605TrpPhe: 0.605 ± 0.74
0.605TrpGly: 0.605 ± 0.403
0.605TrpHis: 0.605 ± 0.403
0.605TrpIle: 0.605 ± 0.65
0.908TrpLys: 0.908 ± 0.345
2.119TrpLeu: 2.119 ± 0.812
0.605TrpMet: 0.605 ± 1.024
0.303TrpAsn: 0.303 ± 0.168
0.605TrpPro: 0.605 ± 0.315
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.211TrpSer: 1.211 ± 0.949
0.605TrpThr: 0.605 ± 0.403
0.605TrpVal: 0.605 ± 0.403
0.0TrpTrp: 0.0 ± 0.0
0.605TrpTyr: 0.605 ± 0.403
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.908TyrAla: 0.908 ± 0.504
1.816TyrCys: 1.816 ± 0.69
2.119TyrAsp: 2.119 ± 0.713
1.211TyrGlu: 1.211 ± 0.393
0.908TyrPhe: 0.908 ± 0.504
1.513TyrGly: 1.513 ± 0.712
0.303TyrHis: 0.303 ± 0.168
1.816TyrIle: 1.816 ± 0.505
1.513TyrLys: 1.513 ± 0.517
3.935TyrLeu: 3.935 ± 0.944
0.605TyrMet: 0.605 ± 0.336
1.816TyrAsn: 1.816 ± 0.66
0.908TyrPro: 0.908 ± 0.906
0.908TyrGln: 0.908 ± 0.315
0.605TyrArg: 0.605 ± 0.315
1.211TyrSer: 1.211 ± 0.671
1.816TyrThr: 1.816 ± 0.146
2.119TyrVal: 2.119 ± 0.249
0.0TyrTrp: 0.0 ± 0.0
0.303TyrTyr: 0.303 ± 0.512
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3305 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski