Amino acid dipepetide frequency for Xinzhou dimarhabdovirus virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.033AlaAla: 1.033 ± 0.529
0.689AlaCys: 0.689 ± 1.066
2.411AlaAsp: 2.411 ± 0.728
1.378AlaGlu: 1.378 ± 0.978
0.689AlaPhe: 0.689 ± 1.021
2.411AlaGly: 2.411 ± 1.233
0.344AlaHis: 0.344 ± 0.176
3.789AlaIle: 3.789 ± 1.414
1.033AlaLys: 1.033 ± 0.529
4.823AlaLeu: 4.823 ± 1.284
0.344AlaMet: 0.344 ± 0.176
1.033AlaAsn: 1.033 ± 0.529
1.722AlaPro: 1.722 ± 2.092
2.067AlaGln: 2.067 ± 1.057
1.378AlaArg: 1.378 ± 0.705
2.411AlaSer: 2.411 ± 0.678
1.378AlaThr: 1.378 ± 0.978
3.1AlaVal: 3.1 ± 0.79
0.689AlaTrp: 0.689 ± 1.021
3.445AlaTyr: 3.445 ± 3.246
0.0AlaXaa: 0.0 ± 0.0
Cys
0.689CysAla: 0.689 ± 0.352
0.344CysCys: 0.344 ± 0.533
0.689CysAsp: 0.689 ± 0.412
1.033CysGlu: 1.033 ± 0.936
1.033CysPhe: 1.033 ± 0.529
1.378CysGly: 1.378 ± 0.824
0.344CysHis: 0.344 ± 0.533
0.344CysIle: 0.344 ± 0.533
1.378CysLys: 1.378 ± 0.705
2.067CysLeu: 2.067 ± 1.956
0.0CysMet: 0.0 ± 0.0
0.344CysAsn: 0.344 ± 1.136
0.689CysPro: 0.689 ± 1.066
1.033CysGln: 1.033 ± 1.135
1.378CysArg: 1.378 ± 2.043
1.033CysSer: 1.033 ± 0.529
1.033CysThr: 1.033 ± 0.343
1.722CysVal: 1.722 ± 0.738
1.033CysTrp: 1.033 ± 0.529
1.033CysTyr: 1.033 ± 0.529
0.0CysXaa: 0.0 ± 0.0
Asp
1.722AspAla: 1.722 ± 0.816
0.344AspCys: 0.344 ± 0.533
1.722AspAsp: 1.722 ± 0.881
4.823AspGlu: 4.823 ± 1.909
2.411AspPhe: 2.411 ± 0.678
1.378AspGly: 1.378 ± 0.824
1.722AspHis: 1.722 ± 1.345
2.411AspIle: 2.411 ± 1.755
2.756AspLys: 2.756 ± 1.343
5.167AspLeu: 5.167 ± 1.856
1.378AspMet: 1.378 ± 0.824
2.411AspAsn: 2.411 ± 0.728
2.756AspPro: 2.756 ± 0.496
2.756AspGln: 2.756 ± 0.496
1.033AspArg: 1.033 ± 0.529
1.033AspSer: 1.033 ± 0.343
2.756AspThr: 2.756 ± 1.648
3.1AspVal: 3.1 ± 1.586
0.689AspTrp: 0.689 ± 0.412
2.756AspTyr: 2.756 ± 0.921
0.0AspXaa: 0.0 ± 0.0
Glu
3.1GluAla: 3.1 ± 0.579
1.378GluCys: 1.378 ± 0.358
3.1GluAsp: 3.1 ± 0.79
4.478GluGlu: 4.478 ± 2.291
4.134GluPhe: 4.134 ± 1.563
4.478GluGly: 4.478 ± 1.353
0.344GluHis: 0.344 ± 0.176
5.856GluIle: 5.856 ± 0.857
1.722GluLys: 1.722 ± 0.828
7.578GluLeu: 7.578 ± 1.769
2.067GluMet: 2.067 ± 1.1
2.411GluAsn: 2.411 ± 1.233
2.067GluPro: 2.067 ± 0.691
1.378GluGln: 1.378 ± 0.358
3.789GluArg: 3.789 ± 1.938
6.2GluSer: 6.2 ± 2.36
3.1GluThr: 3.1 ± 0.579
3.445GluVal: 3.445 ± 0.513
0.689GluTrp: 0.689 ± 0.352
1.378GluTyr: 1.378 ± 0.358
0.0GluXaa: 0.0 ± 0.0
Phe
1.722PheAla: 1.722 ± 0.816
1.033PheCys: 1.033 ± 1.135
2.411PheAsp: 2.411 ± 0.678
2.756PheGlu: 2.756 ± 1.41
1.378PhePhe: 1.378 ± 0.978
1.033PheGly: 1.033 ± 0.343
2.756PheHis: 2.756 ± 1.41
1.378PheIle: 1.378 ± 0.824
2.067PheLys: 2.067 ± 0.691
6.2PheLeu: 6.2 ± 2.107
1.378PheMet: 1.378 ± 0.705
1.722PheAsn: 1.722 ± 0.881
1.722PhePro: 1.722 ± 0.828
1.378PheGln: 1.378 ± 0.824
2.411PheArg: 2.411 ± 0.851
4.134PheSer: 4.134 ± 1.381
2.756PheThr: 2.756 ± 0.496
3.445PheVal: 3.445 ± 1.222
1.033PheTrp: 1.033 ± 0.343
1.378PheTyr: 1.378 ± 0.705
0.0PheXaa: 0.0 ± 0.0
Gly
2.067GlyAla: 2.067 ± 0.686
0.344GlyCys: 0.344 ± 0.176
1.722GlyAsp: 1.722 ± 0.881
3.445GlyGlu: 3.445 ± 2.04
3.445GlyPhe: 3.445 ± 1.222
3.445GlyGly: 3.445 ± 1.475
1.378GlyHis: 1.378 ± 0.978
3.1GlyIle: 3.1 ± 2.488
4.134GlyLys: 4.134 ± 1.373
9.645GlyLeu: 9.645 ± 0.084
1.033GlyMet: 1.033 ± 0.529
1.033GlyAsn: 1.033 ± 0.529
3.789GlyPro: 3.789 ± 1.489
1.378GlyGln: 1.378 ± 0.705
2.411GlyArg: 2.411 ± 0.925
4.478GlySer: 4.478 ± 1.547
3.445GlyThr: 3.445 ± 1.222
5.512GlyVal: 5.512 ± 4.302
0.344GlyTrp: 0.344 ± 0.176
2.067GlyTyr: 2.067 ± 1.236
0.0GlyXaa: 0.0 ± 0.0
His
0.689HisAla: 0.689 ± 1.021
0.689HisCys: 0.689 ± 0.412
0.689HisAsp: 0.689 ± 0.352
2.067HisGlu: 2.067 ± 0.815
1.033HisPhe: 1.033 ± 0.924
1.378HisGly: 1.378 ± 0.824
1.722HisHis: 1.722 ± 0.881
2.756HisIle: 2.756 ± 0.921
1.033HisLys: 1.033 ± 1.135
3.1HisLeu: 3.1 ± 1.554
1.033HisMet: 1.033 ± 0.529
0.0HisAsn: 0.0 ± 0.0
2.067HisPro: 2.067 ± 0.691
0.344HisGln: 0.344 ± 0.176
4.478HisArg: 4.478 ± 1.142
1.378HisSer: 1.378 ± 0.854
1.033HisThr: 1.033 ± 0.936
1.722HisVal: 1.722 ± 1.275
0.344HisTrp: 0.344 ± 0.176
1.722HisTyr: 1.722 ± 0.738
0.0HisXaa: 0.0 ± 0.0
Ile
3.1IleAla: 3.1 ± 0.579
1.378IleCys: 1.378 ± 0.824
3.789IleAsp: 3.789 ± 1.019
3.789IleGlu: 3.789 ± 1.019
3.789IlePhe: 3.789 ± 1.019
2.756IleGly: 2.756 ± 0.752
2.067IleHis: 2.067 ± 1.236
4.823IleIle: 4.823 ± 0.338
4.478IleLys: 4.478 ± 1.142
7.234IleLeu: 7.234 ± 2.282
1.378IleMet: 1.378 ± 0.358
3.1IleAsn: 3.1 ± 0.79
4.823IlePro: 4.823 ± 1.357
1.378IleGln: 1.378 ± 0.358
3.445IleArg: 3.445 ± 0.895
5.856IleSer: 5.856 ± 0.634
4.478IleThr: 4.478 ± 1.303
3.445IleVal: 3.445 ± 0.513
1.378IleTrp: 1.378 ± 0.358
3.789IleTyr: 3.789 ± 1.029
0.0IleXaa: 0.0 ± 0.0
Lys
2.067LysAla: 2.067 ± 0.691
2.067LysCys: 2.067 ± 1.1
2.756LysAsp: 2.756 ± 1.648
3.789LysGlu: 3.789 ± 1.392
2.067LysPhe: 2.067 ± 1.236
3.789LysGly: 3.789 ± 1.019
1.722LysHis: 1.722 ± 0.816
3.1LysIle: 3.1 ± 0.79
2.411LysLys: 2.411 ± 0.678
5.856LysLeu: 5.856 ± 1.595
1.033LysMet: 1.033 ± 0.343
1.033LysAsn: 1.033 ± 0.529
2.411LysPro: 2.411 ± 0.925
1.722LysGln: 1.722 ± 0.738
3.445LysArg: 3.445 ± 1.131
2.756LysSer: 2.756 ± 2.861
3.445LysThr: 3.445 ± 2.725
2.411LysVal: 2.411 ± 0.678
1.722LysTrp: 1.722 ± 0.881
1.722LysTyr: 1.722 ± 0.828
0.0LysXaa: 0.0 ± 0.0
Leu
2.411LeuAla: 2.411 ± 0.575
2.411LeuCys: 2.411 ± 0.925
3.789LeuAsp: 3.789 ± 1.621
4.823LeuGlu: 4.823 ± 1.015
4.823LeuPhe: 4.823 ± 1.383
8.612LeuGly: 8.612 ± 3.17
2.411LeuHis: 2.411 ± 0.925
9.645LeuIle: 9.645 ± 3.328
4.134LeuLys: 4.134 ± 1.4
7.923LeuLeu: 7.923 ± 2.503
4.134LeuMet: 4.134 ± 1.4
5.512LeuAsn: 5.512 ± 0.684
3.445LeuPro: 3.445 ± 1.222
3.1LeuGln: 3.1 ± 1.053
5.856LeuArg: 5.856 ± 1.941
13.434LeuSer: 13.434 ± 1.991
5.512LeuThr: 5.512 ± 1.078
5.856LeuVal: 5.856 ± 2.609
1.378LeuTrp: 1.378 ± 0.358
6.545LeuTyr: 6.545 ± 1.848
0.0LeuXaa: 0.0 ± 0.0
Met
1.033MetAla: 1.033 ± 0.343
0.344MetCys: 0.344 ± 0.533
0.689MetAsp: 0.689 ± 0.352
1.722MetGlu: 1.722 ± 0.816
0.0MetPhe: 0.0 ± 0.0
1.378MetGly: 1.378 ± 0.854
0.344MetHis: 0.344 ± 0.533
2.411MetIle: 2.411 ± 0.728
0.689MetLys: 0.689 ± 1.021
1.378MetLeu: 1.378 ± 0.358
0.344MetMet: 0.344 ± 0.176
2.756MetAsn: 2.756 ± 0.715
0.344MetPro: 0.344 ± 0.176
0.344MetGln: 0.344 ± 0.176
1.722MetArg: 1.722 ± 0.881
3.1MetSer: 3.1 ± 1.053
3.1MetThr: 3.1 ± 1.053
1.033MetVal: 1.033 ± 0.529
0.0MetTrp: 0.0 ± 0.0
1.033MetTyr: 1.033 ± 0.343
0.0MetXaa: 0.0 ± 0.0
Asn
1.033AsnAla: 1.033 ± 0.529
0.344AsnCys: 0.344 ± 0.176
1.378AsnAsp: 1.378 ± 0.705
2.756AsnGlu: 2.756 ± 1.074
2.067AsnPhe: 2.067 ± 0.815
2.067AsnGly: 2.067 ± 0.578
1.722AsnHis: 1.722 ± 0.447
3.445AsnIle: 3.445 ± 1.762
2.756AsnLys: 2.756 ± 0.496
3.789AsnLeu: 3.789 ± 1.938
2.411AsnMet: 2.411 ± 0.682
3.1AsnAsn: 3.1 ± 1.586
2.756AsnPro: 2.756 ± 0.715
1.378AsnGln: 1.378 ± 0.854
1.722AsnArg: 1.722 ± 0.816
5.856AsnSer: 5.856 ± 2.179
2.411AsnThr: 2.411 ± 0.728
2.756AsnVal: 2.756 ± 0.888
1.033AsnTrp: 1.033 ± 0.924
3.1AsnTyr: 3.1 ± 1.586
0.0AsnXaa: 0.0 ± 0.0
Pro
2.067ProAla: 2.067 ± 1.236
0.344ProCys: 0.344 ± 0.176
1.378ProAsp: 1.378 ± 0.705
2.067ProGlu: 2.067 ± 1.956
1.033ProPhe: 1.033 ± 0.924
1.722ProGly: 1.722 ± 0.738
1.378ProHis: 1.378 ± 0.358
2.756ProIle: 2.756 ± 0.888
2.067ProLys: 2.067 ± 0.691
6.2ProLeu: 6.2 ± 1.158
0.344ProMet: 0.344 ± 0.176
1.722ProAsn: 1.722 ± 0.738
3.445ProPro: 3.445 ± 0.41
1.033ProGln: 1.033 ± 0.529
2.067ProArg: 2.067 ± 0.686
4.134ProSer: 4.134 ± 1.4
3.789ProThr: 3.789 ± 1.489
3.445ProVal: 3.445 ± 0.513
1.378ProTrp: 1.378 ± 0.824
3.1ProTyr: 3.1 ± 1.053
0.0ProXaa: 0.0 ± 0.0
Gln
0.689GlnAla: 0.689 ± 0.352
1.033GlnCys: 1.033 ± 0.924
2.067GlnAsp: 2.067 ± 1.236
2.067GlnGlu: 2.067 ± 0.686
1.033GlnPhe: 1.033 ± 0.529
2.411GlnGly: 2.411 ± 1.77
1.722GlnHis: 1.722 ± 1.275
1.378GlnIle: 1.378 ± 0.854
0.689GlnLys: 0.689 ± 0.352
5.856GlnLeu: 5.856 ± 0.634
1.033GlnMet: 1.033 ± 0.529
1.033GlnAsn: 1.033 ± 0.529
0.344GlnPro: 0.344 ± 0.533
1.033GlnGln: 1.033 ± 0.343
1.033GlnArg: 1.033 ± 0.529
3.1GlnSer: 3.1 ± 1.66
2.067GlnThr: 2.067 ± 1.236
1.033GlnVal: 1.033 ± 0.529
1.033GlnTrp: 1.033 ± 0.343
0.344GlnTyr: 0.344 ± 1.136
0.0GlnXaa: 0.0 ± 0.0
Arg
0.689ArgAla: 0.689 ± 0.412
0.689ArgCys: 0.689 ± 0.412
1.722ArgAsp: 1.722 ± 2.092
4.478ArgGlu: 4.478 ± 2.291
3.445ArgPhe: 3.445 ± 1.131
3.1ArgGly: 3.1 ± 0.473
1.378ArgHis: 1.378 ± 0.705
3.789ArgIle: 3.789 ± 1.26
2.067ArgLys: 2.067 ± 0.686
4.823ArgLeu: 4.823 ± 1.234
0.689ArgMet: 0.689 ± 0.352
2.756ArgAsn: 2.756 ± 1.41
1.378ArgPro: 1.378 ± 0.705
2.756ArgGln: 2.756 ± 0.715
0.689ArgArg: 0.689 ± 0.412
5.512ArgSer: 5.512 ± 1.43
3.789ArgThr: 3.789 ± 1.392
3.1ArgVal: 3.1 ± 1.803
1.378ArgTrp: 1.378 ± 0.705
1.378ArgTyr: 1.378 ± 0.978
0.0ArgXaa: 0.0 ± 0.0
Ser
4.823SerAla: 4.823 ± 2.337
1.722SerCys: 1.722 ± 0.738
4.823SerAsp: 4.823 ± 0.338
4.823SerGlu: 4.823 ± 1.7
2.756SerPhe: 2.756 ± 1.41
8.267SerGly: 8.267 ± 3.03
2.756SerHis: 2.756 ± 2.861
6.2SerIle: 6.2 ± 0.48
6.2SerLys: 6.2 ± 1.983
8.612SerLeu: 8.612 ± 2.269
1.722SerMet: 1.722 ± 0.692
3.789SerAsn: 3.789 ± 1.392
3.1SerPro: 3.1 ± 1.053
2.067SerGln: 2.067 ± 3.222
4.823SerArg: 4.823 ± 1.7
11.368SerSer: 11.368 ± 3.042
4.478SerThr: 4.478 ± 1.303
6.2SerVal: 6.2 ± 2.788
3.445SerTrp: 3.445 ± 1.656
3.1SerTyr: 3.1 ± 1.586
0.0SerXaa: 0.0 ± 0.0
Thr
2.411ThrAla: 2.411 ± 0.728
0.689ThrCys: 0.689 ± 0.352
3.445ThrAsp: 3.445 ± 2.06
4.478ThrGlu: 4.478 ± 1.544
2.411ThrPhe: 2.411 ± 0.728
1.722ThrGly: 1.722 ± 0.881
2.067ThrHis: 2.067 ± 0.578
4.478ThrIle: 4.478 ± 1.846
4.478ThrLys: 4.478 ± 1.811
5.167ThrLeu: 5.167 ± 0.224
0.689ThrMet: 0.689 ± 0.412
3.1ThrAsn: 3.1 ± 1.66
3.1ThrPro: 3.1 ± 0.79
1.033ThrGln: 1.033 ± 0.924
2.756ThrArg: 2.756 ± 0.496
7.578ThrSer: 7.578 ± 1.209
3.445ThrThr: 3.445 ± 1.222
3.789ThrVal: 3.789 ± 0.824
2.067ThrTrp: 2.067 ± 0.578
1.033ThrTyr: 1.033 ± 0.529
0.0ThrXaa: 0.0 ± 0.0
Val
2.756ValAla: 2.756 ± 0.752
1.722ValCys: 1.722 ± 0.816
2.411ValAsp: 2.411 ± 0.728
3.1ValGlu: 3.1 ± 1.016
2.067ValPhe: 2.067 ± 1.692
2.411ValGly: 2.411 ± 0.678
0.344ValHis: 0.344 ± 0.533
3.445ValIle: 3.445 ± 2.06
3.789ValLys: 3.789 ± 0.824
6.2ValLeu: 6.2 ± 1.151
1.378ValMet: 1.378 ± 0.665
5.167ValAsn: 5.167 ± 1.764
2.067ValPro: 2.067 ± 0.815
3.1ValGln: 3.1 ± 1.169
3.1ValArg: 3.1 ± 1.053
7.923ValSer: 7.923 ± 4.159
4.134ValThr: 4.134 ± 1.156
4.478ValVal: 4.478 ± 1.168
0.689ValTrp: 0.689 ± 0.412
1.722ValTyr: 1.722 ± 0.738
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
2.411TrpAsp: 2.411 ± 0.575
1.722TrpGlu: 1.722 ± 0.447
1.378TrpPhe: 1.378 ± 0.705
1.033TrpGly: 1.033 ± 0.529
0.689TrpHis: 0.689 ± 0.412
2.756TrpIle: 2.756 ± 1.648
1.033TrpLys: 1.033 ± 0.529
0.344TrpLeu: 0.344 ± 0.176
0.344TrpMet: 0.344 ± 0.176
3.1TrpAsn: 3.1 ± 1.586
0.344TrpPro: 0.344 ± 0.176
0.689TrpGln: 0.689 ± 0.412
0.0TrpArg: 0.0 ± 0.0
1.722TrpSer: 1.722 ± 0.816
1.033TrpThr: 1.033 ± 1.626
1.378TrpVal: 1.378 ± 0.978
1.033TrpTrp: 1.033 ± 0.924
0.344TrpTyr: 0.344 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.067TyrAla: 2.067 ± 0.578
1.033TyrCys: 1.033 ± 0.343
1.722TyrAsp: 1.722 ± 0.447
3.1TyrGlu: 3.1 ± 1.169
3.1TyrPhe: 3.1 ± 1.053
3.445TyrGly: 3.445 ± 1.497
2.067TyrHis: 2.067 ± 0.815
2.411TyrIle: 2.411 ± 0.678
2.411TyrLys: 2.411 ± 1.233
3.789TyrLeu: 3.789 ± 0.25
0.689TyrMet: 0.689 ± 0.352
2.756TyrAsn: 2.756 ± 0.888
3.1TyrPro: 3.1 ± 1.053
1.033TyrGln: 1.033 ± 0.924
2.067TyrArg: 2.067 ± 1.692
2.411TyrSer: 2.411 ± 0.678
2.756TyrThr: 2.756 ± 0.921
1.033TyrVal: 1.033 ± 0.529
0.0TyrTrp: 0.0 ± 0.0
1.378TyrTyr: 1.378 ± 0.705
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2904 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski