Amino acid dipeptide frequency for Capim virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
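As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the use of one-letter codes (the table uses three-letter codes) are assumptions made for this example; the exact normalization used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides.

    Hypothetical helper: pairs are counted within each protein only,
    so no dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not Capim virus data): "MKKLA" yields MK, KK, KL, LA; "KLA" yields KL, LA.
freqs = dipeptide_frequencies(["MKKLA", "KLA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because the order of residues matters, "KL" and "LK" are counted as different dipeptides, which matches the table below where, for example, IleLeu and LeuIle have separate entries.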

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
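The expected value above follows from simple arithmetic, which can be sketched as follows. The threshold used here (the uniform expectation for 20 standard amino acids) and the names `expected_per_mille` and `classify` are assumptions for illustration; the page does not state the exact rule behind the red and blue marking.

```python
STANDARD_AMINO_ACIDS = 20  # 21 if Xaa is counted as an extra residue


def expected_per_mille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=STANDARD_AMINO_ACIDS):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would be marked red
    if observed_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


print(expected_per_mille(20))            # 400 dipeptides
print(round(expected_per_mille(21), 3))  # 441 dipeptides
print(classify(9.704))                   # IleLeu value from the table
print(classify(0.255))                   # AlaMet value from the table
```

With 20 amino acids the expectation is 2.5‰ per dipeptide; with Xaa included it drops to roughly 2.27‰.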

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.788AlaAla: 1.788 ± 2.8
2.043AlaCys: 2.043 ± 0.772
3.064AlaAsp: 3.064 ± 0.406
3.32AlaGlu: 3.32 ± 4.301
2.298AlaPhe: 2.298 ± 0.583
1.277AlaGly: 1.277 ± 0.919
1.021AlaHis: 1.021 ± 0.329
4.597AlaIle: 4.597 ± 0.153
5.873AlaLys: 5.873 ± 0.962
4.086AlaLeu: 4.086 ± 0.294
0.255AlaMet: 0.255 ± 0.22
4.852AlaAsn: 4.852 ± 1.823
0.766AlaPro: 0.766 ± 0.822
1.021AlaGln: 1.021 ± 0.266
3.83AlaArg: 3.83 ± 2.071
2.298AlaSer: 2.298 ± 1.425
2.043AlaThr: 2.043 ± 0.536
2.554AlaVal: 2.554 ± 0.666
0.255AlaTrp: 0.255 ± 0.159
1.788AlaTyr: 1.788 ± 1.558
0.0AlaXaa: 0.0 ± 0.0
Cys
1.021CysAla: 1.021 ± 0.329
0.255CysCys: 0.255 ± 0.159
1.021CysAsp: 1.021 ± 0.54
2.554CysGlu: 2.554 ± 1.191
2.043CysPhe: 2.043 ± 0.772
2.809CysGly: 2.809 ± 1.405
1.021CysHis: 1.021 ± 0.54
2.298CysIle: 2.298 ± 0.703
2.298CysLys: 2.298 ± 1.297
1.532CysLeu: 1.532 ± 0.653
1.021CysMet: 1.021 ± 0.54
2.809CysAsn: 2.809 ± 0.665
1.021CysPro: 1.021 ± 0.266
1.021CysGln: 1.021 ± 0.54
1.021CysArg: 1.021 ± 0.878
2.043CysSer: 2.043 ± 0.479
1.788CysThr: 1.788 ± 0.865
1.277CysVal: 1.277 ± 0.757
0.255CysTrp: 0.255 ± 0.159
0.766CysTyr: 0.766 ± 0.327
0.0CysXaa: 0.0 ± 0.0
Asp
2.298AspAla: 2.298 ± 0.591
1.788AspCys: 1.788 ± 0.411
3.83AspAsp: 3.83 ± 0.174
4.852AspGlu: 4.852 ± 0.747
3.83AspPhe: 3.83 ± 1.225
1.021AspGly: 1.021 ± 0.266
0.255AspHis: 0.255 ± 0.22
4.852AspIle: 4.852 ± 1.112
2.809AspLys: 2.809 ± 0.844
6.895AspLeu: 6.895 ± 1.34
3.064AspMet: 3.064 ± 0.777
3.575AspAsn: 3.575 ± 0.822
1.788AspPro: 1.788 ± 0.789
1.021AspGln: 1.021 ± 0.266
1.788AspArg: 1.788 ± 1.112
0.511AspSer: 0.511 ± 0.439
3.32AspThr: 3.32 ± 1.225
3.575AspVal: 3.575 ± 2.115
0.0AspTrp: 0.0 ± 0.0
2.043AspTyr: 2.043 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
3.575GluAla: 3.575 ± 0.396
1.277GluCys: 1.277 ± 0.293
3.32GluAsp: 3.32 ± 1.225
2.809GluGlu: 2.809 ± 0.669
5.363GluPhe: 5.363 ± 1.969
2.298GluGly: 2.298 ± 0.56
2.043GluHis: 2.043 ± 0.735
7.661GluIle: 7.661 ± 1.768
5.363GluLys: 5.363 ± 1.723
6.639GluLeu: 6.639 ± 2.015
3.83GluMet: 3.83 ± 1.225
3.32GluAsn: 3.32 ± 0.606
2.043GluPro: 2.043 ± 0.946
2.298GluGln: 2.298 ± 0.537
4.086GluArg: 4.086 ± 2.15
5.873GluSer: 5.873 ± 1.262
2.554GluThr: 2.554 ± 0.712
2.554GluVal: 2.554 ± 1.458
0.255GluTrp: 0.255 ± 0.159
2.298GluTyr: 2.298 ± 0.583
0.0GluXaa: 0.0 ± 0.0
Phe
2.554PheAla: 2.554 ± 0.585
2.043PheCys: 2.043 ± 0.946
2.809PheAsp: 2.809 ± 0.665
3.064PheGlu: 3.064 ± 0.825
3.064PhePhe: 3.064 ± 1.345
3.575PheGly: 3.575 ± 2.666
1.532PheHis: 1.532 ± 0.673
3.83PheIle: 3.83 ± 0.174
4.341PheLys: 4.341 ± 1.463
5.873PheLeu: 5.873 ± 3.532
1.021PheMet: 1.021 ± 0.442
2.043PheAsn: 2.043 ± 0.658
1.021PhePro: 1.021 ± 0.742
1.277PheGln: 1.277 ± 0.479
3.575PheArg: 3.575 ± 1.033
4.341PheSer: 4.341 ± 1.463
2.554PheThr: 2.554 ± 0.41
2.554PheVal: 2.554 ± 1.381
0.511PheTrp: 0.511 ± 0.318
1.788PheTyr: 1.788 ± 1.194
0.0PheXaa: 0.0 ± 0.0
Gly
1.277GlyAla: 1.277 ± 0.69
2.809GlyCys: 2.809 ± 1.733
3.32GlyAsp: 3.32 ± 0.264
3.83GlyGlu: 3.83 ± 1.43
1.021GlyPhe: 1.021 ± 0.742
0.766GlyGly: 0.766 ± 0.822
0.766GlyHis: 0.766 ± 0.327
4.852GlyIle: 4.852 ± 2.023
2.043GlyLys: 2.043 ± 0.735
3.83GlyLeu: 3.83 ± 1.081
1.277GlyMet: 1.277 ± 0.287
3.064GlyAsn: 3.064 ± 0.5
1.021GlyPro: 1.021 ± 0.54
1.277GlyGln: 1.277 ± 0.794
1.277GlyArg: 1.277 ± 0.729
3.064GlySer: 3.064 ± 1.621
3.064GlyThr: 3.064 ± 1.405
3.064GlyVal: 3.064 ± 0.799
0.766GlyTrp: 0.766 ± 0.194
1.532GlyTyr: 1.532 ± 0.653
0.0GlyXaa: 0.0 ± 0.0
His
1.788HisAla: 1.788 ± 0.65
0.511HisCys: 0.511 ± 0.439
0.766HisAsp: 0.766 ± 0.327
0.255HisGlu: 0.255 ± 0.22
1.788HisPhe: 1.788 ± 0.789
1.277HisGly: 1.277 ± 0.479
0.511HisHis: 0.511 ± 0.133
0.511HisIle: 0.511 ± 0.133
2.043HisLys: 2.043 ± 0.479
1.021HisLeu: 1.021 ± 0.805
1.021HisMet: 1.021 ± 0.742
2.298HisAsn: 2.298 ± 0.827
1.021HisPro: 1.021 ± 0.266
0.511HisGln: 0.511 ± 0.133
0.766HisArg: 0.766 ± 0.822
1.532HisSer: 1.532 ± 0.399
1.788HisThr: 1.788 ± 0.574
1.277HisVal: 1.277 ± 0.479
0.511HisTrp: 0.511 ± 0.318
1.021HisTyr: 1.021 ± 0.54
0.0HisXaa: 0.0 ± 0.0
Ile
4.086IleAla: 4.086 ± 1.071
2.043IleCys: 2.043 ± 1.081
3.32IleAsp: 3.32 ± 1.422
6.129IleGlu: 6.129 ± 1.127
3.83IlePhe: 3.83 ± 1.298
3.83IleGly: 3.83 ± 0.238
1.788IleHis: 1.788 ± 0.516
6.895IleIle: 6.895 ± 1.89
5.107IleLys: 5.107 ± 1.415
9.704IleLeu: 9.704 ± 2.223
2.809IleMet: 2.809 ± 0.665
2.554IleAsn: 2.554 ± 0.957
3.575IlePro: 3.575 ± 1.284
3.83IleGln: 3.83 ± 1.172
4.597IleArg: 4.597 ± 2.178
8.172IleSer: 8.172 ± 1.576
5.873IleThr: 5.873 ± 0.962
4.852IleVal: 4.852 ± 1.29
0.511IleTrp: 0.511 ± 0.318
3.064IleTyr: 3.064 ± 1.405
0.0IleXaa: 0.0 ± 0.0
Lys
3.575LysAla: 3.575 ± 1.3
2.554LysCys: 2.554 ± 1.191
5.873LysAsp: 5.873 ± 1.453
6.639LysGlu: 6.639 ± 0.637
3.32LysPhe: 3.32 ± 1.225
4.597LysGly: 4.597 ± 1.165
1.532LysHis: 1.532 ± 0.633
6.895LysIle: 6.895 ± 1.577
4.852LysLys: 4.852 ± 0.716
7.15LysLeu: 7.15 ± 1.833
3.32LysMet: 3.32 ± 0.264
3.83LysAsn: 3.83 ± 0.584
2.298LysPro: 2.298 ± 0.703
3.575LysGln: 3.575 ± 1.618
2.298LysArg: 2.298 ± 0.806
4.852LysSer: 4.852 ± 1.112
4.597LysThr: 4.597 ± 0.498
3.575LysVal: 3.575 ± 0.932
0.511LysTrp: 0.511 ± 0.318
2.298LysTyr: 2.298 ± 0.703
0.0LysXaa: 0.0 ± 0.0
Leu
5.363LeuAla: 5.363 ± 4.098
2.043LeuCys: 2.043 ± 0.772
5.618LeuAsp: 5.618 ± 0.875
6.384LeuGlu: 6.384 ± 1.674
3.83LeuPhe: 3.83 ± 1.436
2.809LeuGly: 2.809 ± 0.344
3.064LeuHis: 3.064 ± 0.406
7.15LeuIle: 7.15 ± 1.833
8.938LeuLys: 8.938 ± 0.939
8.682LeuLeu: 8.682 ± 2.109
1.532LeuMet: 1.532 ± 0.633
5.618LeuAsn: 5.618 ± 1.94
3.575LeuPro: 3.575 ± 1.166
2.043LeuGln: 2.043 ± 0.946
3.064LeuArg: 3.064 ± 1.517
7.661LeuSer: 7.661 ± 0.475
5.873LeuThr: 5.873 ± 1.609
4.341LeuVal: 4.341 ± 1.463
0.255LeuTrp: 0.255 ± 0.159
3.064LeuTyr: 3.064 ± 0.777
0.0LeuXaa: 0.0 ± 0.0
Met
1.532MetAla: 1.532 ± 0.388
1.021MetCys: 1.021 ± 0.54
2.298MetAsp: 2.298 ± 0.806
1.532MetGlu: 1.532 ± 0.953
2.043MetPhe: 2.043 ± 0.746
2.298MetGly: 2.298 ± 0.591
0.255MetHis: 0.255 ± 0.159
2.554MetIle: 2.554 ± 0.456
3.32MetLys: 3.32 ± 0.264
4.597MetLeu: 4.597 ± 0.899
0.766MetMet: 0.766 ± 0.194
1.277MetAsn: 1.277 ± 0.448
1.277MetPro: 1.277 ± 0.293
0.766MetGln: 0.766 ± 0.194
1.021MetArg: 1.021 ± 0.329
2.809MetSer: 2.809 ± 0.713
2.298MetThr: 2.298 ± 0.583
0.511MetVal: 0.511 ± 0.318
0.0MetTrp: 0.0 ± 0.0
0.766MetTyr: 0.766 ± 0.194
0.0MetXaa: 0.0 ± 0.0
Asn
2.809AsnAla: 2.809 ± 0.713
1.021AsnCys: 1.021 ± 0.54
3.32AsnAsp: 3.32 ± 1.134
4.852AsnGlu: 4.852 ± 0.37
2.809AsnPhe: 2.809 ± 1.494
1.532AsnGly: 1.532 ± 0.633
2.043AsnHis: 2.043 ± 0.746
5.363AsnIle: 5.363 ± 0.823
4.086AsnLys: 4.086 ± 1.093
4.086AsnLeu: 4.086 ± 0.652
3.064AsnMet: 3.064 ± 0.777
3.575AsnAsn: 3.575 ± 1.085
3.32AsnPro: 3.32 ± 1.92
3.064AsnGln: 3.064 ± 1.578
3.064AsnArg: 3.064 ± 0.777
2.043AsnSer: 2.043 ± 0.479
4.341AsnThr: 4.341 ± 1.057
0.766AsnVal: 0.766 ± 0.477
1.277AsnTrp: 1.277 ± 0.794
3.32AsnTyr: 3.32 ± 1.422
0.0AsnXaa: 0.0 ± 0.0
Pro
2.043ProAla: 2.043 ± 0.946
0.255ProCys: 0.255 ± 0.159
2.043ProAsp: 2.043 ± 0.536
2.554ProGlu: 2.554 ± 0.957
1.532ProPhe: 1.532 ± 0.633
3.32ProGly: 3.32 ± 1.427
0.766ProHis: 0.766 ± 0.327
2.554ProIle: 2.554 ± 0.712
3.064ProLys: 3.064 ± 1.021
2.809ProLeu: 2.809 ± 0.873
0.511ProMet: 0.511 ± 0.858
2.043ProAsn: 2.043 ± 0.479
0.511ProPro: 0.511 ± 0.318
0.766ProGln: 0.766 ± 0.327
0.255ProArg: 0.255 ± 0.159
3.32ProSer: 3.32 ± 1.134
1.532ProThr: 1.532 ± 0.388
1.788ProVal: 1.788 ± 0.411
1.021ProTrp: 1.021 ± 0.742
1.021ProTyr: 1.021 ± 0.329
0.0ProXaa: 0.0 ± 0.0
Gln
1.532GlnAla: 1.532 ± 0.633
1.021GlnCys: 1.021 ± 0.54
1.277GlnAsp: 1.277 ± 0.794
1.532GlnGlu: 1.532 ± 0.388
1.277GlnPhe: 1.277 ± 0.479
1.021GlnGly: 1.021 ± 0.266
0.766GlnHis: 0.766 ± 0.194
3.064GlnIle: 3.064 ± 0.777
3.32GlnLys: 3.32 ± 1.225
3.32GlnLeu: 3.32 ± 1.134
1.021GlnMet: 1.021 ± 0.805
2.298GlnAsn: 2.298 ± 0.703
0.766GlnPro: 0.766 ± 0.194
1.277GlnGln: 1.277 ± 1.666
1.788GlnArg: 1.788 ± 0.789
2.554GlnSer: 2.554 ± 1.261
2.554GlnThr: 2.554 ± 0.707
0.511GlnVal: 0.511 ± 0.133
0.255GlnTrp: 0.255 ± 0.925
1.021GlnTyr: 1.021 ± 0.742
0.0GlnXaa: 0.0 ± 0.0
Arg
1.788ArgAla: 1.788 ± 0.692
1.788ArgCys: 1.788 ± 0.411
3.32ArgAsp: 3.32 ± 0.951
4.597ArgGlu: 4.597 ± 1.366
2.554ArgPhe: 2.554 ± 0.456
0.766ArgGly: 0.766 ± 0.327
1.021ArgHis: 1.021 ± 0.635
4.086ArgIle: 4.086 ± 1.082
3.064ArgLys: 3.064 ± 0.406
4.852ArgLeu: 4.852 ± 1.172
1.788ArgMet: 1.788 ± 0.789
2.809ArgAsn: 2.809 ± 2.323
0.766ArgPro: 0.766 ± 0.194
1.277ArgGln: 1.277 ± 0.69
2.043ArgArg: 2.043 ± 0.946
2.043ArgSer: 2.043 ± 0.946
1.788ArgThr: 1.788 ± 0.692
1.788ArgVal: 1.788 ± 0.692
0.255ArgTrp: 0.255 ± 0.22
2.043ArgTyr: 2.043 ± 0.658
0.0ArgXaa: 0.0 ± 0.0
Ser
3.83SerAla: 3.83 ± 0.94
1.021SerCys: 1.021 ± 0.266
3.83SerAsp: 3.83 ± 1.172
4.597SerGlu: 4.597 ± 0.153
4.086SerPhe: 4.086 ± 1.507
2.043SerGly: 2.043 ± 0.536
1.021SerHis: 1.021 ± 0.635
6.129SerIle: 6.129 ± 0.536
5.618SerLys: 5.618 ± 0.554
6.384SerLeu: 6.384 ± 1.024
1.788SerMet: 1.788 ± 0.516
2.554SerAsn: 2.554 ± 0.896
3.83SerPro: 3.83 ± 0.878
2.043SerGln: 2.043 ± 0.541
4.341SerArg: 4.341 ± 0.798
5.107SerSer: 5.107 ± 1.331
5.107SerThr: 5.107 ± 1.248
4.597SerVal: 4.597 ± 1.165
0.511SerTrp: 0.511 ± 0.318
2.043SerTyr: 2.043 ± 0.772
0.0SerXaa: 0.0 ± 0.0
Thr
4.597ThrAla: 4.597 ± 2.909
2.809ThrCys: 2.809 ± 1.733
2.298ThrAsp: 2.298 ± 0.537
2.298ThrGlu: 2.298 ± 0.806
3.32ThrPhe: 3.32 ± 2.164
2.298ThrGly: 2.298 ± 0.98
0.766ThrHis: 0.766 ± 0.327
6.129ThrIle: 6.129 ± 0.536
4.597ThrLys: 4.597 ± 1.061
2.554ThrLeu: 2.554 ± 1.373
1.788ThrMet: 1.788 ± 0.574
5.107ThrAsn: 5.107 ± 1.171
2.554ThrPro: 2.554 ± 0.621
1.788ThrGln: 1.788 ± 0.516
1.788ThrArg: 1.788 ± 0.516
4.086ThrSer: 4.086 ± 0.957
3.575ThrThr: 3.575 ± 1.768
1.788ThrVal: 1.788 ± 0.567
1.021ThrTrp: 1.021 ± 0.844
3.83ThrTyr: 3.83 ± 0.926
0.0ThrXaa: 0.0 ± 0.0
Val
2.298ValAla: 2.298 ± 1.455
2.043ValCys: 2.043 ± 1.081
0.766ValAsp: 0.766 ± 0.817
3.575ValGlu: 3.575 ± 0.731
2.554ValPhe: 2.554 ± 1.261
2.554ValGly: 2.554 ± 0.707
1.021ValHis: 1.021 ± 0.329
2.554ValIle: 2.554 ± 0.707
3.83ValLys: 3.83 ± 1.618
2.554ValLeu: 2.554 ± 0.666
1.021ValMet: 1.021 ± 0.329
3.32ValAsn: 3.32 ± 0.951
1.021ValPro: 1.021 ± 0.844
1.532ValGln: 1.532 ± 0.78
1.788ValArg: 1.788 ± 0.567
5.107ValSer: 5.107 ± 1.415
1.532ValThr: 1.532 ± 0.399
1.532ValVal: 1.532 ± 0.388
0.511ValTrp: 0.511 ± 0.439
2.554ValTyr: 2.554 ± 0.712
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.255TrpAsp: 0.255 ± 0.159
0.766TrpGlu: 0.766 ± 0.817
0.511TrpPhe: 0.511 ± 0.133
1.532TrpGly: 1.532 ± 0.633
0.0TrpHis: 0.0 ± 0.0
0.255TrpIle: 0.255 ± 0.159
0.0TrpLys: 0.0 ± 0.0
1.021TrpLeu: 1.021 ± 0.266
0.255TrpMet: 0.255 ± 0.925
0.766TrpAsn: 0.766 ± 0.477
0.0TrpPro: 0.0 ± 0.0
0.766TrpGln: 0.766 ± 0.477
0.255TrpArg: 0.255 ± 0.22
0.766TrpSer: 0.766 ± 0.477
0.255TrpThr: 0.255 ± 0.159
0.511TrpVal: 0.511 ± 0.318
0.0TrpTrp: 0.0 ± 0.0
0.766TrpTyr: 0.766 ± 0.194
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.277TyrAla: 1.277 ± 0.919
1.532TyrCys: 1.532 ± 0.976
0.766TyrAsp: 0.766 ± 0.817
3.32TyrGlu: 3.32 ± 1.22
2.554TyrPhe: 2.554 ± 0.585
2.298TyrGly: 2.298 ± 0.868
1.021TyrHis: 1.021 ± 0.54
4.341TyrIle: 4.341 ± 1.223
3.575TyrLys: 3.575 ± 1.424
3.32TyrLeu: 3.32 ± 0.264
1.532TyrMet: 1.532 ± 0.417
2.043TyrAsn: 2.043 ± 0.658
1.532TyrPro: 1.532 ± 0.399
1.021TyrGln: 1.021 ± 0.266
1.788TyrArg: 1.788 ± 0.567
2.298TyrSer: 2.298 ± 0.537
2.554TyrThr: 2.554 ± 0.707
0.255TyrVal: 0.255 ± 0.159
0.0TyrTrp: 0.0 ± 0.0
1.532TyrTyr: 1.532 ± 0.399
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3917 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
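One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only a plausible interpretation. The use of the standard deviation as the spread measure, the function names, and the toy sequences are all assumptions and may differ from what the database actually does.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide across the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is the standard deviation of the resampled
    estimates; the page does not say which spread measure is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteins in one-letter code (not Capim virus data).
proteins = ["MKKLAKL", "AAAAKL", "GGGG"]
print(bootstrap_error(proteins, "KL"))
print(bootstrap_error(proteins, "WW"))  # absent everywhere, so no spread
```

With only 3 proteins, as in this proteome, each resample can differ strongly from the original set, which may explain why some errors in the table are larger than the values they accompany.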

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski