Amino acid dipeptide frequency for CRESS virus sp. ct0Vt4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones are marked in blue in the table.
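The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made here.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within one protein (assumed convention).
        for a, b in zip(seq, seq[1:]):
            # Any letter outside the 20 standard ones is counted as X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy_proteins = ["MKTLLA", "MKKAAL"]  # made-up sequences, not the real proteome
freqs = dipeptide_per_mille(toy_proteins)
overrepresented = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With so few residues in the toy input, every dipeptide that occurs at all exceeds the 2.5‰ expectation, which is also why a small proteome such as this one shows many zero entries next to a few large ones.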

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
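As a small worked example of the unit conversion, the sketch below turns a table value into a fraction and into an approximate number of occurrences. The helper name `per_mille_to_fraction` is hypothetical, and using the 1076 amino acids reported in the statistics line as the denominator is an assumption; the exact number of dipeptides counted by the database may differ slightly.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# AlaLeu is listed as 7.442 per mille in the table.
ala_leu_fraction = per_mille_to_fraction(7.442)

# Assumption: roughly 1076 positions in total, taken from the statistics line.
approx_occurrences = round(ala_leu_fraction * 1076)
```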

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.721AlaAla: 3.721 ± 2.419
0.0AlaCys: 0.0 ± 0.0
2.791AlaAsp: 2.791 ± 2.006
2.791AlaGlu: 2.791 ± 2.338
2.791AlaPhe: 2.791 ± 1.153
3.721AlaGly: 3.721 ± 1.635
0.93AlaHis: 0.93 ± 0.919
4.651AlaIle: 4.651 ± 0.853
1.86AlaLys: 1.86 ± 1.337
7.442AlaLeu: 7.442 ± 1.885
2.791AlaMet: 2.791 ± 0.523
4.651AlaAsn: 4.651 ± 0.994
0.0AlaPro: 0.0 ± 0.0
1.86AlaGln: 1.86 ± 1.337
5.581AlaArg: 5.581 ± 1.23
6.512AlaSer: 6.512 ± 1.477
2.791AlaThr: 2.791 ± 1.153
1.86AlaVal: 1.86 ± 0.748
1.86AlaTrp: 1.86 ± 0.748
2.791AlaTyr: 2.791 ± 0.523
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.93CysPhe: 0.93 ± 0.919
0.0CysGly: 0.0 ± 0.0
1.86CysHis: 1.86 ± 1.938
1.86CysIle: 1.86 ± 1.641
1.86CysLys: 1.86 ± 1.225
1.86CysLeu: 1.86 ± 1.225
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.93CysPro: 0.93 ± 0.969
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.93CysThr: 0.93 ± 0.969
0.93CysVal: 0.93 ± 0.919
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.512AspAla: 6.512 ± 1.517
0.0AspCys: 0.0 ± 0.0
0.0AspAsp: 0.0 ± 0.0
1.86AspGlu: 1.86 ± 1.838
0.93AspPhe: 0.93 ± 0.919
4.651AspGly: 4.651 ± 0.853
0.93AspHis: 0.93 ± 0.919
1.86AspIle: 1.86 ± 1.641
0.93AspLys: 0.93 ± 0.669
6.512AspLeu: 6.512 ± 1.804
0.93AspMet: 0.93 ± 0.669
1.86AspAsn: 1.86 ± 0.829
1.86AspPro: 1.86 ± 1.122
1.86AspGln: 1.86 ± 1.938
0.93AspArg: 0.93 ± 0.919
2.791AspSer: 2.791 ± 1.153
3.721AspThr: 3.721 ± 2.779
3.721AspVal: 3.721 ± 1.635
0.93AspTrp: 0.93 ± 0.919
1.86AspTyr: 1.86 ± 0.748
0.0AspXaa: 0.0 ± 0.0
Glu
5.581GluAla: 5.581 ± 1.047
0.0GluCys: 0.0 ± 0.0
0.93GluAsp: 0.93 ± 0.969
3.721GluGlu: 3.721 ± 2.608
1.86GluPhe: 1.86 ± 0.748
0.0GluGly: 0.0 ± 0.0
1.86GluHis: 1.86 ± 1.938
3.721GluIle: 3.721 ± 0.429
3.721GluLys: 3.721 ± 2.674
4.651GluLeu: 4.651 ± 0.853
1.86GluMet: 1.86 ± 0.716
0.93GluAsn: 0.93 ± 0.969
0.0GluPro: 0.0 ± 0.0
0.93GluGln: 0.93 ± 0.919
3.721GluArg: 3.721 ± 1.974
3.721GluSer: 3.721 ± 1.635
3.721GluThr: 3.721 ± 1.245
2.791GluVal: 2.791 ± 1.153
0.0GluTrp: 0.0 ± 0.0
2.791GluTyr: 2.791 ± 0.523
0.0GluXaa: 0.0 ± 0.0
Phe
1.86PheAla: 1.86 ± 1.337
0.0PheCys: 0.0 ± 0.0
0.93PheAsp: 0.93 ± 0.919
6.512PheGlu: 6.512 ± 1.329
0.0PhePhe: 0.0 ± 0.0
5.581PheGly: 5.581 ± 1.639
0.0PheHis: 0.0 ± 0.0
1.86PheIle: 1.86 ± 2.535
2.791PheLys: 2.791 ± 1.081
0.93PheLeu: 0.93 ± 0.669
0.0PheMet: 0.0 ± 0.0
1.86PheAsn: 1.86 ± 1.225
1.86PhePro: 1.86 ± 1.938
0.93PheGln: 0.93 ± 0.969
2.791PheArg: 2.791 ± 1.081
4.651PheSer: 4.651 ± 2.31
2.791PheThr: 2.791 ± 2.006
0.93PheVal: 0.93 ± 0.669
0.93PheTrp: 0.93 ± 0.919
0.93PheTyr: 0.93 ± 0.669
0.0PheXaa: 0.0 ± 0.0
Gly
0.93GlyAla: 0.93 ± 0.669
0.93GlyCys: 0.93 ± 0.919
1.86GlyAsp: 1.86 ± 0.748
1.86GlyGlu: 1.86 ± 0.748
5.581GlyPhe: 5.581 ± 1.087
1.86GlyGly: 1.86 ± 0.748
0.93GlyHis: 0.93 ± 0.669
5.581GlyIle: 5.581 ± 2.305
3.721GlyLys: 3.721 ± 1.496
1.86GlyLeu: 1.86 ± 2.535
0.93GlyMet: 0.93 ± 0.969
5.581GlyAsn: 5.581 ± 2.358
0.93GlyPro: 0.93 ± 0.919
7.442GlyGln: 7.442 ± 1.93
4.651GlyArg: 4.651 ± 1.691
4.651GlySer: 4.651 ± 0.994
7.442GlyThr: 7.442 ± 2.96
5.581GlyVal: 5.581 ± 2.163
0.93GlyTrp: 0.93 ± 0.669
1.86GlyTyr: 1.86 ± 0.748
0.0GlyXaa: 0.0 ± 0.0
His
0.93HisAla: 0.93 ± 0.919
0.93HisCys: 0.93 ± 0.919
1.86HisAsp: 1.86 ± 0.829
0.93HisGlu: 0.93 ± 0.969
2.791HisPhe: 2.791 ± 2.006
1.86HisGly: 1.86 ± 1.938
0.93HisHis: 0.93 ± 0.969
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
0.93HisLeu: 0.93 ± 0.919
0.0HisMet: 0.0 ± 0.801
0.93HisAsn: 0.93 ± 0.969
2.791HisPro: 2.791 ± 0.523
0.93HisGln: 0.93 ± 0.919
1.86HisArg: 1.86 ± 1.337
0.93HisSer: 0.93 ± 0.669
1.86HisThr: 1.86 ± 1.938
0.93HisVal: 0.93 ± 0.919
0.93HisTrp: 0.93 ± 0.919
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.93IleCys: 0.93 ± 0.969
2.791IleAsp: 2.791 ± 0.523
0.93IleGlu: 0.93 ± 0.669
0.93IlePhe: 0.93 ± 1.268
3.721IleGly: 3.721 ± 1.692
0.93IleHis: 0.93 ± 0.969
0.0IleIle: 0.0 ± 0.0
1.86IleLys: 1.86 ± 0.748
1.86IleLeu: 1.86 ± 0.748
1.86IleMet: 1.86 ± 0.748
3.721IleAsn: 3.721 ± 1.992
5.581IlePro: 5.581 ± 2.358
2.791IleGln: 2.791 ± 1.153
5.581IleArg: 5.581 ± 0.895
5.581IleSer: 5.581 ± 1.248
3.721IleThr: 3.721 ± 1.635
0.93IleVal: 0.93 ± 1.268
0.93IleTrp: 0.93 ± 0.919
1.86IleTyr: 1.86 ± 1.225
0.0IleXaa: 0.0 ± 0.0
Lys
3.721LysAla: 3.721 ± 2.674
1.86LysCys: 1.86 ± 1.337
4.651LysAsp: 4.651 ± 2.762
0.0LysGlu: 0.0 ± 0.0
1.86LysPhe: 1.86 ± 0.748
0.93LysGly: 0.93 ± 0.919
0.93LysHis: 0.93 ± 0.669
3.721LysIle: 3.721 ± 1.382
6.512LysLys: 6.512 ± 3.542
2.791LysLeu: 2.791 ± 1.537
1.86LysMet: 1.86 ± 1.337
3.721LysAsn: 3.721 ± 2.674
2.791LysPro: 2.791 ± 1.081
0.93LysGln: 0.93 ± 0.919
7.442LysArg: 7.442 ± 3.311
2.791LysSer: 2.791 ± 1.537
6.512LysThr: 6.512 ± 1.038
2.791LysVal: 2.791 ± 2.006
1.86LysTrp: 1.86 ± 1.337
2.791LysTyr: 2.791 ± 2.006
0.0LysXaa: 0.0 ± 0.0
Leu
5.581LeuAla: 5.581 ± 1.796
0.0LeuCys: 0.0 ± 0.0
2.791LeuAsp: 2.791 ± 1.884
0.93LeuGlu: 0.93 ± 0.919
2.791LeuPhe: 2.791 ± 2.006
4.651LeuGly: 4.651 ± 4.392
1.86LeuHis: 1.86 ± 1.122
1.86LeuIle: 1.86 ± 1.337
5.581LeuLys: 5.581 ± 2.163
8.372LeuLeu: 8.372 ± 3.236
2.791LeuMet: 2.791 ± 1.884
6.512LeuAsn: 6.512 ± 2.297
5.581LeuPro: 5.581 ± 1.23
0.93LeuGln: 0.93 ± 0.969
6.512LeuArg: 6.512 ± 3.198
5.581LeuSer: 5.581 ± 2.657
7.442LeuThr: 7.442 ± 1.229
4.651LeuVal: 4.651 ± 2.422
0.93LeuTrp: 0.93 ± 0.919
2.791LeuTyr: 2.791 ± 1.675
0.0LeuXaa: 0.0 ± 0.0
Met
1.86MetAla: 1.86 ± 1.337
0.0MetCys: 0.0 ± 0.0
1.86MetAsp: 1.86 ± 1.838
2.791MetGlu: 2.791 ± 1.081
0.0MetPhe: 0.0 ± 0.0
1.86MetGly: 1.86 ± 0.829
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.86MetLeu: 1.86 ± 0.829
0.0MetMet: 0.0 ± 0.0
0.93MetAsn: 0.93 ± 0.669
3.721MetPro: 3.721 ± 1.366
0.93MetGln: 0.93 ± 0.669
1.86MetArg: 1.86 ± 0.748
1.86MetSer: 1.86 ± 1.122
1.86MetThr: 1.86 ± 1.225
1.86MetVal: 1.86 ± 1.337
0.93MetTrp: 0.93 ± 0.969
0.93MetTyr: 0.93 ± 0.969
0.0MetXaa: 0.0 ± 0.0
Asn
2.791AsnAla: 2.791 ± 2.006
0.0AsnCys: 0.0 ± 0.0
3.721AsnAsp: 3.721 ± 1.366
2.791AsnGlu: 2.791 ± 0.523
2.791AsnPhe: 2.791 ± 1.512
2.791AsnGly: 2.791 ± 2.006
0.0AsnHis: 0.0 ± 0.0
0.93AsnIle: 0.93 ± 0.919
2.791AsnLys: 2.791 ± 1.512
3.721AsnLeu: 3.721 ± 2.522
1.86AsnMet: 1.86 ± 0.829
1.86AsnAsn: 1.86 ± 1.337
0.93AsnPro: 0.93 ± 0.669
1.86AsnGln: 1.86 ± 0.829
1.86AsnArg: 1.86 ± 1.938
4.651AsnSer: 4.651 ± 1.038
3.721AsnThr: 3.721 ± 0.429
5.581AsnVal: 5.581 ± 2.163
3.721AsnTrp: 3.721 ± 2.608
1.86AsnTyr: 1.86 ± 1.337
0.0AsnXaa: 0.0 ± 0.0
Pro
2.791ProAla: 2.791 ± 2.908
1.86ProCys: 1.86 ± 1.938
0.93ProAsp: 0.93 ± 0.919
2.791ProGlu: 2.791 ± 1.675
2.791ProPhe: 2.791 ± 1.675
3.721ProGly: 3.721 ± 2.674
0.0ProHis: 0.0 ± 0.0
1.86ProIle: 1.86 ± 1.938
5.581ProLys: 5.581 ± 1.639
3.721ProLeu: 3.721 ± 1.658
1.86ProMet: 1.86 ± 0.817
1.86ProAsn: 1.86 ± 1.122
2.791ProPro: 2.791 ± 2.006
1.86ProGln: 1.86 ± 1.122
0.93ProArg: 0.93 ± 0.969
5.581ProSer: 5.581 ± 3.365
5.581ProThr: 5.581 ± 1.23
4.651ProVal: 4.651 ± 2.236
0.93ProTrp: 0.93 ± 0.969
3.721ProTyr: 3.721 ± 0.429
0.0ProXaa: 0.0 ± 0.0
Gln
2.791GlnAla: 2.791 ± 1.081
0.93GlnCys: 0.93 ± 0.969
1.86GlnAsp: 1.86 ± 1.122
0.0GlnGlu: 0.0 ± 0.0
0.93GlnPhe: 0.93 ± 0.669
2.791GlnGly: 2.791 ± 1.537
1.86GlnHis: 1.86 ± 0.748
3.721GlnIle: 3.721 ± 1.658
0.93GlnLys: 0.93 ± 0.669
4.651GlnLeu: 4.651 ± 1.618
0.0GlnMet: 0.0 ± 0.0
1.86GlnAsn: 1.86 ± 0.829
2.791GlnPro: 2.791 ± 1.153
1.86GlnGln: 1.86 ± 1.122
4.651GlnArg: 4.651 ± 3.019
2.791GlnSer: 2.791 ± 2.908
1.86GlnThr: 1.86 ± 0.748
0.93GlnVal: 0.93 ± 0.669
0.0GlnTrp: 0.0 ± 0.0
0.93GlnTyr: 0.93 ± 0.919
0.0GlnXaa: 0.0 ± 0.0
Arg
4.651ArgAla: 4.651 ± 0.879
0.93ArgCys: 0.93 ± 1.268
0.0ArgAsp: 0.0 ± 0.0
5.581ArgGlu: 5.581 ± 2.305
1.86ArgPhe: 1.86 ± 0.748
3.721ArgGly: 3.721 ± 2.419
0.93ArgHis: 0.93 ± 0.669
1.86ArgIle: 1.86 ± 0.748
6.512ArgLys: 6.512 ± 2.702
7.442ArgLeu: 7.442 ± 3.994
1.86ArgMet: 1.86 ± 1.337
2.791ArgAsn: 2.791 ± 1.081
2.791ArgPro: 2.791 ± 0.523
3.721ArgGln: 3.721 ± 2.608
5.581ArgArg: 5.581 ± 1.962
4.651ArgSer: 4.651 ± 0.853
5.581ArgThr: 5.581 ± 3.623
3.721ArgVal: 3.721 ± 1.382
0.93ArgTrp: 0.93 ± 0.919
2.791ArgTyr: 2.791 ± 2.757
0.0ArgXaa: 0.0 ± 0.0
Ser
8.372SerAla: 8.372 ± 1.57
0.93SerCys: 0.93 ± 1.268
2.791SerAsp: 2.791 ± 1.153
2.791SerGlu: 2.791 ± 1.884
1.86SerPhe: 1.86 ± 0.829
3.721SerGly: 3.721 ± 1.366
2.791SerHis: 2.791 ± 0.523
4.651SerIle: 4.651 ± 1.038
4.651SerLys: 4.651 ± 1.735
3.721SerLeu: 3.721 ± 1.294
2.791SerMet: 2.791 ± 1.153
1.86SerAsn: 1.86 ± 1.838
6.512SerPro: 6.512 ± 5.483
1.86SerGln: 1.86 ± 1.337
5.581SerArg: 5.581 ± 2.249
7.442SerSer: 7.442 ± 3.364
6.512SerThr: 6.512 ± 1.032
5.581SerVal: 5.581 ± 2.249
3.721SerTrp: 3.721 ± 1.19
3.721SerTyr: 3.721 ± 1.496
0.0SerXaa: 0.0 ± 0.0
Thr
4.651ThrAla: 4.651 ± 2.301
0.93ThrCys: 0.93 ± 1.268
1.86ThrAsp: 1.86 ± 1.838
2.791ThrGlu: 2.791 ± 0.523
2.791ThrPhe: 2.791 ± 1.675
10.233ThrGly: 10.233 ± 1.554
0.93ThrHis: 0.93 ± 0.969
2.791ThrIle: 2.791 ± 1.081
4.651ThrLys: 4.651 ± 2.236
8.372ThrLeu: 8.372 ± 3.495
0.0ThrMet: 0.0 ± 0.0
5.581ThrAsn: 5.581 ± 2.129
9.302ThrPro: 9.302 ± 2.702
2.791ThrGln: 2.791 ± 1.884
2.791ThrArg: 2.791 ± 2.338
8.372ThrSer: 8.372 ± 2.605
7.442ThrThr: 7.442 ± 1.728
0.93ThrVal: 0.93 ± 1.268
1.86ThrTrp: 1.86 ± 1.337
1.86ThrTyr: 1.86 ± 1.122
0.0ThrXaa: 0.0 ± 0.0
Val
1.86ValAla: 1.86 ± 1.838
0.93ValCys: 0.93 ± 0.919
6.512ValAsp: 6.512 ± 1.477
3.721ValGlu: 3.721 ± 1.496
4.651ValPhe: 4.651 ± 1.795
4.651ValGly: 4.651 ± 1.735
3.721ValHis: 3.721 ± 1.496
1.86ValIle: 1.86 ± 1.337
1.86ValLys: 1.86 ± 1.337
3.721ValLeu: 3.721 ± 3.64
0.93ValMet: 0.93 ± 0.919
1.86ValAsn: 1.86 ± 1.225
1.86ValPro: 1.86 ± 1.337
1.86ValGln: 1.86 ± 1.337
1.86ValArg: 1.86 ± 0.748
4.651ValSer: 4.651 ± 2.125
2.791ValThr: 2.791 ± 1.081
9.302ValVal: 9.302 ± 2.385
1.86ValTrp: 1.86 ± 1.838
1.86ValTyr: 1.86 ± 0.748
0.0ValXaa: 0.0 ± 0.0
Trp
0.93TrpAla: 0.93 ± 0.669
0.0TrpCys: 0.0 ± 0.0
3.721TrpAsp: 3.721 ± 2.243
0.93TrpGlu: 0.93 ± 0.669
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.86TrpIle: 1.86 ± 1.838
4.651TrpLys: 4.651 ± 1.795
1.86TrpLeu: 1.86 ± 0.748
0.0TrpMet: 0.0 ± 0.0
1.86TrpAsn: 1.86 ± 1.938
0.0TrpPro: 0.0 ± 0.0
1.86TrpGln: 1.86 ± 0.748
0.93TrpArg: 0.93 ± 0.669
2.791TrpSer: 2.791 ± 0.523
1.86TrpThr: 1.86 ± 1.122
0.0TrpVal: 0.0 ± 0.0
0.93TrpTrp: 0.93 ± 0.669
0.93TrpTyr: 0.93 ± 0.919
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.86TyrAla: 1.86 ± 0.748
0.0TyrCys: 0.0 ± 0.0
2.791TyrAsp: 2.791 ± 2.006
2.791TyrGlu: 2.791 ± 1.153
0.0TyrPhe: 0.0 ± 0.0
4.651TyrGly: 4.651 ± 1.038
1.86TyrHis: 1.86 ± 0.829
1.86TyrIle: 1.86 ± 1.337
0.0TyrLys: 0.0 ± 0.0
1.86TyrLeu: 1.86 ± 1.122
1.86TyrMet: 1.86 ± 1.003
0.0TyrAsn: 0.0 ± 0.0
2.791TyrPro: 2.791 ± 2.757
0.93TyrGln: 0.93 ± 0.919
2.791TyrArg: 2.791 ± 2.006
1.86TyrSer: 1.86 ± 0.829
2.791TyrThr: 2.791 ± 0.523
4.651TyrVal: 4.651 ± 4.595
0.93TyrTrp: 0.93 ± 0.919
1.86TyrTyr: 1.86 ± 0.748
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1076 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
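Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each resample, and the spread across resamples is taken as the error. This is a minimal sketch under assumptions, not the database's actual procedure: the function names, the fixed random seed, the toy sequences, and the use of the population standard deviation are all choices made here for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per mille frequency of each dipeptide that occurs in the sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate the per-dipeptide error by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    samples = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        samples.append(dipeptide_per_mille(resampled))
    dipeptides = set().union(*samples)
    # A dipeptide absent from a resample contributes a frequency of 0.
    return {
        dp: statistics.pstdev([s.get(dp, 0.0) for s in samples])
        for dp in dipeptides
    }


toy_errors = bootstrap_error(["MKTLLA", "MKKAAL", "MAAL"])  # made-up sequences
identical_errors = bootstrap_error(["MKT", "MKT"])
```

Resampling whole proteins rather than individual residues keeps each protein's internal dipeptide composition intact, which is why a proteome with only a handful of proteins yields large error estimates relative to the values themselves.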

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski