Amino acid dipeptide frequency for Duwamo virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
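The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions, as is the use of the uniform 2.5‰ value as the threshold between over- and underrepresented dipeptides.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table (Ala, Cys, Asp, ... Tyr).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted within each sequence only.
    """
    counts = Counter()
    for seq in sequences:
        # Map every residue outside the 20 standard ones to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Made-up sequences, used only to exercise the functions.
toy = ["MKTAYIAKQR", "AAAKLLAA"]
freqs = dipeptide_permille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

With real data, the proteome's protein sequences would be passed in place of the toy list.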

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
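As a small worked example of this unit conversion, the following sketch converts a per mille value into a fraction and a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 11.078  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(permille_to_percent(ala_ala))   # the same value in percent
```

By the same conversion, the uniform expectation of 0.25% corresponds to 2.5‰, which is the scale on which the table values can be compared.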

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.078 ± 6.584
AlaCys: 0.739 ± 0.479
AlaAsp: 2.954 ± 0.698
AlaGlu: 4.431 ± 0.354
AlaPhe: 5.908 ± 2.655
AlaGly: 6.647 ± 2.708
AlaHis: 0.739 ± 0.479
AlaIle: 4.431 ± 1.591
AlaLys: 3.693 ± 1.144
AlaLeu: 11.817 ± 1.51
AlaMet: 0.739 ± 0.57
AlaAsn: 3.693 ± 1.874
AlaPro: 3.693 ± 0.894
AlaGln: 0.739 ± 1.138
AlaArg: 2.216 ± 2.178
AlaSer: 4.431 ± 0.354
AlaThr: 5.17 ± 2.281
AlaVal: 2.954 ± 0.727
AlaTrp: 2.216 ± 1.709
AlaTyr: 4.431 ± 1.987
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.216 ± 0.827
CysCys: 0.0 ± 0.0
CysAsp: 0.739 ± 0.57
CysGlu: 0.0 ± 0.0
CysPhe: 0.739 ± 0.479
CysGly: 0.739 ± 0.479
CysHis: 0.739 ± 0.479
CysIle: 0.739 ± 0.479
CysLys: 0.739 ± 0.57
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.739 ± 0.479
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.739 ± 0.479
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.739 ± 0.57
CysTrp: 0.0 ± 0.0
CysTyr: 1.477 ± 0.364
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.477 ± 0.364
AspCys: 0.0 ± 0.0
AspAsp: 2.954 ± 1.916
AspGlu: 4.431 ± 1.091
AspPhe: 3.693 ± 1.519
AspGly: 1.477 ± 0.958
AspHis: 1.477 ± 0.958
AspIle: 2.216 ± 0.795
AspLys: 4.431 ± 2.873
AspLeu: 2.954 ± 0.727
AspMet: 0.739 ± 0.479
AspAsn: 3.693 ± 0.484
AspPro: 4.431 ± 1.654
AspGln: 3.693 ± 0.484
AspArg: 0.739 ± 0.57
AspSer: 2.954 ± 0.896
AspThr: 3.693 ± 0.912
AspVal: 2.216 ± 0.631
AspTrp: 1.477 ± 0.958
AspTyr: 2.954 ± 0.727
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.477 ± 1.139
GluCys: 0.0 ± 0.0
GluAsp: 2.216 ± 0.827
GluGlu: 2.954 ± 1.06
GluPhe: 2.216 ± 1.709
GluGly: 2.954 ± 0.727
GluHis: 2.954 ± 1.457
GluIle: 0.739 ± 0.57
GluLys: 2.216 ± 1.437
GluLeu: 2.954 ± 1.916
GluMet: 1.477 ± 0.364
GluAsn: 4.431 ± 2.497
GluPro: 2.954 ± 1.457
GluGln: 0.739 ± 0.479
GluArg: 2.954 ± 1.06
GluSer: 2.216 ± 0.631
GluThr: 3.693 ± 0.912
GluVal: 2.954 ± 1.06
GluTrp: 0.739 ± 0.479
GluTyr: 4.431 ± 1.262
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.431 ± 1.751
PheCys: 0.739 ± 0.57
PheAsp: 2.216 ± 0.827
PheGlu: 2.216 ± 0.827
PhePhe: 2.216 ± 0.631
PheGly: 2.216 ± 0.795
PheHis: 2.954 ± 0.698
PheIle: 1.477 ± 0.364
PheLys: 3.693 ± 1.144
PheLeu: 1.477 ± 0.364
PheMet: 0.0 ± 0.0
PheAsn: 2.954 ± 1.916
PhePro: 5.17 ± 1.237
PheGln: 0.739 ± 0.57
PheArg: 1.477 ± 0.958
PheSer: 3.693 ± 1.933
PheThr: 3.693 ± 1.273
PheVal: 3.693 ± 0.894
PheTrp: 0.0 ± 0.0
PheTyr: 2.954 ± 0.896
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.431 ± 2.704
GlyCys: 0.0 ± 0.0
GlyAsp: 4.431 ± 1.253
GlyGlu: 2.954 ± 0.896
GlyPhe: 4.431 ± 0.794
GlyGly: 4.431 ± 1.987
GlyHis: 0.739 ± 0.479
GlyIle: 2.954 ± 0.727
GlyLys: 3.693 ± 1.519
GlyLeu: 5.908 ± 4.219
GlyMet: 0.739 ± 1.138
GlyAsn: 4.431 ± 1.591
GlyPro: 4.431 ± 2.905
GlyGln: 2.954 ± 0.698
GlyArg: 1.477 ± 1.112
GlySer: 2.216 ± 0.827
GlyThr: 3.693 ± 0.484
GlyVal: 2.954 ± 1.916
GlyTrp: 1.477 ± 0.364
GlyTyr: 2.216 ± 0.631
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.954 ± 1.93
HisCys: 0.739 ± 0.479
HisAsp: 0.739 ± 0.479
HisGlu: 1.477 ± 0.364
HisPhe: 1.477 ± 0.364
HisGly: 1.477 ± 0.958
HisHis: 1.477 ± 0.958
HisIle: 2.216 ± 0.631
HisLys: 0.739 ± 0.479
HisLeu: 2.216 ± 1.437
HisMet: 0.739 ± 0.479
HisAsn: 0.739 ± 1.138
HisPro: 0.739 ± 0.57
HisGln: 3.693 ± 0.894
HisArg: 1.477 ± 0.364
HisSer: 0.739 ± 0.479
HisThr: 2.954 ± 1.06
HisVal: 1.477 ± 0.958
HisTrp: 1.477 ± 0.958
HisTyr: 1.477 ± 0.364
HisXaa: 0.0 ± 0.0
Ile
IleAla: 10.34 ± 1.15
IleCys: 0.0 ± 0.0
IleAsp: 1.477 ± 0.364
IleGlu: 0.739 ± 0.479
IlePhe: 0.739 ± 0.479
IleGly: 2.954 ± 0.698
IleHis: 4.431 ± 0.794
IleIle: 1.477 ± 0.958
IleLys: 1.477 ± 0.364
IleLeu: 2.216 ± 0.631
IleMet: 0.739 ± 0.57
IleAsn: 1.477 ± 0.958
IlePro: 7.386 ± 1.98
IleGln: 2.954 ± 1.373
IleArg: 1.477 ± 0.364
IleSer: 6.647 ± 3.025
IleThr: 2.954 ± 1.06
IleVal: 5.17 ± 0.689
IleTrp: 0.0 ± 0.0
IleTyr: 1.477 ± 1.112
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 4.431 ± 2.873
LysGlu: 3.693 ± 1.519
LysPhe: 3.693 ± 1.933
LysGly: 3.693 ± 1.519
LysHis: 0.739 ± 0.479
LysIle: 3.693 ± 1.519
LysLys: 2.954 ± 1.916
LysLeu: 5.17 ± 1.678
LysMet: 2.216 ± 1.178
LysAsn: 2.954 ± 0.727
LysPro: 5.17 ± 1.237
LysGln: 2.954 ± 1.373
LysArg: 0.0 ± 0.0
LysSer: 2.216 ± 0.631
LysThr: 4.431 ± 1.262
LysVal: 2.954 ± 0.727
LysTrp: 2.954 ± 1.06
LysTyr: 4.431 ± 1.253
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.693 ± 1.519
LeuCys: 1.477 ± 0.958
LeuAsp: 6.647 ± 2.575
LeuGlu: 2.954 ± 0.727
LeuPhe: 2.954 ± 0.727
LeuGly: 2.954 ± 0.698
LeuHis: 2.216 ± 0.631
LeuIle: 5.17 ± 1.678
LeuLys: 6.647 ± 2.575
LeuLeu: 6.647 ± 1.894
LeuMet: 0.0 ± 0.0
LeuAsn: 3.693 ± 1.779
LeuPro: 3.693 ± 1.874
LeuGln: 3.693 ± 2.185
LeuArg: 2.954 ± 1.06
LeuSer: 5.17 ± 2.194
LeuThr: 8.124 ± 1.601
LeuVal: 2.216 ± 1.437
LeuTrp: 1.477 ± 0.364
LeuTyr: 4.431 ± 1.654
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.216 ± 2.178
MetCys: 1.477 ± 0.364
MetAsp: 0.739 ± 0.479
MetGlu: 1.477 ± 0.958
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.739 ± 0.479
MetIle: 0.739 ± 0.57
MetLys: 0.739 ± 0.479
MetLeu: 2.216 ± 1.178
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.477 ± 1.112
MetGln: 0.739 ± 0.479
MetArg: 1.477 ± 0.364
MetSer: 1.477 ± 1.139
MetThr: 0.0 ± 0.0
MetVal: 3.693 ± 0.484
MetTrp: 0.0 ± 0.0
MetTyr: 5.17 ± 1.418
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.17 ± 2.977
AsnCys: 0.739 ± 0.57
AsnAsp: 1.477 ± 1.055
AsnGlu: 2.216 ± 0.631
AsnPhe: 1.477 ± 2.276
AsnGly: 2.954 ± 1.93
AsnHis: 0.0 ± 0.0
AsnIle: 2.954 ± 1.93
AsnLys: 1.477 ± 1.139
AsnLeu: 5.17 ± 1.674
AsnMet: 2.216 ± 1.228
AsnAsn: 2.954 ± 3.264
AsnPro: 1.477 ± 1.139
AsnGln: 0.739 ± 0.479
AsnArg: 2.954 ± 0.698
AsnSer: 7.386 ± 3.86
AsnThr: 4.431 ± 1.654
AsnVal: 3.693 ± 0.484
AsnTrp: 0.739 ± 1.138
AsnTyr: 2.954 ± 1.06
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.693 ± 2.227
ProCys: 0.0 ± 0.0
ProAsp: 1.477 ± 0.364
ProGlu: 2.216 ± 1.437
ProPhe: 1.477 ± 0.364
ProGly: 2.954 ± 1.457
ProHis: 0.0 ± 0.0
ProIle: 6.647 ± 2.481
ProLys: 2.954 ± 0.698
ProLeu: 6.647 ± 1.894
ProMet: 2.216 ± 3.415
ProAsn: 2.216 ± 2.178
ProPro: 2.954 ± 0.727
ProGln: 2.954 ± 1.373
ProArg: 0.739 ± 1.138
ProSer: 8.124 ± 2.786
ProThr: 3.693 ± 1.273
ProVal: 2.216 ± 2.142
ProTrp: 0.739 ± 0.479
ProTyr: 1.477 ± 0.364
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.954 ± 1.93
GlnCys: 0.739 ± 0.57
GlnAsp: 0.739 ± 0.57
GlnGlu: 3.693 ± 0.912
GlnPhe: 0.739 ± 1.138
GlnGly: 1.477 ± 2.276
GlnHis: 1.477 ± 1.055
GlnIle: 1.477 ± 2.276
GlnLys: 2.216 ± 1.437
GlnLeu: 0.739 ± 0.57
GlnMet: 2.216 ± 0.622
GlnAsn: 2.216 ± 0.795
GlnPro: 3.693 ± 1.144
GlnGln: 2.216 ± 2.178
GlnArg: 1.477 ± 0.364
GlnSer: 2.954 ± 1.752
GlnThr: 5.17 ± 1.678
GlnVal: 3.693 ± 1.144
GlnTrp: 0.739 ± 0.57
GlnTyr: 0.739 ± 0.57
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.693 ± 1.874
ArgCys: 0.0 ± 0.0
ArgAsp: 1.477 ± 0.364
ArgGlu: 1.477 ± 0.364
ArgPhe: 0.739 ± 0.479
ArgGly: 1.477 ± 0.958
ArgHis: 0.739 ± 0.479
ArgIle: 2.954 ± 0.727
ArgLys: 1.477 ± 0.958
ArgLeu: 2.954 ± 1.373
ArgMet: 1.477 ± 0.958
ArgAsn: 2.954 ± 0.896
ArgPro: 1.477 ± 1.112
ArgGln: 0.739 ± 1.138
ArgArg: 2.216 ± 1.437
ArgSer: 1.477 ± 0.958
ArgThr: 2.216 ± 0.827
ArgVal: 2.216 ± 1.352
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.477 ± 1.055
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.908 ± 0.167
SerCys: 0.739 ± 0.479
SerAsp: 2.216 ± 0.827
SerGlu: 4.431 ± 2.497
SerPhe: 2.216 ± 1.709
SerGly: 5.908 ± 2.655
SerHis: 2.216 ± 0.631
SerIle: 6.647 ± 3.304
SerLys: 4.431 ± 1.654
SerLeu: 5.908 ± 2.119
SerMet: 0.739 ± 0.479
SerAsn: 4.431 ± 2.968
SerPro: 0.0 ± 0.0
SerGln: 4.431 ± 0.794
SerArg: 3.693 ± 1.273
SerSer: 5.908 ± 1.836
SerThr: 4.431 ± 2.497
SerVal: 5.17 ± 0.316
SerTrp: 1.477 ± 0.364
SerTyr: 0.739 ± 0.57
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.601 ± 2.249
ThrCys: 0.0 ± 0.0
ThrAsp: 10.34 ± 1.609
ThrGlu: 1.477 ± 0.364
ThrPhe: 4.431 ± 1.262
ThrGly: 5.17 ± 2.194
ThrHis: 1.477 ± 0.364
ThrIle: 6.647 ± 1.578
ThrLys: 7.386 ± 3.04
ThrLeu: 3.693 ± 1.144
ThrMet: 1.477 ± 0.364
ThrAsn: 0.739 ± 0.479
ThrPro: 2.216 ± 0.795
ThrGln: 2.216 ± 0.827
ThrArg: 0.0 ± 0.0
ThrSer: 3.693 ± 1.144
ThrThr: 4.431 ± 2.968
ThrVal: 3.693 ± 1.144
ThrTrp: 1.477 ± 1.139
ThrTyr: 1.477 ± 1.139
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.17 ± 1.678
ValCys: 0.0 ± 0.0
ValAsp: 2.216 ± 0.631
ValGlu: 1.477 ± 0.958
ValPhe: 2.954 ± 1.06
ValGly: 5.908 ± 1.395
ValHis: 0.739 ± 0.479
ValIle: 1.477 ± 1.112
ValLys: 4.431 ± 1.654
ValLeu: 6.647 ± 1.578
ValMet: 2.216 ± 1.315
ValAsn: 4.431 ± 1.253
ValPro: 2.216 ± 0.795
ValGln: 1.477 ± 1.112
ValArg: 4.431 ± 0.794
ValSer: 5.17 ± 1.295
ValThr: 4.431 ± 1.654
ValVal: 2.954 ± 0.727
ValTrp: 0.0 ± 0.0
ValTyr: 1.477 ± 0.958
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.739 ± 0.479
TrpCys: 0.739 ± 0.479
TrpAsp: 0.739 ± 0.479
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.477 ± 0.364
TrpGly: 2.216 ± 1.352
TrpHis: 2.216 ± 0.631
TrpIle: 1.477 ± 0.364
TrpLys: 1.477 ± 0.958
TrpLeu: 0.0 ± 0.0
TrpMet: 0.739 ± 0.57
TrpAsn: 0.0 ± 0.0
TrpPro: 0.739 ± 0.479
TrpGln: 0.739 ± 0.57
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 2.216 ± 0.827
TrpVal: 1.477 ± 0.364
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.739 ± 0.479
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.477 ± 1.139
TyrCys: 2.216 ± 0.827
TyrAsp: 1.477 ± 0.364
TyrGlu: 2.954 ± 1.916
TyrPhe: 4.431 ± 1.262
TyrGly: 2.954 ± 0.698
TyrHis: 2.954 ± 1.06
TyrIle: 0.0 ± 0.0
TyrLys: 1.477 ± 0.364
TyrLeu: 1.477 ± 0.364
TyrMet: 2.954 ± 0.896
TyrAsn: 5.17 ± 1.595
TyrPro: 1.477 ± 0.364
TyrGln: 2.954 ± 1.93
TyrArg: 0.739 ± 0.479
TyrSer: 5.17 ± 1.295
TyrThr: 2.216 ± 0.795
TyrVal: 3.693 ± 1.144
TyrTrp: 0.739 ± 0.479
TyrTyr: 2.216 ± 0.631
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1355 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
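The exact bootstrap procedure is not described here, so the following is only a plausible sketch of protein-level bootstrapping: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs per sequence."""
    count = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Made-up proteins, used only to exercise the functions.
proteins = ["AAAA", "ACAC", "CCCC"]
print(pair_permille(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

With only 3 proteins, as in this proteome, each replicate can differ strongly from the others, which may explain why some errors in the table are as large as the values themselves.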

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski