Amino acid dipepetide frequency for Wuhan louse fly virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.202AlaAla: 9.202 ± 6.819
1.534AlaCys: 1.534 ± 0.388
4.601AlaAsp: 4.601 ± 2.287
5.752AlaGlu: 5.752 ± 2.859
4.601AlaPhe: 4.601 ± 0.042
5.368AlaGly: 5.368 ± 4.165
2.301AlaHis: 2.301 ± 1.101
4.218AlaIle: 4.218 ± 0.226
2.301AlaLys: 2.301 ± 0.021
9.969AlaLeu: 9.969 ± 1.962
2.684AlaMet: 2.684 ± 0.162
1.15AlaAsn: 1.15 ± 0.572
4.218AlaPro: 4.218 ± 0.897
3.451AlaGln: 3.451 ± 1.715
5.368AlaArg: 5.368 ± 3.042
5.752AlaSer: 5.752 ± 0.614
6.135AlaThr: 6.135 ± 2.675
6.135AlaVal: 6.135 ± 0.43
3.067AlaTrp: 3.067 ± 0.346
4.601AlaTyr: 4.601 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
1.917CysAla: 1.917 ± 0.205
0.0CysCys: 0.0 ± 0.0
2.684CysAsp: 2.684 ± 1.285
1.534CysGlu: 1.534 ± 0.734
0.0CysPhe: 0.0 ± 0.0
0.767CysGly: 0.767 ± 0.367
0.383CysHis: 0.383 ± 0.184
0.383CysIle: 0.383 ± 0.184
1.534CysLys: 1.534 ± 0.734
3.067CysLeu: 3.067 ± 0.346
0.767CysMet: 0.767 ± 0.367
0.0CysAsn: 0.0 ± 0.0
0.767CysPro: 0.767 ± 0.367
1.15CysGln: 1.15 ± 0.551
1.15CysArg: 1.15 ± 0.551
0.383CysSer: 0.383 ± 0.184
0.0CysThr: 0.0 ± 0.0
0.383CysVal: 0.383 ± 0.184
0.0CysTrp: 0.0 ± 0.0
1.15CysTyr: 1.15 ± 0.551
0.0CysXaa: 0.0 ± 0.0
Asp
4.985AspAla: 4.985 ± 0.981
0.767AspCys: 0.767 ± 0.367
2.684AspAsp: 2.684 ± 0.162
3.834AspGlu: 3.834 ± 1.532
1.917AspPhe: 1.917 ± 1.327
3.067AspGly: 3.067 ± 1.468
1.534AspHis: 1.534 ± 0.734
2.684AspIle: 2.684 ± 1.285
1.534AspLys: 1.534 ± 0.388
4.218AspLeu: 4.218 ± 2.019
1.917AspMet: 1.917 ± 0.205
0.383AspAsn: 0.383 ± 0.184
3.834AspPro: 3.834 ± 1.532
0.383AspGln: 0.383 ± 0.184
6.518AspArg: 6.518 ± 3.614
2.301AspSer: 2.301 ± 0.021
3.451AspThr: 3.451 ± 1.652
3.834AspVal: 3.834 ± 0.713
1.917AspTrp: 1.917 ± 0.205
2.684AspTyr: 2.684 ± 0.162
0.0AspXaa: 0.0 ± 0.0
Glu
5.752GluAla: 5.752 ± 2.753
1.15GluCys: 1.15 ± 0.551
2.301GluAsp: 2.301 ± 0.021
2.684GluGlu: 2.684 ± 0.162
0.767GluPhe: 0.767 ± 0.755
3.451GluGly: 3.451 ± 0.53
3.451GluHis: 3.451 ± 0.53
3.451GluIle: 3.451 ± 0.593
2.301GluLys: 2.301 ± 0.021
7.285GluLeu: 7.285 ± 1.243
1.917GluMet: 1.917 ± 1.327
1.15GluAsn: 1.15 ± 0.551
1.917GluPro: 1.917 ± 0.205
1.917GluGln: 1.917 ± 0.918
5.368GluArg: 5.368 ± 0.798
5.752GluSer: 5.752 ± 1.631
3.451GluThr: 3.451 ± 0.53
3.451GluVal: 3.451 ± 0.593
0.383GluTrp: 0.383 ± 0.184
3.451GluTyr: 3.451 ± 0.593
0.0GluXaa: 0.0 ± 0.0
Phe
1.534PheAla: 1.534 ± 1.511
1.15PheCys: 1.15 ± 0.551
1.917PheAsp: 1.917 ± 1.327
1.917PheGlu: 1.917 ± 0.918
0.767PhePhe: 0.767 ± 0.367
1.917PheGly: 1.917 ± 1.327
0.0PheHis: 0.0 ± 0.0
1.917PheIle: 1.917 ± 0.205
1.15PheLys: 1.15 ± 1.694
2.301PheLeu: 2.301 ± 1.101
0.767PheMet: 0.767 ± 0.367
0.383PheAsn: 0.383 ± 0.184
1.534PhePro: 1.534 ± 0.734
0.767PheGln: 0.767 ± 0.755
5.752PheArg: 5.752 ± 0.614
3.451PheSer: 3.451 ± 0.53
1.917PheThr: 1.917 ± 0.205
1.15PheVal: 1.15 ± 0.551
0.0PheTrp: 0.0 ± 0.0
1.917PheTyr: 1.917 ± 0.918
0.0PheXaa: 0.0 ± 0.0
Gly
2.684GlyAla: 2.684 ± 0.162
1.15GlyCys: 1.15 ± 0.551
3.834GlyAsp: 3.834 ± 0.713
3.067GlyGlu: 3.067 ± 0.346
3.067GlyPhe: 3.067 ± 1.899
3.451GlyGly: 3.451 ± 0.593
1.917GlyHis: 1.917 ± 0.918
3.834GlyIle: 3.834 ± 1.532
2.301GlyLys: 2.301 ± 1.144
5.752GlyLeu: 5.752 ± 0.509
2.301GlyMet: 2.301 ± 2.266
1.917GlyAsn: 1.917 ± 0.205
1.917GlyPro: 1.917 ± 1.327
3.067GlyGln: 3.067 ± 1.899
4.601GlyArg: 4.601 ± 1.08
1.15GlySer: 1.15 ± 0.551
1.534GlyThr: 1.534 ± 2.633
5.368GlyVal: 5.368 ± 1.447
1.917GlyTrp: 1.917 ± 0.205
1.534GlyTyr: 1.534 ± 0.388
0.0GlyXaa: 0.0 ± 0.0
His
2.301HisAla: 2.301 ± 1.101
0.383HisCys: 0.383 ± 0.184
1.534HisAsp: 1.534 ± 0.734
1.15HisGlu: 1.15 ± 0.572
0.767HisPhe: 0.767 ± 0.367
2.301HisGly: 2.301 ± 1.101
0.767HisHis: 0.767 ± 0.367
1.917HisIle: 1.917 ± 0.918
1.15HisLys: 1.15 ± 0.572
3.067HisLeu: 3.067 ± 0.346
1.15HisMet: 1.15 ± 0.551
2.684HisAsn: 2.684 ± 0.162
2.684HisPro: 2.684 ± 0.96
1.917HisGln: 1.917 ± 0.918
2.301HisArg: 2.301 ± 1.101
1.534HisSer: 1.534 ± 0.734
1.917HisThr: 1.917 ± 0.918
1.917HisVal: 1.917 ± 0.918
0.0HisTrp: 0.0 ± 0.0
1.534HisTyr: 1.534 ± 0.734
0.0HisXaa: 0.0 ± 0.0
Ile
3.451IleAla: 3.451 ± 1.715
0.383IleCys: 0.383 ± 0.184
3.067IleAsp: 3.067 ± 0.346
3.451IleGlu: 3.451 ± 0.53
0.767IlePhe: 0.767 ± 0.367
4.601IleGly: 4.601 ± 1.08
1.15IleHis: 1.15 ± 0.551
4.601IleIle: 4.601 ± 0.042
1.534IleLys: 1.534 ± 0.388
7.669IleLeu: 7.669 ± 0.304
1.917IleMet: 1.917 ± 0.918
0.767IleAsn: 0.767 ± 0.367
2.684IlePro: 2.684 ± 0.96
1.15IleGln: 1.15 ± 0.551
3.834IleArg: 3.834 ± 0.409
3.451IleSer: 3.451 ± 1.715
4.601IleThr: 4.601 ± 2.287
5.752IleVal: 5.752 ± 1.631
0.0IleTrp: 0.0 ± 0.0
4.601IleTyr: 4.601 ± 2.203
0.0IleXaa: 0.0 ± 0.0
Lys
2.301LysAla: 2.301 ± 0.021
0.383LysCys: 0.383 ± 0.184
2.684LysAsp: 2.684 ± 2.082
2.301LysGlu: 2.301 ± 0.021
1.15LysPhe: 1.15 ± 0.551
1.917LysGly: 1.917 ± 0.205
0.0LysHis: 0.0 ± 0.0
1.15LysIle: 1.15 ± 0.551
1.917LysLys: 1.917 ± 0.918
5.752LysLeu: 5.752 ± 1.631
0.383LysMet: 0.383 ± 0.184
1.917LysAsn: 1.917 ± 1.327
2.301LysPro: 2.301 ± 0.021
0.767LysGln: 0.767 ± 0.755
1.15LysArg: 1.15 ± 0.572
1.15LysSer: 1.15 ± 0.572
3.451LysThr: 3.451 ± 0.53
3.834LysVal: 3.834 ± 1.836
1.15LysTrp: 1.15 ± 0.551
0.767LysTyr: 0.767 ± 0.755
0.0LysXaa: 0.0 ± 0.0
Leu
8.819LeuAla: 8.819 ± 3.635
3.067LeuCys: 3.067 ± 1.468
4.985LeuAsp: 4.985 ± 2.386
4.601LeuGlu: 4.601 ± 2.203
3.067LeuPhe: 3.067 ± 1.468
5.752LeuGly: 5.752 ± 2.753
4.218LeuHis: 4.218 ± 0.897
2.684LeuIle: 2.684 ± 0.96
2.301LeuLys: 2.301 ± 1.101
9.202LeuLeu: 9.202 ± 1.038
4.218LeuMet: 4.218 ± 2.019
2.301LeuAsn: 2.301 ± 0.021
5.752LeuPro: 5.752 ± 1.736
3.067LeuGln: 3.067 ± 0.346
7.285LeuArg: 7.285 ± 1.002
6.135LeuSer: 6.135 ± 1.815
6.518LeuThr: 6.518 ± 1.998
6.135LeuVal: 6.135 ± 0.43
3.067LeuTrp: 3.067 ± 0.776
1.917LeuTyr: 1.917 ± 0.918
0.0LeuXaa: 0.0 ± 0.0
Met
2.301MetAla: 2.301 ± 1.144
0.767MetCys: 0.767 ± 0.367
1.917MetAsp: 1.917 ± 0.918
2.301MetGlu: 2.301 ± 1.101
0.767MetPhe: 0.767 ± 0.367
1.15MetGly: 1.15 ± 0.572
1.15MetHis: 1.15 ± 0.551
1.917MetIle: 1.917 ± 0.205
1.534MetLys: 1.534 ± 0.734
1.534MetLeu: 1.534 ± 0.734
1.534MetMet: 1.534 ± 0.388
0.767MetAsn: 0.767 ± 0.367
0.383MetPro: 0.383 ± 0.184
1.15MetGln: 1.15 ± 1.694
3.451MetArg: 3.451 ± 0.593
3.451MetSer: 3.451 ± 1.652
3.067MetThr: 3.067 ± 0.776
1.15MetVal: 1.15 ± 0.572
0.0MetTrp: 0.0 ± 0.0
0.767MetTyr: 0.767 ± 0.755
0.0MetXaa: 0.0 ± 0.0
Asn
2.684AsnAla: 2.684 ± 0.96
0.0AsnCys: 0.0 ± 0.0
0.383AsnAsp: 0.383 ± 0.184
1.15AsnGlu: 1.15 ± 0.551
0.383AsnPhe: 0.383 ± 0.184
0.383AsnGly: 0.383 ± 0.939
1.534AsnHis: 1.534 ± 0.734
3.067AsnIle: 3.067 ± 0.346
0.767AsnLys: 0.767 ± 0.367
4.218AsnLeu: 4.218 ± 0.897
0.383AsnMet: 0.383 ± 0.184
1.15AsnAsn: 1.15 ± 0.572
1.917AsnPro: 1.917 ± 0.205
0.767AsnGln: 0.767 ± 0.755
0.767AsnArg: 0.767 ± 0.755
1.15AsnSer: 1.15 ± 1.694
1.917AsnThr: 1.917 ± 0.205
3.067AsnVal: 3.067 ± 1.468
0.0AsnTrp: 0.0 ± 0.0
0.767AsnTyr: 0.767 ± 0.367
0.0AsnXaa: 0.0 ± 0.0
Pro
3.834ProAla: 3.834 ± 4.899
0.767ProCys: 0.767 ± 0.367
1.534ProAsp: 1.534 ± 0.388
3.451ProGlu: 3.451 ± 0.593
2.301ProPhe: 2.301 ± 0.021
2.301ProGly: 2.301 ± 0.021
2.684ProHis: 2.684 ± 0.162
4.218ProIle: 4.218 ± 0.226
0.767ProLys: 0.767 ± 0.367
2.301ProLeu: 2.301 ± 0.021
1.534ProMet: 1.534 ± 0.734
1.917ProAsn: 1.917 ± 1.327
4.985ProPro: 4.985 ± 0.981
2.684ProGln: 2.684 ± 0.96
1.534ProArg: 1.534 ± 0.388
1.534ProSer: 1.534 ± 0.734
1.534ProThr: 1.534 ± 1.511
3.451ProVal: 3.451 ± 1.652
1.534ProTrp: 1.534 ± 0.388
3.834ProTyr: 3.834 ± 0.713
0.0ProXaa: 0.0 ± 0.0
Gln
4.218GlnAla: 4.218 ± 2.471
0.767GlnCys: 0.767 ± 0.367
3.451GlnAsp: 3.451 ± 1.715
1.534GlnGlu: 1.534 ± 0.734
2.301GlnPhe: 2.301 ± 0.021
0.767GlnGly: 0.767 ± 0.367
1.534GlnHis: 1.534 ± 0.388
3.067GlnIle: 3.067 ± 0.776
1.15GlnLys: 1.15 ± 0.551
2.301GlnLeu: 2.301 ± 1.101
0.383GlnMet: 0.383 ± 0.184
1.15GlnAsn: 1.15 ± 0.551
1.534GlnPro: 1.534 ± 1.511
0.383GlnGln: 0.383 ± 0.184
2.301GlnArg: 2.301 ± 1.101
1.534GlnSer: 1.534 ± 1.511
1.917GlnThr: 1.917 ± 0.205
1.534GlnVal: 1.534 ± 0.734
0.767GlnTrp: 0.767 ± 0.367
0.767GlnTyr: 0.767 ± 0.367
0.0GlnXaa: 0.0 ± 0.0
Arg
6.902ArgAla: 6.902 ± 2.182
1.15ArgCys: 1.15 ± 0.551
3.834ArgAsp: 3.834 ± 1.532
4.601ArgGlu: 4.601 ± 0.042
2.684ArgPhe: 2.684 ± 1.285
5.368ArgGly: 5.368 ± 1.92
2.301ArgHis: 2.301 ± 0.021
3.451ArgIle: 3.451 ± 1.652
4.601ArgLys: 4.601 ± 2.287
5.752ArgLeu: 5.752 ± 1.736
2.684ArgMet: 2.684 ± 0.96
2.301ArgAsn: 2.301 ± 0.021
1.15ArgPro: 1.15 ± 0.551
1.917ArgGln: 1.917 ± 1.327
4.601ArgArg: 4.601 ± 1.08
3.834ArgSer: 3.834 ± 0.409
3.834ArgThr: 3.834 ± 0.409
6.518ArgVal: 6.518 ± 0.876
0.767ArgTrp: 0.767 ± 0.755
3.451ArgTyr: 3.451 ± 0.53
0.0ArgXaa: 0.0 ± 0.0
Ser
6.518SerAla: 6.518 ± 3.121
0.0SerCys: 0.0 ± 0.0
2.684SerAsp: 2.684 ± 0.96
2.684SerGlu: 2.684 ± 1.285
1.917SerPhe: 1.917 ± 2.45
1.534SerGly: 1.534 ± 0.734
0.767SerHis: 0.767 ± 0.755
6.135SerIle: 6.135 ± 1.815
3.067SerLys: 3.067 ± 1.468
4.985SerLeu: 4.985 ± 1.264
1.917SerMet: 1.917 ± 0.918
0.767SerAsn: 0.767 ± 0.367
2.684SerPro: 2.684 ± 2.082
1.534SerGln: 1.534 ± 0.734
3.451SerArg: 3.451 ± 0.53
1.917SerSer: 1.917 ± 0.918
2.301SerThr: 2.301 ± 0.021
4.218SerVal: 4.218 ± 0.897
0.767SerTrp: 0.767 ± 0.755
3.451SerTyr: 3.451 ± 2.838
0.0SerXaa: 0.0 ± 0.0
Thr
5.752ThrAla: 5.752 ± 3.981
1.917ThrCys: 1.917 ± 0.205
3.451ThrAsp: 3.451 ± 0.593
2.301ThrGlu: 2.301 ± 0.021
0.767ThrPhe: 0.767 ± 0.367
4.218ThrGly: 4.218 ± 2.471
3.834ThrHis: 3.834 ± 1.836
5.368ThrIle: 5.368 ± 0.798
1.534ThrLys: 1.534 ± 0.388
7.669ThrLeu: 7.669 ± 0.819
1.917ThrMet: 1.917 ± 0.918
0.767ThrAsn: 0.767 ± 0.755
3.451ThrPro: 3.451 ± 0.593
3.067ThrGln: 3.067 ± 1.468
2.684ThrArg: 2.684 ± 0.162
2.684ThrSer: 2.684 ± 0.162
4.601ThrThr: 4.601 ± 0.042
1.917ThrVal: 1.917 ± 0.918
1.534ThrTrp: 1.534 ± 0.388
3.451ThrTyr: 3.451 ± 0.53
0.0ThrXaa: 0.0 ± 0.0
Val
9.202ValAla: 9.202 ± 0.084
1.534ValCys: 1.534 ± 0.734
2.684ValAsp: 2.684 ± 1.285
8.052ValGlu: 8.052 ± 1.61
1.534ValPhe: 1.534 ± 0.388
4.985ValGly: 4.985 ± 3.226
1.534ValHis: 1.534 ± 0.734
1.917ValIle: 1.917 ± 0.205
1.534ValLys: 1.534 ± 0.734
3.067ValLeu: 3.067 ± 1.468
1.534ValMet: 1.534 ± 1.99
1.917ValAsn: 1.917 ± 0.918
2.684ValPro: 2.684 ± 1.285
2.684ValGln: 2.684 ± 1.285
4.985ValArg: 4.985 ± 2.386
2.301ValSer: 2.301 ± 1.101
5.752ValThr: 5.752 ± 0.509
4.601ValVal: 4.601 ± 0.042
1.917ValTrp: 1.917 ± 0.918
3.451ValTyr: 3.451 ± 1.652
0.0ValXaa: 0.0 ± 0.0
Trp
2.301TrpAla: 2.301 ± 2.266
0.767TrpCys: 0.767 ± 0.367
0.383TrpAsp: 0.383 ± 0.184
1.917TrpGlu: 1.917 ± 0.205
0.383TrpPhe: 0.383 ± 0.184
0.383TrpGly: 0.383 ± 0.184
0.767TrpHis: 0.767 ± 0.367
0.383TrpIle: 0.383 ± 0.184
0.0TrpLys: 0.0 ± 0.0
1.15TrpLeu: 1.15 ± 0.551
0.383TrpMet: 0.383 ± 0.172
0.767TrpAsn: 0.767 ± 0.367
0.383TrpPro: 0.383 ± 0.184
0.767TrpGln: 0.767 ± 0.367
1.15TrpArg: 1.15 ± 0.551
2.301TrpSer: 2.301 ± 1.101
1.917TrpThr: 1.917 ± 1.327
1.917TrpVal: 1.917 ± 1.327
0.767TrpTrp: 0.767 ± 0.367
0.767TrpTyr: 0.767 ± 0.755
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.752TyrAla: 5.752 ± 3.981
0.383TyrCys: 0.383 ± 0.184
3.834TyrAsp: 3.834 ± 1.836
3.067TyrGlu: 3.067 ± 0.346
2.301TyrPhe: 2.301 ± 0.021
2.684TyrGly: 2.684 ± 0.96
1.15TyrHis: 1.15 ± 0.551
3.067TyrIle: 3.067 ± 0.346
3.451TyrLys: 3.451 ± 0.53
3.834TyrLeu: 3.834 ± 1.836
0.383TyrMet: 0.383 ± 0.184
2.301TyrAsn: 2.301 ± 0.021
1.917TyrPro: 1.917 ± 0.918
0.767TyrGln: 0.767 ± 0.367
3.451TyrArg: 3.451 ± 0.53
1.534TyrSer: 1.534 ± 0.388
3.067TyrThr: 3.067 ± 1.468
1.917TyrVal: 1.917 ± 0.205
0.0TyrTrp: 0.0 ± 0.0
1.534TyrTyr: 1.534 ± 0.734
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2609 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski