Amino acid dipeptide frequency for Drosophila A virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
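The counting described above can be sketched as follows. This is a minimal illustration, not the code used by the database: the function name `dipeptide_frequencies`, the one-letter alphabet, and the choice to count overlapping pairs that never span two proteins are all assumptions, and the database's exact normalisation may differ.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Hypothetical helper: residues outside the 20 standard ones are mapped
    to "X" (Xaa), which gives 21 * 21 = 441 possible keys.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins
        # (an assumption of this sketch).
        for a, b in zip(seq, seq[1:]):
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Two toy sequences (not real viral proteins); "U" is treated as Xaa.
freqs = dipeptide_frequencies(["MAAVK", "MAVU"])
print(len(freqs))  # 441
```

Under a uniform model each of the 400 standard dipeptides would score 1000 / 400 = 2.5‰, which is the baseline the colouring in the table refers to.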

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
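As a small worked example of the unit conversion and of the comparison against the uniform expectation, consider the sketch below. The helper names are hypothetical, and the strict threshold at 2.5‰ is an assumption; the exact rule the site uses for its red and blue marking is not stated here.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides: 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values taken from the table: AlaAla = 4.617, AlaCys = 1.979
print(classify(4.617))  # over
print(classify(1.979))  # under
```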

For more information see sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.617 ± 0.045 | 1.979 ± 0.148 | 1.319 ± 0.488 | 2.639 ± 0.193 | 0.66 ± 0.34 | 3.298 ± 1.701 | 2.639 ± 0.193 | 3.298 ± 0.635 | 5.277 ± 0.385 | 4.617 ± 1.214 | 2.639 ± 0.976 | 3.958 ± 2.632 | 2.639 ± 0.976 | 3.958 ± 0.295 | 4.617 ± 2.382 | 6.596 ± 0.103 | 2.639 ± 0.976 | 7.916 ± 1.759 | 0.66 ± 0.34 | 3.958 ± 0.295 | 0.0 ± 0.0 |
| Cys | 0.66 ± 0.34 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.66 ± 0.34 | 0.0 ± 0.0 | 2.639 ± 0.193 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.298 ± 0.635 | 0.66 ± 0.34 | 0.66 ± 0.828 | 0.66 ± 0.34 | 0.0 ± 0.0 | 0.66 ± 0.34 | 0.0 ± 0.0 | 0.66 ± 0.34 | 0.0 ± 0.0 |
| Asp | 5.277 ± 0.385 | 0.0 ± 0.0 | 2.639 ± 0.193 | 3.298 ± 0.635 | 2.639 ± 0.193 | 4.617 ± 0.045 | 1.319 ± 0.681 | 2.639 ± 0.976 | 2.639 ± 0.193 | 5.277 ± 1.554 | 0.66 ± 0.34 | 0.66 ± 0.34 | 3.298 ± 0.635 | 1.979 ± 0.148 | 3.958 ± 2.042 | 1.979 ± 0.148 | 5.277 ± 1.951 | 1.979 ± 1.021 | 0.66 ± 0.34 | 1.979 ± 0.148 | 0.0 ± 0.0 |
| Glu | 3.958 ± 0.873 | 1.319 ± 0.681 | 3.958 ± 1.464 | 3.298 ± 1.701 | 1.319 ± 0.488 | 4.617 ± 0.045 | 0.66 ± 0.34 | 3.298 ± 0.533 | 2.639 ± 1.361 | 4.617 ± 1.214 | 2.639 ± 1.361 | 0.66 ± 0.34 | 0.66 ± 0.34 | 2.639 ± 0.193 | 0.0 ± 0.0 | 2.639 ± 1.361 | 2.639 ± 0.193 | 5.277 ± 1.554 | 1.979 ± 0.148 | 3.958 ± 0.873 | 0.0 ± 0.0 |
| Phe | 2.639 ± 2.144 | 0.66 ± 0.34 | 0.66 ± 0.34 | 3.298 ± 0.635 | 0.66 ± 0.828 | 0.66 ± 0.828 | 0.66 ± 0.34 | 1.319 ± 0.488 | 1.979 ± 0.148 | 3.298 ± 1.701 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.298 ± 1.701 | 1.979 ± 0.148 | 0.66 ± 0.828 | 5.277 ± 0.385 | 2.639 ± 0.976 | 0.66 ± 0.828 | 1.319 ± 0.681 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gly | 3.958 ± 1.464 | 0.66 ± 0.34 | 3.298 ± 0.533 | 3.958 ± 2.042 | 6.596 ± 0.103 | 5.277 ± 0.385 | 1.319 ± 0.681 | 1.979 ± 1.021 | 2.639 ± 0.976 | 5.277 ± 2.722 | 0.66 ± 0.828 | 2.639 ± 2.144 | 1.319 ± 1.656 | 0.66 ± 0.828 | 2.639 ± 0.193 | 5.277 ± 4.288 | 5.277 ± 0.385 | 9.235 ± 2.427 | 1.319 ± 0.488 | 1.979 ± 1.316 | 0.0 ± 0.0 |
| His | 1.319 ± 0.681 | 0.66 ± 0.34 | 0.66 ± 0.34 | 1.319 ± 0.681 | 0.66 ± 0.34 | 1.979 ± 1.316 | 1.979 ± 0.148 | 1.979 ± 1.021 | 1.979 ± 1.021 | 0.66 ± 0.34 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.979 ± 1.021 | 0.66 ± 0.828 | 1.979 ± 1.021 | 1.319 ± 0.681 | 1.979 ± 1.021 | 1.979 ± 1.021 | 0.66 ± 0.34 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 1.979 ± 1.021 | 1.319 ± 0.488 | 4.617 ± 0.045 | 2.639 ± 0.976 | 0.0 ± 0.0 | 1.319 ± 1.656 | 1.319 ± 0.681 | 3.298 ± 0.533 | 1.979 ± 1.021 | 2.639 ± 1.361 | 1.979 ± 1.021 | 3.958 ± 0.873 | 5.937 ± 0.443 | 1.319 ± 0.488 | 1.979 ± 1.316 | 4.617 ± 2.292 | 5.277 ± 1.554 | 5.937 ± 0.443 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Lys | 2.639 ± 1.361 | 0.66 ± 0.828 | 2.639 ± 0.976 | 3.298 ± 1.701 | 1.979 ± 0.148 | 1.979 ± 0.148 | 1.319 ± 0.681 | 1.979 ± 0.148 | 5.937 ± 1.894 | 5.937 ± 0.443 | 1.319 ± 0.488 | 3.298 ± 0.533 | 1.319 ± 0.681 | 1.319 ± 0.681 | 5.277 ± 1.554 | 3.298 ± 0.533 | 5.937 ± 0.443 | 4.617 ± 0.045 | 1.319 ± 0.488 | 1.979 ± 0.148 | 0.0 ± 0.0 |
| Leu | 3.298 ± 0.533 | 0.66 ± 0.34 | 7.916 ± 4.083 | 3.958 ± 1.464 | 4.617 ± 0.045 | 3.958 ± 0.873 | 1.979 ± 1.021 | 3.958 ± 2.042 | 3.298 ± 0.635 | 4.617 ± 1.214 | 1.979 ± 0.873 | 4.617 ± 0.045 | 5.937 ± 3.062 | 3.298 ± 0.533 | 3.298 ± 1.804 | 3.298 ± 1.804 | 5.277 ± 0.783 | 3.298 ± 0.533 | 0.66 ± 0.34 | 3.958 ± 0.873 | 0.0 ± 0.0 |
| Met | 3.958 ± 0.295 | 0.0 ± 0.0 | 1.979 ± 1.021 | 0.66 ± 0.34 | 1.319 ± 0.488 | 2.639 ± 1.361 | 0.66 ± 0.828 | 0.66 ± 0.34 | 1.319 ± 0.488 | 1.979 ± 1.021 | 0.66 ± 0.34 | 1.319 ± 1.656 | 2.639 ± 0.976 | 1.319 ± 0.681 | 0.66 ± 0.34 | 0.66 ± 0.828 | 0.66 ± 0.34 | 2.639 ± 0.976 | 0.0 ± 0.0 | 0.66 ± 0.34 | 0.0 ± 0.0 |
| Asn | 3.298 ± 0.635 | 0.66 ± 0.828 | 1.979 ± 0.148 | 1.319 ± 0.681 | 1.319 ± 1.656 | 0.0 ± 0.0 | 0.66 ± 0.828 | 1.979 ± 1.316 | 2.639 ± 1.361 | 5.937 ± 0.726 | 0.66 ± 0.34 | 3.298 ± 1.804 | 5.277 ± 1.554 | 1.979 ± 0.148 | 3.298 ± 2.972 | 1.319 ± 1.656 | 1.979 ± 0.148 | 1.979 ± 1.316 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Pro | 2.639 ± 0.193 | 0.0 ± 0.0 | 4.617 ± 2.292 | 5.277 ± 2.722 | 2.639 ± 0.976 | 3.958 ± 0.295 | 3.298 ± 1.701 | 1.979 ± 0.148 | 5.277 ± 1.951 | 2.639 ± 1.361 | 0.66 ± 0.828 | 2.639 ± 1.361 | 8.575 ± 3.255 | 2.639 ± 0.193 | 3.958 ± 0.295 | 3.298 ± 0.533 | 3.958 ± 0.873 | 6.596 ± 1.271 | 0.0 ± 0.0 | 1.979 ± 1.021 | 0.0 ± 0.0 |
| Gln | 2.639 ± 0.976 | 0.0 ± 0.0 | 4.617 ± 0.045 | 1.979 ± 1.021 | 0.66 ± 0.34 | 4.617 ± 1.123 | 0.0 ± 0.0 | 3.298 ± 0.533 | 1.979 ± 1.021 | 1.979 ± 1.316 | 3.298 ± 0.533 | 0.66 ± 0.828 | 1.319 ± 0.488 | 3.298 ± 0.635 | 3.298 ± 0.533 | 1.979 ± 1.316 | 2.639 ± 0.976 | 5.937 ± 1.894 | 0.0 ± 0.0 | 1.319 ± 0.681 | 0.0 ± 0.0 |
| Arg | 1.979 ± 1.316 | 0.0 ± 0.0 | 1.319 ± 0.681 | 4.617 ± 2.382 | 1.319 ± 0.681 | 3.298 ± 0.533 | 1.319 ± 0.681 | 3.958 ± 0.295 | 3.958 ± 0.295 | 3.298 ± 0.533 | 1.979 ± 0.148 | 1.979 ± 1.316 | 3.958 ± 1.464 | 5.277 ± 1.554 | 4.617 ± 2.292 | 1.319 ± 0.681 | 7.916 ± 0.59 | 5.277 ± 0.783 | 1.319 ± 0.488 | 0.66 ± 0.34 | 0.0 ± 0.0 |
| Ser | 5.277 ± 0.783 | 0.66 ± 0.34 | 3.958 ± 0.295 | 3.298 ± 0.635 | 0.66 ± 0.34 | 6.596 ± 2.439 | 1.319 ± 0.681 | 5.937 ± 0.726 | 3.298 ± 0.635 | 2.639 ± 0.976 | 0.66 ± 0.828 | 3.958 ± 0.873 | 2.639 ± 2.144 | 2.639 ± 0.193 | 4.617 ± 1.123 | 6.596 ± 1.066 | 3.298 ± 1.804 | 2.639 ± 0.193 | 1.979 ± 0.148 | 0.66 ± 0.828 | 0.0 ± 0.0 |
| Thr | 6.596 ± 0.103 | 0.0 ± 0.0 | 1.319 ± 0.488 | 3.298 ± 0.533 | 1.979 ± 1.316 | 7.256 ± 2.099 | 0.66 ± 0.34 | 2.639 ± 0.193 | 3.958 ± 1.464 | 8.575 ± 0.25 | 1.979 ± 0.148 | 1.979 ± 2.484 | 3.958 ± 0.295 | 3.958 ± 0.295 | 4.617 ± 1.123 | 1.979 ± 0.148 | 9.894 ± 6.58 | 8.575 ± 1.419 | 1.319 ± 0.681 | 0.66 ± 0.828 | 0.0 ± 0.0 |
| Val | 9.235 ± 2.247 | 1.319 ± 0.681 | 3.298 ± 0.533 | 3.958 ± 0.873 | 1.979 ± 1.021 | 7.916 ± 0.578 | 1.979 ± 1.021 | 4.617 ± 3.46 | 3.958 ± 2.042 | 3.298 ± 0.635 | 1.979 ± 1.734 | 2.639 ± 0.193 | 6.596 ± 2.234 | 5.277 ± 1.951 | 5.937 ± 3.062 | 5.277 ± 0.385 | 4.617 ± 3.46 | 9.235 ± 1.259 | 1.319 ± 0.681 | 3.298 ± 0.533 | 0.0 ± 0.0 |
| Trp | 1.319 ± 0.681 | 0.0 ± 0.0 | 0.66 ± 0.34 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.319 ± 0.681 | 0.66 ± 0.34 | 1.319 ± 0.681 | 1.319 ± 0.681 | 1.319 ± 0.488 | 0.66 ± 0.34 | 0.0 ± 0.0 | 1.979 ± 0.148 | 2.639 ± 0.193 | 0.0 ± 0.0 | 1.319 ± 0.488 | 0.0 ± 0.0 | 0.66 ± 0.828 | 0.0 ± 0.0 |
| Tyr | 2.639 ± 1.361 | 0.0 ± 0.0 | 1.319 ± 0.681 | 0.66 ± 0.34 | 0.66 ± 0.34 | 1.979 ± 1.021 | 0.66 ± 0.34 | 1.319 ± 0.488 | 2.639 ± 1.361 | 2.639 ± 2.144 | 0.66 ± 0.34 | 0.0 ± 0.0 | 1.319 ± 0.681 | 0.66 ± 0.34 | 1.319 ± 0.681 | 3.298 ± 1.804 | 3.298 ± 2.972 | 2.639 ± 0.193 | 0.66 ± 0.34 | 1.319 ± 1.656 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (1517 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
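Protein-level bootstrapping means that whole proteins, not individual residues, are resampled with replacement and the statistic is recomputed for every replicate. A minimal sketch is given below. The function names, the fixed seed, and the use of the population standard deviation as the reported error are assumptions made for illustration; the database's own procedure may differ in these details.

```python
import random
import statistics


def dipeptide_per_mille(dipeptide):
    """Build a statistic: per mille frequency of one dipeptide in a sample."""
    def statistic(sample):
        total = sum(max(len(seq) - 1, 0) for seq in sample)
        hits = sum(
            1
            for seq in sample
            for i in range(len(seq) - 1)
            if seq[i:i + 2] == dipeptide
        )
        return 1000.0 * hits / total if total else 0.0
    return statistic


def bootstrap_error(proteins, statistic, n_rounds=100, seed=0):
    """Spread of `statistic` over resampled sets of whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(statistic(sample))
    return statistics.pstdev(estimates)


# Toy sequences (not the actual viral proteins).
toy_proteins = ["MAAVKAA", "MKVLTTG"]
error = bootstrap_error(toy_proteins, dipeptide_per_mille("AA"))
```

With only 2 proteins, as in this proteome, each replicate can take very few distinct compositions, which is one reason some of the reported errors are large relative to the values themselves.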

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski