Amino acid dipeptide frequency for Brugia malayi (Filarial nematode worm)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random and with equal probability, each of the 400 would therefore have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions; for example, 4.998‰ corresponds to 0.004998, or about 0.5%.
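The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this table: the function names (`dipeptide_permille`, `representation`), the use of one-letter codes, the mapping of every non-standard letter to `X`, and the normalization by the total number of overlapping residue pairs are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown letter is treated as X (Xaa).
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        # Pairs overlap within a protein and never span two proteins.
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_permille):
    """Classify a frequency against the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"
```

With the values from the table, `representation(4.998)` (AlaAla) returns `"over"` and `representation(0.535)` (AlaTrp) returns `"under"`. Note that the uniform baseline ignores single amino acid composition; a comparison against the product of the two single-residue frequencies would be a different, stricter test.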

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.998 ± 0.059
AlaCys: 1.137 ± 0.018
AlaAsp: 3.044 ± 0.026
AlaGlu: 4.145 ± 0.04
AlaPhe: 2.468 ± 0.026
AlaGly: 2.893 ± 0.038
AlaHis: 1.361 ± 0.019
AlaIle: 4.072 ± 0.036
AlaLys: 3.58 ± 0.03
AlaLeu: 5.889 ± 0.045
AlaMet: 1.538 ± 0.019
AlaAsn: 2.946 ± 0.027
AlaPro: 2.28 ± 0.029
AlaGln: 2.416 ± 0.029
AlaArg: 2.991 ± 0.032
AlaSer: 4.772 ± 0.038
AlaThr: 3.663 ± 0.03
AlaVal: 4.584 ± 0.045
AlaTrp: 0.535 ± 0.012
AlaTyr: 1.775 ± 0.021
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.256 ± 0.018
CysCys: 0.662 ± 0.015
CysAsp: 1.254 ± 0.026
CysGlu: 1.305 ± 0.022
CysPhe: 0.942 ± 0.016
CysGly: 1.355 ± 0.022
CysHis: 0.567 ± 0.014
CysIle: 1.439 ± 0.025
CysLys: 1.294 ± 0.028
CysLeu: 1.912 ± 0.026
CysMet: 0.472 ± 0.01
CysAsn: 1.116 ± 0.023
CysPro: 1.07 ± 0.026
CysGln: 0.818 ± 0.017
CysArg: 1.296 ± 0.021
CysSer: 1.981 ± 0.03
CysThr: 1.153 ± 0.018
CysVal: 1.205 ± 0.021
CysTrp: 0.247 ± 0.008
CysTyr: 0.703 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.171 ± 0.031
AspCys: 1.099 ± 0.019
AspAsp: 4.317 ± 0.06
AspGlu: 4.789 ± 0.051
AspPhe: 2.333 ± 0.024
AspGly: 3.415 ± 0.041
AspHis: 1.135 ± 0.018
AspIle: 3.615 ± 0.036
AspLys: 2.946 ± 0.036
AspLeu: 4.723 ± 0.042
AspMet: 1.286 ± 0.018
AspAsn: 2.708 ± 0.036
AspPro: 1.966 ± 0.025
AspGln: 1.806 ± 0.023
AspArg: 2.816 ± 0.036
AspSer: 4.161 ± 0.037
AspThr: 2.515 ± 0.023
AspVal: 3.351 ± 0.031
AspTrp: 0.65 ± 0.013
AspTyr: 1.81 ± 0.026
AspXaa: 0.001 ± 0.0
Glu
GluAla: 3.849 ± 0.044
GluCys: 1.344 ± 0.024
GluAsp: 3.288 ± 0.034
GluGlu: 5.713 ± 0.079
GluPhe: 2.172 ± 0.028
GluGly: 2.722 ± 0.03
GluHis: 1.516 ± 0.022
GluIle: 4.479 ± 0.053
GluLys: 5.599 ± 0.086
GluLeu: 6.01 ± 0.06
GluMet: 2.089 ± 0.028
GluAsn: 3.941 ± 0.046
GluPro: 2.126 ± 0.031
GluGln: 3.292 ± 0.038
GluArg: 4.239 ± 0.051
GluSer: 4.579 ± 0.048
GluThr: 3.57 ± 0.047
GluVal: 3.567 ± 0.045
GluTrp: 0.759 ± 0.022
GluTyr: 2.001 ± 0.026
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.596 ± 0.027
PheCys: 1.058 ± 0.018
PheAsp: 2.48 ± 0.029
PheGlu: 2.455 ± 0.027
PhePhe: 1.803 ± 0.026
PheGly: 2.633 ± 0.028
PheHis: 1.049 ± 0.02
PheIle: 2.825 ± 0.029
PheLys: 1.883 ± 0.023
PheLeu: 3.832 ± 0.034
PheMet: 0.98 ± 0.017
PheAsn: 1.826 ± 0.021
PhePro: 1.708 ± 0.025
PheGln: 1.459 ± 0.018
PheArg: 2.097 ± 0.024
PheSer: 3.274 ± 0.03
PheThr: 2.32 ± 0.027
PheVal: 2.585 ± 0.027
PheTrp: 0.498 ± 0.011
PheTyr: 1.409 ± 0.021
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.84 ± 0.038
GlyCys: 1.088 ± 0.019
GlyAsp: 2.726 ± 0.029
GlyGlu: 3.147 ± 0.039
GlyPhe: 2.166 ± 0.029
GlyGly: 3.083 ± 0.051
GlyHis: 1.232 ± 0.02
GlyIle: 3.412 ± 0.036
GlyLys: 3.163 ± 0.038
GlyLeu: 4.653 ± 0.255
GlyMet: 1.237 ± 0.019
GlyAsn: 2.697 ± 0.033
GlyPro: 1.911 ± 0.044
GlyGln: 2.002 ± 0.023
GlyArg: 3.097 ± 0.038
GlySer: 4.267 ± 0.039
GlyThr: 3.066 ± 0.032
GlyVal: 2.82 ± 0.034
GlyTrp: 0.613 ± 0.018
GlyTyr: 1.771 ± 0.027
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.336 ± 0.018
HisCys: 0.669 ± 0.014
HisAsp: 1.108 ± 0.017
HisGlu: 1.397 ± 0.023
HisPhe: 1.216 ± 0.017
HisGly: 1.262 ± 0.022
HisHis: 0.821 ± 0.037
HisIle: 1.541 ± 0.017
HisLys: 1.185 ± 0.019
HisLeu: 2.412 ± 0.026
HisMet: 0.714 ± 0.021
HisAsn: 1.036 ± 0.015
HisPro: 1.159 ± 0.016
HisGln: 1.048 ± 0.018
HisArg: 1.517 ± 0.02
HisSer: 2.09 ± 0.025
HisThr: 1.236 ± 0.026
HisVal: 1.285 ± 0.018
HisTrp: 0.303 ± 0.008
HisTyr: 0.922 ± 0.015
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.359 ± 0.034
IleCys: 1.728 ± 0.027
IleAsp: 3.813 ± 0.031
IleGlu: 3.988 ± 0.05
IlePhe: 2.76 ± 0.029
IleGly: 3.339 ± 0.031
IleHis: 1.589 ± 0.022
IleIle: 4.42 ± 0.068
IleLys: 3.407 ± 0.035
IleLeu: 5.583 ± 0.049
IleMet: 1.428 ± 0.019
IleAsn: 3.245 ± 0.039
IlePro: 3.082 ± 0.031
IleGln: 2.366 ± 0.026
IleArg: 3.613 ± 0.03
IleSer: 5.833 ± 0.045
IleThr: 3.878 ± 0.04
IleVal: 3.805 ± 0.033
IleTrp: 0.743 ± 0.014
IleTyr: 2.04 ± 0.025
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.383 ± 0.032
LysCys: 1.398 ± 0.025
LysAsp: 3.042 ± 0.03
LysGlu: 4.642 ± 0.059
LysPhe: 2.207 ± 0.023
LysGly: 2.613 ± 0.037
LysHis: 1.516 ± 0.021
LysIle: 3.981 ± 0.04
LysLys: 5.258 ± 0.068
LysLeu: 5.856 ± 0.066
LysMet: 1.821 ± 0.023
LysAsn: 3.402 ± 0.035
LysPro: 2.427 ± 0.042
LysGln: 2.949 ± 0.035
LysArg: 4.11 ± 0.035
LysSer: 4.348 ± 0.039
LysThr: 3.295 ± 0.035
LysVal: 3.235 ± 0.031
LysTrp: 0.733 ± 0.014
LysTyr: 2.037 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.599 ± 0.045
LeuCys: 2.074 ± 0.029
LeuAsp: 4.619 ± 0.04
LeuGlu: 5.719 ± 0.069
LeuPhe: 3.859 ± 0.036
LeuGly: 4.197 ± 0.25
LeuHis: 2.42 ± 0.026
LeuIle: 5.765 ± 0.047
LeuLys: 6.036 ± 0.048
LeuLeu: 9.666 ± 0.077
LeuMet: 2.256 ± 0.024
LeuAsn: 4.598 ± 0.044
LeuPro: 4.608 ± 0.048
LeuGln: 4.321 ± 0.05
LeuArg: 5.521 ± 0.045
LeuSer: 7.841 ± 0.054
LeuThr: 4.893 ± 0.035
LeuVal: 4.657 ± 0.036
LeuTrp: 0.966 ± 0.016
LeuTyr: 2.731 ± 0.025
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.524 ± 0.018
MetCys: 0.45 ± 0.01
MetAsp: 1.439 ± 0.021
MetGlu: 1.966 ± 0.026
MetPhe: 0.922 ± 0.015
MetGly: 1.046 ± 0.018
MetHis: 0.646 ± 0.011
MetIle: 1.627 ± 0.022
MetLys: 1.962 ± 0.032
MetLeu: 2.384 ± 0.026
MetMet: 0.78 ± 0.019
MetAsn: 1.4 ± 0.021
MetPro: 1.053 ± 0.017
MetGln: 1.264 ± 0.016
MetArg: 1.374 ± 0.018
MetSer: 1.835 ± 0.024
MetThr: 1.354 ± 0.022
MetVal: 1.4 ± 0.021
MetTrp: 0.213 ± 0.007
MetTyr: 0.647 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.295 ± 0.031
AsnCys: 1.175 ± 0.019
AsnAsp: 3.512 ± 0.058
AsnGlu: 3.97 ± 0.048
AsnPhe: 2.211 ± 0.028
AsnGly: 3.193 ± 0.036
AsnHis: 1.097 ± 0.017
AsnIle: 3.47 ± 0.042
AsnLys: 2.782 ± 0.03
AsnLeu: 4.36 ± 0.041
AsnMet: 1.226 ± 0.018
AsnAsn: 2.871 ± 0.042
AsnPro: 1.866 ± 0.02
AsnGln: 1.821 ± 0.022
AsnArg: 2.627 ± 0.025
AsnSer: 4.184 ± 0.041
AsnThr: 2.494 ± 0.026
AsnVal: 3.238 ± 0.03
AsnTrp: 0.559 ± 0.011
AsnTyr: 1.712 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.333 ± 0.031
ProCys: 0.748 ± 0.017
ProAsp: 2.13 ± 0.027
ProGlu: 2.673 ± 0.031
ProPhe: 1.816 ± 0.029
ProGly: 2.378 ± 0.088
ProHis: 0.937 ± 0.017
ProIle: 2.576 ± 0.034
ProLys: 2.302 ± 0.036
ProLeu: 3.808 ± 0.038
ProMet: 0.919 ± 0.014
ProAsn: 2.005 ± 0.023
ProPro: 3.054 ± 0.058
ProGln: 1.707 ± 0.026
ProArg: 1.959 ± 0.023
ProSer: 4.092 ± 0.054
ProThr: 2.559 ± 0.034
ProVal: 2.875 ± 0.032
ProTrp: 0.403 ± 0.01
ProTyr: 1.473 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.156 ± 0.028
GlnCys: 0.973 ± 0.021
GlnAsp: 1.442 ± 0.02
GlnGlu: 2.415 ± 0.027
GlnPhe: 1.755 ± 0.019
GlnGly: 1.464 ± 0.021
GlnHis: 1.173 ± 0.018
GlnIle: 2.791 ± 0.032
GlnLys: 2.955 ± 0.037
GlnLeu: 4.533 ± 0.055
GlnMet: 1.328 ± 0.018
GlnAsn: 2.296 ± 0.03
GlnPro: 1.746 ± 0.03
GlnGln: 3.756 ± 0.099
GlnArg: 2.623 ± 0.025
GlnSer: 3.117 ± 0.032
GlnThr: 2.264 ± 0.029
GlnVal: 1.932 ± 0.023
GlnTrp: 0.503 ± 0.013
GlnTyr: 1.362 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.92 ± 0.031
ArgCys: 1.334 ± 0.023
ArgAsp: 2.677 ± 0.029
ArgGlu: 3.542 ± 0.041
ArgPhe: 2.297 ± 0.026
ArgGly: 2.539 ± 0.037
ArgHis: 1.464 ± 0.022
ArgIle: 3.824 ± 0.034
ArgLys: 4.278 ± 0.034
ArgLeu: 5.269 ± 0.046
ArgMet: 1.449 ± 0.018
ArgAsn: 3.303 ± 0.042
ArgPro: 2.131 ± 0.025
ArgGln: 2.532 ± 0.029
ArgArg: 4.317 ± 0.05
ArgSer: 4.555 ± 0.049
ArgThr: 3.055 ± 0.03
ArgVal: 2.737 ± 0.028
ArgTrp: 0.651 ± 0.014
ArgTyr: 1.852 ± 0.022
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.135 ± 0.041
SerCys: 1.727 ± 0.029
SerAsp: 4.737 ± 0.04
SerGlu: 4.868 ± 0.054
SerPhe: 3.352 ± 0.03
SerGly: 4.686 ± 0.042
SerHis: 1.81 ± 0.024
SerIle: 4.877 ± 0.035
SerLys: 4.584 ± 0.042
SerLeu: 7.386 ± 0.052
SerMet: 1.915 ± 0.025
SerAsn: 4.33 ± 0.046
SerPro: 3.555 ± 0.051
SerGln: 2.999 ± 0.029
SerArg: 4.356 ± 0.052
SerSer: 9.613 ± 0.14
SerThr: 5.31 ± 0.052
SerVal: 5.169 ± 0.04
SerTrp: 0.792 ± 0.016
SerTyr: 2.269 ± 0.023
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.961 ± 0.035
ThrCys: 1.093 ± 0.019
ThrAsp: 3.068 ± 0.035
ThrGlu: 3.481 ± 0.044
ThrPhe: 2.329 ± 0.023
ThrGly: 3.012 ± 0.035
ThrHis: 1.316 ± 0.025
ThrIle: 3.674 ± 0.04
ThrLys: 3.135 ± 0.031
ThrLeu: 4.827 ± 0.032
ThrMet: 1.263 ± 0.016
ThrAsn: 2.883 ± 0.031
ThrPro: 2.494 ± 0.03
ThrGln: 1.889 ± 0.024
ThrArg: 2.433 ± 0.025
ThrSer: 5.157 ± 0.047
ThrThr: 4.222 ± 0.063
ThrVal: 4.087 ± 0.037
ThrTrp: 0.483 ± 0.011
ThrTyr: 1.585 ± 0.022
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.983 ± 0.032
ValCys: 1.233 ± 0.02
ValAsp: 3.45 ± 0.039
ValGlu: 4.095 ± 0.05
ValPhe: 2.213 ± 0.025
ValGly: 2.847 ± 0.031
ValHis: 1.444 ± 0.017
ValIle: 3.973 ± 0.035
ValLys: 3.424 ± 0.035
ValLeu: 5.287 ± 0.044
ValMet: 1.467 ± 0.02
ValAsn: 2.823 ± 0.032
ValPro: 2.785 ± 0.03
ValGln: 2.429 ± 0.025
ValArg: 3.13 ± 0.028
ValSer: 4.474 ± 0.043
ValThr: 3.438 ± 0.032
ValVal: 3.813 ± 0.042
ValTrp: 0.602 ± 0.012
ValTyr: 1.718 ± 0.021
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.536 ± 0.011
TrpCys: 0.228 ± 0.007
TrpAsp: 0.572 ± 0.012
TrpGlu: 0.585 ± 0.012
TrpPhe: 0.48 ± 0.01
TrpGly: 0.378 ± 0.011
TrpHis: 0.293 ± 0.009
TrpIle: 0.769 ± 0.013
TrpLys: 0.846 ± 0.015
TrpLeu: 1.137 ± 0.021
TrpMet: 0.329 ± 0.015
TrpAsn: 0.699 ± 0.018
TrpPro: 0.409 ± 0.01
TrpGln: 0.437 ± 0.011
TrpArg: 0.666 ± 0.013
TrpSer: 0.867 ± 0.016
TrpThr: 0.613 ± 0.012
TrpVal: 0.457 ± 0.009
TrpTrp: 0.162 ± 0.006
TrpTyr: 0.358 ± 0.01
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.855 ± 0.021
TyrCys: 0.831 ± 0.017
TyrAsp: 1.895 ± 0.029
TyrGlu: 1.958 ± 0.023
TyrPhe: 1.469 ± 0.022
TyrGly: 1.931 ± 0.03
TyrHis: 0.869 ± 0.014
TyrIle: 1.888 ± 0.022
TyrLys: 1.634 ± 0.021
TyrLeu: 2.819 ± 0.029
TyrMet: 0.789 ± 0.015
TyrAsn: 1.542 ± 0.021
TyrPro: 1.318 ± 0.021
TyrGln: 1.274 ± 0.017
TyrArg: 1.942 ± 0.026
TyrSer: 2.431 ± 0.028
TyrThr: 1.494 ± 0.019
TyrVal: 1.827 ± 0.021
TyrTrp: 0.406 ± 0.009
TyrTyr: 1.261 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.0
XaaMet: 0.001 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10206 proteins (4761086 amino acids)

Note: The errors were estimated by bootstrapping (100 replicates) at the protein level.
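As an illustration of what a protein-level bootstrap can look like, the sketch below resamples whole proteins with replacement, recomputes one dipeptide frequency in each replicate, and reports the spread of those values. It is a minimal example under stated assumptions, not the database's actual procedure: the function names, the fixed random seed, and the choice of the standard deviation across replicates as the error are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are the resampling unit, so pairs from one protein
    always enter or leave a replicate together.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(counts[pair] for counts in sample)
        total = sum(sum(counts.values()) for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling at the protein level rather than at the level of individual dipeptides keeps the dependence between neighbouring pairs inside a protein, which tends to give larger and more realistic error estimates for proteomes that contain repetitive sequences.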

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski