Amino acid dipeptide frequency for Plasmodium berghei (strain Anka)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
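As a minimal sketch of how such frequencies could be computed (not necessarily how Proteome-pI does it), the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function names `dipeptide_permille` and `label`, the one-letter input alphabet, and the use of the uniform 2.5‰ value as the over/under threshold are assumptions made for illustration.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PERMILLE = 1000.0 / 400       # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs.

    Pairs are overlapping (positions 0-1, 1-2, ...) and never span two
    proteins. Pairs containing non-standard letters still count towards
    the total but are not reported (assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values()) or 1  # avoid division by zero on empty input
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def label(freq_permille):
    """Classify a frequency against the uniform expectation (assumed threshold)."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKKNNIKL", "MNNKKE"])  # toy sequences
    print(freqs["KK"], label(freqs["KK"]))
```

With real data, `proteins` would be the list of sequences of the whole proteome, for example read from a FASTA file.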

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
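The unit conversion can be illustrated with a short sketch. The helper names below are hypothetical; the example value is the LysLys entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value_permille * 0.1


if __name__ == "__main__":
    lys_lys = 19.518  # LysLys frequency in per mille, taken from the table
    print(f"fraction: {permille_to_fraction(lys_lys):.6f}")
    print(f"percent:  {permille_to_percent(lys_lys):.4f}")
```

In other words, a table value of 19.518‰ corresponds to roughly 1.95% of all dipeptides.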

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.703 ± 0.021
AlaCys: 0.461 ± 0.012
AlaAsp: 1.067 ± 0.02
AlaGlu: 1.366 ± 0.033
AlaPhe: 1.175 ± 0.021
AlaGly: 0.726 ± 0.018
AlaHis: 0.577 ± 0.014
AlaIle: 2.164 ± 0.029
AlaLys: 2.212 ± 0.037
AlaLeu: 2.136 ± 0.034
AlaMet: 0.43 ± 0.01
AlaAsn: 2.334 ± 0.029
AlaPro: 0.609 ± 0.018
AlaGln: 0.777 ± 0.018
AlaArg: 0.577 ± 0.016
AlaSer: 1.841 ± 0.032
AlaThr: 1.051 ± 0.021
AlaVal: 0.827 ± 0.021
AlaTrp: 0.134 ± 0.007
AlaTyr: 1.339 ± 0.022
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.491 ± 0.011
CysCys: 0.293 ± 0.01
CysAsp: 1.037 ± 0.019
CysGlu: 1.167 ± 0.021
CysPhe: 0.94 ± 0.019
CysGly: 0.604 ± 0.013
CysHis: 0.288 ± 0.01
CysIle: 1.818 ± 0.025
CysLys: 1.556 ± 0.025
CysLeu: 1.633 ± 0.027
CysMet: 0.307 ± 0.011
CysAsn: 1.748 ± 0.028
CysPro: 0.434 ± 0.015
CysGln: 0.369 ± 0.011
CysArg: 0.446 ± 0.013
CysSer: 1.415 ± 0.024
CysThr: 0.744 ± 0.014
CysVal: 0.812 ± 0.018
CysTrp: 0.075 ± 0.005
CysTyr: 0.84 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.471 ± 0.023
AspCys: 0.676 ± 0.013
AspAsp: 4.075 ± 0.069
AspGlu: 5.182 ± 0.069
AspPhe: 2.349 ± 0.027
AspGly: 1.786 ± 0.027
AspHis: 0.901 ± 0.02
AspIle: 6.891 ± 0.057
AspLys: 6.297 ± 0.066
AspLeu: 3.845 ± 0.036
AspMet: 1.119 ± 0.02
AspAsn: 6.849 ± 0.065
AspPro: 1.003 ± 0.021
AspGln: 1.484 ± 0.027
AspArg: 1.142 ± 0.022
AspSer: 3.577 ± 0.046
AspThr: 2.413 ± 0.033
AspVal: 2.356 ± 0.039
AspTrp: 0.224 ± 0.01
AspTyr: 2.525 ± 0.031
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.574 ± 0.032
GluCys: 1.012 ± 0.023
GluAsp: 3.662 ± 0.053
GluGlu: 6.352 ± 0.118
GluPhe: 2.319 ± 0.029
GluGly: 1.823 ± 0.028
GluHis: 1.466 ± 0.034
GluIle: 6.884 ± 0.062
GluLys: 10.291 ± 0.081
GluLeu: 5.004 ± 0.056
GluMet: 1.396 ± 0.022
GluAsn: 11.027 ± 0.096
GluPro: 0.93 ± 0.022
GluGln: 2.407 ± 0.06
GluArg: 1.817 ± 0.032
GluSer: 3.923 ± 0.048
GluThr: 2.898 ± 0.046
GluVal: 2.051 ± 0.043
GluTrp: 0.358 ± 0.012
GluTyr: 3.731 ± 0.038
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.956 ± 0.019
PheCys: 0.933 ± 0.017
PheAsp: 2.826 ± 0.03
PheGlu: 3.029 ± 0.039
PhePhe: 3.706 ± 0.051
PheGly: 1.447 ± 0.032
PheHis: 0.979 ± 0.018
PheIle: 4.937 ± 0.049
PheLys: 4.282 ± 0.041
PheLeu: 5.187 ± 0.062
PheMet: 0.954 ± 0.02
PheAsn: 4.782 ± 0.045
PhePro: 1.26 ± 0.019
PheGln: 1.296 ± 0.022
PheArg: 1.171 ± 0.018
PheSer: 3.958 ± 0.038
PheThr: 1.795 ± 0.026
PheVal: 2.049 ± 0.029
PheTrp: 0.246 ± 0.009
PheTyr: 3.18 ± 0.04
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.882 ± 0.02
GlyCys: 0.526 ± 0.013
GlyAsp: 1.676 ± 0.026
GlyGlu: 1.979 ± 0.027
GlyPhe: 1.321 ± 0.023
GlyGly: 1.386 ± 0.033
GlyHis: 0.573 ± 0.015
GlyIle: 3.05 ± 0.035
GlyLys: 3.4 ± 0.034
GlyLeu: 2.013 ± 0.033
GlyMet: 0.625 ± 0.019
GlyAsn: 3.931 ± 0.049
GlyPro: 0.484 ± 0.015
GlyGln: 0.721 ± 0.024
GlyArg: 0.871 ± 0.022
GlySer: 2.124 ± 0.03
GlyThr: 1.42 ± 0.057
GlyVal: 1.334 ± 0.024
GlyTrp: 0.177 ± 0.008
GlyTyr: 1.579 ± 0.026
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.509 ± 0.015
HisCys: 0.281 ± 0.01
HisAsp: 1.031 ± 0.018
HisGlu: 1.241 ± 0.022
HisPhe: 1.143 ± 0.02
HisGly: 0.612 ± 0.015
HisHis: 0.39 ± 0.011
HisIle: 2.461 ± 0.032
HisLys: 1.921 ± 0.028
HisLeu: 1.651 ± 0.024
HisMet: 0.413 ± 0.013
HisAsn: 2.169 ± 0.033
HisPro: 0.507 ± 0.013
HisGln: 0.479 ± 0.011
HisArg: 0.497 ± 0.012
HisSer: 1.327 ± 0.026
HisThr: 0.935 ± 0.016
HisVal: 0.826 ± 0.018
HisTrp: 0.087 ± 0.005
HisTyr: 0.882 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.046 ± 0.03
IleCys: 2.072 ± 0.028
IleAsp: 5.787 ± 0.047
IleGlu: 6.574 ± 0.06
IlePhe: 5.71 ± 0.065
IleGly: 2.767 ± 0.041
IleHis: 2.152 ± 0.025
IleIle: 9.49 ± 0.079
IleLys: 11.813 ± 0.084
IleLeu: 9.105 ± 0.082
IleMet: 1.581 ± 0.02
IleAsn: 13.278 ± 0.093
IlePro: 2.667 ± 0.035
IleGln: 2.988 ± 0.035
IleArg: 2.409 ± 0.028
IleSer: 7.538 ± 0.051
IleThr: 3.736 ± 0.035
IleVal: 3.309 ± 0.039
IleTrp: 0.536 ± 0.015
IleTyr: 6.469 ± 0.064
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.982 ± 0.034
LysCys: 1.953 ± 0.025
LysAsp: 5.741 ± 0.045
LysGlu: 8.963 ± 0.082
LysPhe: 3.984 ± 0.034
LysGly: 3.324 ± 0.037
LysHis: 2.302 ± 0.025
LysIle: 11.728 ± 0.088
LysLys: 19.518 ± 0.155
LysLeu: 8.311 ± 0.062
LysMet: 2.28 ± 0.031
LysAsn: 17.749 ± 0.113
LysPro: 1.693 ± 0.036
LysGln: 3.279 ± 0.043
LysArg: 3.791 ± 0.045
LysSer: 6.67 ± 0.061
LysThr: 4.972 ± 0.051
LysVal: 3.165 ± 0.039
LysTrp: 0.659 ± 0.015
LysTyr: 7.201 ± 0.052
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.898 ± 0.029
LeuCys: 1.737 ± 0.025
LeuAsp: 3.699 ± 0.045
LeuGlu: 4.803 ± 0.052
LeuPhe: 4.878 ± 0.058
LeuGly: 2.253 ± 0.033
LeuHis: 1.566 ± 0.019
LeuIle: 7.012 ± 0.068
LeuLys: 10.028 ± 0.078
LeuLeu: 7.504 ± 0.063
LeuMet: 1.312 ± 0.019
LeuAsn: 9.549 ± 0.064
LeuPro: 1.945 ± 0.027
LeuGln: 2.183 ± 0.035
LeuArg: 2.293 ± 0.035
LeuSer: 6.124 ± 0.049
LeuThr: 3.054 ± 0.038
LeuVal: 2.589 ± 0.038
LeuTrp: 0.484 ± 0.012
LeuTyr: 4.845 ± 0.049
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.491 ± 0.014
MetCys: 0.399 ± 0.013
MetAsp: 1.173 ± 0.018
MetGlu: 1.303 ± 0.022
MetPhe: 0.901 ± 0.017
MetGly: 0.667 ± 0.018
MetHis: 0.58 ± 0.015
MetIle: 1.423 ± 0.023
MetLys: 2.247 ± 0.028
MetLeu: 1.545 ± 0.025
MetMet: 0.352 ± 0.011
MetAsn: 2.665 ± 0.04
MetPro: 0.521 ± 0.016
MetGln: 0.551 ± 0.014
MetArg: 0.519 ± 0.013
MetSer: 1.276 ± 0.02
MetThr: 0.619 ± 0.012
MetVal: 0.567 ± 0.013
MetTrp: 0.121 ± 0.007
MetTyr: 0.915 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.759 ± 0.032
AsnCys: 1.881 ± 0.03
AsnAsp: 8.454 ± 0.074
AsnGlu: 10.157 ± 0.082
AsnPhe: 5.861 ± 0.051
AsnGly: 3.744 ± 0.044
AsnHis: 1.892 ± 0.028
AsnIle: 15.427 ± 0.099
AsnLys: 15.11 ± 0.104
AsnLeu: 8.683 ± 0.059
AsnMet: 2.871 ± 0.047
AsnAsn: 20.429 ± 0.295
AsnPro: 2.177 ± 0.037
AsnGln: 2.849 ± 0.036
AsnArg: 2.745 ± 0.035
AsnSer: 9.428 ± 0.087
AsnThr: 5.882 ± 0.061
AsnVal: 5.041 ± 0.045
AsnTrp: 0.43 ± 0.012
AsnTyr: 6.976 ± 0.064
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.424 ± 0.018
ProCys: 0.346 ± 0.011
ProAsp: 0.99 ± 0.021
ProGlu: 1.235 ± 0.03
ProPhe: 1.309 ± 0.024
ProGly: 0.577 ± 0.016
ProHis: 0.503 ± 0.013
ProIle: 2.167 ± 0.029
ProLys: 1.896 ± 0.03
ProLeu: 1.893 ± 0.038
ProMet: 0.334 ± 0.01
ProAsn: 2.429 ± 0.04
ProPro: 0.748 ± 0.026
ProGln: 0.759 ± 0.023
ProArg: 0.495 ± 0.012
ProSer: 1.686 ± 0.036
ProThr: 0.946 ± 0.024
ProVal: 0.785 ± 0.021
ProTrp: 0.138 ± 0.006
ProTyr: 1.203 ± 0.02
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.631 ± 0.025
GlnCys: 0.389 ± 0.011
GlnAsp: 1.123 ± 0.027
GlnGlu: 1.63 ± 0.032
GlnPhe: 1.159 ± 0.019
GlnGly: 0.768 ± 0.053
GlnHis: 0.492 ± 0.012
GlnIle: 3.12 ± 0.035
GlnLys: 3.394 ± 0.038
GlnLeu: 2.04 ± 0.026
GlnMet: 0.568 ± 0.017
GlnAsn: 4.578 ± 0.054
GlnPro: 0.502 ± 0.023
GlnGln: 0.749 ± 0.035
GlnArg: 0.747 ± 0.024
GlnSer: 1.627 ± 0.026
GlnThr: 1.446 ± 0.025
GlnVal: 0.911 ± 0.032
GlnTrp: 0.151 ± 0.006
GlnTyr: 1.267 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.693 ± 0.017
ArgCys: 0.428 ± 0.011
ArgAsp: 1.339 ± 0.023
ArgGlu: 1.777 ± 0.026
ArgPhe: 0.974 ± 0.016
ArgGly: 0.973 ± 0.021
ArgHis: 0.54 ± 0.013
ArgIle: 2.35 ± 0.033
ArgLys: 3.665 ± 0.042
ArgLeu: 1.765 ± 0.029
ArgMet: 0.474 ± 0.012
ArgAsn: 3.37 ± 0.04
ArgPro: 0.447 ± 0.014
ArgGln: 0.684 ± 0.014
ArgArg: 1.092 ± 0.027
ArgSer: 1.646 ± 0.027
ArgThr: 1.026 ± 0.02
ArgVal: 0.918 ± 0.022
ArgTrp: 0.153 ± 0.007
ArgTyr: 1.257 ± 0.021
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.582 ± 0.024
SerCys: 1.128 ± 0.02
SerAsp: 4.2 ± 0.044
SerGlu: 4.908 ± 0.049
SerPhe: 3.778 ± 0.035
SerGly: 2.257 ± 0.032
SerHis: 1.338 ± 0.02
SerIle: 7.033 ± 0.057
SerLys: 6.987 ± 0.055
SerLeu: 5.785 ± 0.05
SerMet: 1.293 ± 0.022
SerAsn: 8.757 ± 0.093
SerPro: 1.493 ± 0.027
SerGln: 2.034 ± 0.036
SerArg: 1.714 ± 0.03
SerSer: 6.129 ± 0.069
SerThr: 3.164 ± 0.039
SerVal: 2.598 ± 0.029
SerTrp: 0.306 ± 0.009
SerTyr: 3.896 ± 0.04
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.907 ± 0.022
ThrCys: 0.76 ± 0.016
ThrAsp: 2.251 ± 0.031
ThrGlu: 2.656 ± 0.054
ThrPhe: 2.091 ± 0.026
ThrGly: 1.293 ± 0.026
ThrHis: 1.013 ± 0.021
ThrIle: 3.915 ± 0.042
ThrLys: 4.61 ± 0.047
ThrLeu: 3.284 ± 0.042
ThrMet: 0.634 ± 0.015
ThrAsn: 5.808 ± 0.054
ThrPro: 1.272 ± 0.027
ThrGln: 1.361 ± 0.022
ThrArg: 0.953 ± 0.017
ThrSer: 3.304 ± 0.038
ThrThr: 1.989 ± 0.032
ThrVal: 1.436 ± 0.024
ThrTrp: 0.216 ± 0.008
ThrTyr: 2.275 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.999 ± 0.02
ValCys: 0.777 ± 0.018
ValAsp: 2.243 ± 0.031
ValGlu: 2.546 ± 0.056
ValPhe: 1.756 ± 0.023
ValGly: 1.211 ± 0.022
ValHis: 0.819 ± 0.017
ValIle: 3.182 ± 0.038
ValLys: 3.745 ± 0.039
ValLeu: 3.229 ± 0.043
ValMet: 0.595 ± 0.013
ValAsn: 3.912 ± 0.034
ValPro: 0.934 ± 0.024
ValGln: 1.035 ± 0.02
ValArg: 0.865 ± 0.018
ValSer: 2.495 ± 0.027
ValThr: 1.429 ± 0.025
ValVal: 1.455 ± 0.028
ValTrp: 0.195 ± 0.008
ValTyr: 1.969 ± 0.027
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.151 ± 0.007
TrpCys: 0.099 ± 0.006
TrpAsp: 0.291 ± 0.01
TrpGlu: 0.327 ± 0.01
TrpPhe: 0.221 ± 0.009
TrpGly: 0.263 ± 0.01
TrpHis: 0.079 ± 0.005
TrpIle: 0.496 ± 0.011
TrpLys: 0.6 ± 0.014
TrpLeu: 0.445 ± 0.012
TrpMet: 0.11 ± 0.006
TrpAsn: 0.563 ± 0.014
TrpPro: 0.104 ± 0.005
TrpGln: 0.076 ± 0.005
TrpArg: 0.188 ± 0.008
TrpSer: 0.312 ± 0.011
TrpThr: 0.163 ± 0.008
TrpVal: 0.237 ± 0.01
TrpTrp: 0.045 ± 0.004
TrpTyr: 0.197 ± 0.008
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.358 ± 0.022
TyrCys: 0.865 ± 0.018
TyrAsp: 3.499 ± 0.041
TyrGlu: 3.729 ± 0.04
TyrPhe: 3.4 ± 0.041
TyrGly: 1.592 ± 0.023
TyrHis: 0.911 ± 0.016
TyrIle: 6.481 ± 0.069
TyrLys: 5.691 ± 0.051
TyrLeu: 4.699 ± 0.042
TyrMet: 1.187 ± 0.02
TyrAsn: 6.869 ± 0.058
TyrPro: 1.132 ± 0.019
TyrGln: 1.209 ± 0.02
TyrArg: 1.238 ± 0.02
TyrSer: 3.977 ± 0.031
TyrThr: 2.348 ± 0.026
TyrVal: 2.124 ± 0.025
TyrTrp: 0.234 ± 0.009
TyrTyr: 3.26 ± 0.043
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4928 proteins (3405970 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
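A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the spread is summarised as a standard deviation. The function name `bootstrap_permille`, the choice of standard deviation as the reported error, and the fixed seed are all hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each resample draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    # Pre-compute per-protein (hits, total pairs) so resampling stays cheap.
    stats = []
    for seq in proteins:
        counts = pair_counts(seq)
        stats.append((counts[dipeptide], sum(counts.values())))

    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKNNIKL", "MNNKKE", "MKNIKNNK"]  # toy sequences, not real data
    print(bootstrap_permille(toy, "KK"))
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the error is estimated at the protein level.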

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski