Amino acid dipepetide frequency for Niastella vici

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.207AlaAla: 7.207 ± 0.075
0.808AlaCys: 0.808 ± 0.018
4.026AlaAsp: 4.026 ± 0.037
3.85AlaGlu: 3.85 ± 0.043
3.506AlaPhe: 3.506 ± 0.029
6.465AlaGly: 6.465 ± 0.054
1.282AlaHis: 1.282 ± 0.023
5.426AlaIle: 5.426 ± 0.05
3.962AlaLys: 3.962 ± 0.049
6.815AlaLeu: 6.815 ± 0.063
1.672AlaMet: 1.672 ± 0.027
3.973AlaAsn: 3.973 ± 0.046
2.717AlaPro: 2.717 ± 0.038
2.994AlaGln: 2.994 ± 0.038
2.927AlaArg: 2.927 ± 0.037
4.73AlaSer: 4.73 ± 0.049
4.979AlaThr: 4.979 ± 0.062
5.074AlaVal: 5.074 ± 0.045
1.039AlaTrp: 1.039 ± 0.02
2.894AlaTyr: 2.894 ± 0.03
0.0AlaXaa: 0.0 ± 0.0
Cys
0.585CysAla: 0.585 ± 0.015
0.141CysCys: 0.141 ± 0.006
0.42CysAsp: 0.42 ± 0.013
0.389CysGlu: 0.389 ± 0.013
0.486CysPhe: 0.486 ± 0.014
0.663CysGly: 0.663 ± 0.017
0.199CysHis: 0.199 ± 0.009
0.725CysIle: 0.725 ± 0.017
0.561CysLys: 0.561 ± 0.014
0.865CysLeu: 0.865 ± 0.017
0.235CysMet: 0.235 ± 0.01
0.513CysAsn: 0.513 ± 0.014
0.374CysPro: 0.374 ± 0.012
0.286CysGln: 0.286 ± 0.01
0.415CysArg: 0.415 ± 0.013
0.611CysSer: 0.611 ± 0.015
0.58CysThr: 0.58 ± 0.014
0.486CysVal: 0.486 ± 0.014
0.138CysTrp: 0.138 ± 0.008
0.375CysTyr: 0.375 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
4.006AspAla: 4.006 ± 0.039
0.422AspCys: 0.422 ± 0.013
2.346AspAsp: 2.346 ± 0.033
2.86AspGlu: 2.86 ± 0.036
2.556AspPhe: 2.556 ± 0.033
3.846AspGly: 3.846 ± 0.058
1.014AspHis: 1.014 ± 0.019
3.579AspIle: 3.579 ± 0.037
3.481AspLys: 3.481 ± 0.039
4.54AspLeu: 4.54 ± 0.039
1.074AspMet: 1.074 ± 0.019
2.818AspAsn: 2.818 ± 0.034
2.171AspPro: 2.171 ± 0.032
1.635AspGln: 1.635 ± 0.026
2.104AspArg: 2.104 ± 0.029
3.036AspSer: 3.036 ± 0.048
2.682AspThr: 2.682 ± 0.033
3.232AspVal: 3.232 ± 0.037
0.805AspTrp: 0.805 ± 0.017
2.407AspTyr: 2.407 ± 0.033
0.0AspXaa: 0.0 ± 0.0
Glu
3.979GluAla: 3.979 ± 0.045
0.365GluCys: 0.365 ± 0.011
2.289GluAsp: 2.289 ± 0.034
3.293GluGlu: 3.293 ± 0.046
2.174GluPhe: 2.174 ± 0.028
3.113GluGly: 3.113 ± 0.034
1.011GluHis: 1.011 ± 0.021
3.643GluIle: 3.643 ± 0.045
4.631GluLys: 4.631 ± 0.053
5.268GluLeu: 5.268 ± 0.055
1.398GluMet: 1.398 ± 0.022
2.948GluAsn: 2.948 ± 0.036
1.704GluPro: 1.704 ± 0.023
2.471GluGln: 2.471 ± 0.035
2.329GluArg: 2.329 ± 0.034
2.53GluSer: 2.53 ± 0.029
2.748GluThr: 2.748 ± 0.031
3.428GluVal: 3.428 ± 0.047
0.727GluTrp: 0.727 ± 0.016
2.057GluTyr: 2.057 ± 0.029
0.0GluXaa: 0.0 ± 0.0
Phe
3.148PheAla: 3.148 ± 0.034
0.499PheCys: 0.499 ± 0.013
2.672PheAsp: 2.672 ± 0.031
2.435PheGlu: 2.435 ± 0.03
2.412PhePhe: 2.412 ± 0.035
3.155PheGly: 3.155 ± 0.038
0.851PheHis: 0.851 ± 0.017
3.399PheIle: 3.399 ± 0.042
2.715PheLys: 2.715 ± 0.03
4.105PheLeu: 4.105 ± 0.043
1.095PheMet: 1.095 ± 0.019
2.993PheAsn: 2.993 ± 0.028
1.825PhePro: 1.825 ± 0.027
1.492PheGln: 1.492 ± 0.026
2.087PheArg: 2.087 ± 0.028
3.532PheSer: 3.532 ± 0.039
3.545PheThr: 3.545 ± 0.043
2.742PheVal: 2.742 ± 0.034
0.619PheTrp: 0.619 ± 0.016
2.088PheTyr: 2.088 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
4.86GlyAla: 4.86 ± 0.051
0.773GlyCys: 0.773 ± 0.022
3.402GlyAsp: 3.402 ± 0.039
3.266GlyGlu: 3.266 ± 0.037
3.447GlyPhe: 3.447 ± 0.032
5.087GlyGly: 5.087 ± 0.069
1.242GlyHis: 1.242 ± 0.023
5.25GlyIle: 5.25 ± 0.049
5.102GlyLys: 5.102 ± 0.048
5.935GlyLeu: 5.935 ± 0.044
1.73GlyMet: 1.73 ± 0.025
4.35GlyAsn: 4.35 ± 0.057
1.719GlyPro: 1.719 ± 0.03
2.338GlyGln: 2.338 ± 0.033
2.825GlyArg: 2.825 ± 0.037
4.718GlySer: 4.718 ± 0.051
4.808GlyThr: 4.808 ± 0.074
4.483GlyVal: 4.483 ± 0.045
1.135GlyTrp: 1.135 ± 0.018
3.339GlyTyr: 3.339 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
1.27HisAla: 1.27 ± 0.021
0.224HisCys: 0.224 ± 0.008
0.922HisAsp: 0.922 ± 0.02
0.977HisGlu: 0.977 ± 0.021
1.213HisPhe: 1.213 ± 0.02
1.151HisGly: 1.151 ± 0.02
0.536HisHis: 0.536 ± 0.014
1.379HisIle: 1.379 ± 0.025
0.988HisLys: 0.988 ± 0.02
1.843HisLeu: 1.843 ± 0.03
0.408HisMet: 0.408 ± 0.013
0.974HisAsn: 0.974 ± 0.021
0.994HisPro: 0.994 ± 0.019
0.744HisGln: 0.744 ± 0.017
0.825HisArg: 0.825 ± 0.019
1.088HisSer: 1.088 ± 0.021
1.15HisThr: 1.15 ± 0.021
1.035HisVal: 1.035 ± 0.021
0.286HisTrp: 0.286 ± 0.01
0.927HisTyr: 0.927 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
5.452IleAla: 5.452 ± 0.046
0.693IleCys: 0.693 ± 0.018
4.006IleAsp: 4.006 ± 0.041
3.783IleGlu: 3.783 ± 0.04
2.748IlePhe: 2.748 ± 0.035
4.442IleGly: 4.442 ± 0.043
1.285IleHis: 1.285 ± 0.027
4.702IleIle: 4.702 ± 0.052
4.407IleLys: 4.407 ± 0.042
5.392IleLeu: 5.392 ± 0.048
1.338IleMet: 1.338 ± 0.023
4.054IleAsn: 4.054 ± 0.041
3.123IlePro: 3.123 ± 0.035
2.478IleGln: 2.478 ± 0.028
3.141IleArg: 3.141 ± 0.031
4.717IleSer: 4.717 ± 0.037
5.048IleThr: 5.048 ± 0.039
4.195IleVal: 4.195 ± 0.036
0.785IleTrp: 0.785 ± 0.015
2.529IleTyr: 2.529 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
4.866LysAla: 4.866 ± 0.052
0.365LysCys: 0.365 ± 0.011
3.728LysAsp: 3.728 ± 0.042
4.46LysGlu: 4.46 ± 0.047
2.206LysPhe: 2.206 ± 0.029
4.335LysGly: 4.335 ± 0.047
1.103LysHis: 1.103 ± 0.017
4.208LysIle: 4.208 ± 0.044
5.502LysLys: 5.502 ± 0.056
5.655LysLeu: 5.655 ± 0.057
1.797LysMet: 1.797 ± 0.026
3.79LysAsn: 3.79 ± 0.044
2.687LysPro: 2.687 ± 0.035
2.872LysGln: 2.872 ± 0.031
2.645LysArg: 2.645 ± 0.034
3.218LysSer: 3.218 ± 0.036
3.971LysThr: 3.971 ± 0.04
4.028LysVal: 4.028 ± 0.044
0.875LysTrp: 0.875 ± 0.019
2.732LysTyr: 2.732 ± 0.034
0.0LysXaa: 0.0 ± 0.0
Leu
6.625LeuAla: 6.625 ± 0.059
0.874LeuCys: 0.874 ± 0.019
4.114LeuAsp: 4.114 ± 0.033
4.355LeuGlu: 4.355 ± 0.046
4.799LeuPhe: 4.799 ± 0.047
5.258LeuGly: 5.258 ± 0.049
1.941LeuHis: 1.941 ± 0.03
5.857LeuIle: 5.857 ± 0.056
6.159LeuLys: 6.159 ± 0.052
9.73LeuLeu: 9.73 ± 0.083
2.041LeuMet: 2.041 ± 0.026
5.316LeuAsn: 5.316 ± 0.048
4.444LeuPro: 4.444 ± 0.044
4.624LeuGln: 4.624 ± 0.052
3.743LeuArg: 3.743 ± 0.037
6.35LeuSer: 6.35 ± 0.055
5.632LeuThr: 5.632 ± 0.043
5.624LeuVal: 5.624 ± 0.045
1.036LeuTrp: 1.036 ± 0.022
3.693LeuTyr: 3.693 ± 0.037
0.0LeuXaa: 0.0 ± 0.0
Met
1.858MetAla: 1.858 ± 0.025
0.148MetCys: 0.148 ± 0.008
1.096MetAsp: 1.096 ± 0.02
1.308MetGlu: 1.308 ± 0.026
0.716MetPhe: 0.716 ± 0.016
1.495MetGly: 1.495 ± 0.025
0.495MetHis: 0.495 ± 0.014
1.376MetIle: 1.376 ± 0.019
1.926MetLys: 1.926 ± 0.025
2.119MetLeu: 2.119 ± 0.031
0.565MetMet: 0.565 ± 0.015
1.362MetAsn: 1.362 ± 0.026
1.153MetPro: 1.153 ± 0.02
1.139MetGln: 1.139 ± 0.023
1.07MetArg: 1.07 ± 0.021
1.237MetSer: 1.237 ± 0.02
1.094MetThr: 1.094 ± 0.02
1.512MetVal: 1.512 ± 0.024
0.239MetTrp: 0.239 ± 0.01
0.736MetTyr: 0.736 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
4.316AsnAla: 4.316 ± 0.05
0.514AsnCys: 0.514 ± 0.015
3.047AsnAsp: 3.047 ± 0.033
3.006AsnGlu: 3.006 ± 0.034
2.471AsnPhe: 2.471 ± 0.034
4.748AsnGly: 4.748 ± 0.06
0.923AsnHis: 0.923 ± 0.018
4.052AsnIle: 4.052 ± 0.039
3.67AsnLys: 3.67 ± 0.037
4.594AsnLeu: 4.594 ± 0.044
1.225AsnMet: 1.225 ± 0.02
3.842AsnAsn: 3.842 ± 0.053
2.709AsnPro: 2.709 ± 0.032
2.002AsnGln: 2.002 ± 0.027
2.605AsnArg: 2.605 ± 0.031
3.508AsnSer: 3.508 ± 0.04
3.751AsnThr: 3.751 ± 0.045
3.266AsnVal: 3.266 ± 0.039
0.889AsnTrp: 0.889 ± 0.022
2.718AsnTyr: 2.718 ± 0.039
0.0AsnXaa: 0.0 ± 0.0
Pro
4.08ProAla: 4.08 ± 0.046
0.263ProCys: 0.263 ± 0.01
2.59ProAsp: 2.59 ± 0.032
2.504ProGlu: 2.504 ± 0.032
2.05ProPhe: 2.05 ± 0.027
3.435ProGly: 3.435 ± 0.035
0.736ProHis: 0.736 ± 0.017
2.084ProIle: 2.084 ± 0.028
1.863ProLys: 1.863 ± 0.028
3.583ProLeu: 3.583 ± 0.039
0.799ProMet: 0.799 ± 0.017
2.022ProAsn: 2.022 ± 0.025
1.394ProPro: 1.394 ± 0.024
1.472ProGln: 1.472 ± 0.024
1.285ProArg: 1.285 ± 0.023
2.203ProSer: 2.203 ± 0.028
2.059ProThr: 2.059 ± 0.032
4.125ProVal: 4.125 ± 0.04
0.512ProTrp: 0.512 ± 0.013
1.691ProTyr: 1.691 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
2.824GlnAla: 2.824 ± 0.034
0.286GlnCys: 0.286 ± 0.01
1.493GlnAsp: 1.493 ± 0.025
2.021GlnGlu: 2.021 ± 0.03
1.88GlnPhe: 1.88 ± 0.029
2.142GlnGly: 2.142 ± 0.034
0.953GlnHis: 0.953 ± 0.019
2.203GlnIle: 2.203 ± 0.027
2.642GlnLys: 2.642 ± 0.032
4.547GlnLeu: 4.547 ± 0.047
0.971GlnMet: 0.971 ± 0.015
2.009GlnAsn: 2.009 ± 0.026
1.879GlnPro: 1.879 ± 0.026
2.709GlnGln: 2.709 ± 0.044
1.742GlnArg: 1.742 ± 0.025
2.315GlnSer: 2.315 ± 0.03
2.334GlnThr: 2.334 ± 0.031
2.693GlnVal: 2.693 ± 0.034
0.672GlnTrp: 0.672 ± 0.021
1.834GlnTyr: 1.834 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
2.491ArgAla: 2.491 ± 0.03
0.301ArgCys: 0.301 ± 0.011
1.982ArgAsp: 1.982 ± 0.025
2.343ArgGlu: 2.343 ± 0.034
2.323ArgPhe: 2.323 ± 0.031
2.198ArgGly: 2.198 ± 0.027
0.83ArgHis: 0.83 ± 0.018
3.179ArgIle: 3.179 ± 0.035
3.013ArgLys: 3.013 ± 0.034
4.213ArgLeu: 4.213 ± 0.044
1.126ArgMet: 1.126 ± 0.021
2.53ArgAsn: 2.53 ± 0.034
1.497ArgPro: 1.497 ± 0.025
1.759ArgGln: 1.759 ± 0.03
1.825ArgArg: 1.825 ± 0.027
2.573ArgSer: 2.573 ± 0.03
2.352ArgThr: 2.352 ± 0.029
2.452ArgVal: 2.452 ± 0.026
0.681ArgTrp: 0.681 ± 0.016
2.025ArgTyr: 2.025 ± 0.027
0.0ArgXaa: 0.0 ± 0.0
Ser
4.729SerAla: 4.729 ± 0.044
0.665SerCys: 0.665 ± 0.016
2.867SerAsp: 2.867 ± 0.039
2.601SerGlu: 2.601 ± 0.032
3.59SerPhe: 3.59 ± 0.039
5.112SerGly: 5.112 ± 0.061
1.094SerHis: 1.094 ± 0.021
4.526SerIle: 4.526 ± 0.043
3.46SerLys: 3.46 ± 0.038
6.066SerLeu: 6.066 ± 0.052
1.368SerMet: 1.368 ± 0.024
3.459SerAsn: 3.459 ± 0.046
2.439SerPro: 2.439 ± 0.032
2.106SerGln: 2.106 ± 0.025
2.535SerArg: 2.535 ± 0.033
4.008SerSer: 4.008 ± 0.049
3.903SerThr: 3.903 ± 0.054
4.078SerVal: 4.078 ± 0.037
0.954SerTrp: 0.954 ± 0.018
2.813SerTyr: 2.813 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
5.471ThrAla: 5.471 ± 0.065
0.511ThrCys: 0.511 ± 0.013
3.419ThrAsp: 3.419 ± 0.039
2.788ThrGlu: 2.788 ± 0.03
2.74ThrPhe: 2.74 ± 0.035
5.872ThrGly: 5.872 ± 0.068
1.075ThrHis: 1.075 ± 0.021
4.774ThrIle: 4.774 ± 0.039
2.996ThrLys: 2.996 ± 0.031
5.494ThrLeu: 5.494 ± 0.046
1.117ThrMet: 1.117 ± 0.023
3.355ThrAsn: 3.355 ± 0.044
2.993ThrPro: 2.993 ± 0.041
1.996ThrGln: 1.996 ± 0.027
2.318ThrArg: 2.318 ± 0.03
3.899ThrSer: 3.899 ± 0.052
4.419ThrThr: 4.419 ± 0.071
4.541ThrVal: 4.541 ± 0.05
0.934ThrTrp: 0.934 ± 0.021
2.548ThrTyr: 2.548 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
4.797ValAla: 4.797 ± 0.046
0.635ValCys: 0.635 ± 0.017
3.095ValAsp: 3.095 ± 0.036
3.108ValGlu: 3.108 ± 0.037
3.139ValPhe: 3.139 ± 0.038
3.473ValGly: 3.473 ± 0.043
1.206ValHis: 1.206 ± 0.022
4.567ValIle: 4.567 ± 0.044
4.298ValLys: 4.298 ± 0.038
6.101ValLeu: 6.101 ± 0.057
1.474ValMet: 1.474 ± 0.024
3.904ValAsn: 3.904 ± 0.04
2.716ValPro: 2.716 ± 0.027
2.584ValGln: 2.584 ± 0.034
2.562ValArg: 2.562 ± 0.029
4.402ValSer: 4.402 ± 0.045
4.585ValThr: 4.585 ± 0.056
4.292ValVal: 4.292 ± 0.047
0.808ValTrp: 0.808 ± 0.017
2.733ValTyr: 2.733 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
0.874TrpAla: 0.874 ± 0.016
0.138TrpCys: 0.138 ± 0.007
0.73TrpAsp: 0.73 ± 0.017
0.709TrpGlu: 0.709 ± 0.016
0.664TrpPhe: 0.664 ± 0.015
0.946TrpGly: 0.946 ± 0.021
0.301TrpHis: 0.301 ± 0.01
0.814TrpIle: 0.814 ± 0.019
0.982TrpLys: 0.982 ± 0.017
1.508TrpLeu: 1.508 ± 0.025
0.39TrpMet: 0.39 ± 0.013
0.856TrpAsn: 0.856 ± 0.018
0.442TrpPro: 0.442 ± 0.013
0.753TrpGln: 0.753 ± 0.016
0.615TrpArg: 0.615 ± 0.016
0.824TrpSer: 0.824 ± 0.023
0.714TrpThr: 0.714 ± 0.018
0.862TrpVal: 0.862 ± 0.02
0.261TrpTrp: 0.261 ± 0.011
0.637TrpTyr: 0.637 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.911TyrAla: 2.911 ± 0.032
0.426TyrCys: 0.426 ± 0.011
2.369TyrAsp: 2.369 ± 0.036
2.049TyrGlu: 2.049 ± 0.031
2.288TyrPhe: 2.288 ± 0.028
2.872TyrGly: 2.872 ± 0.035
0.857TyrHis: 0.857 ± 0.019
2.573TyrIle: 2.573 ± 0.032
2.685TyrLys: 2.685 ± 0.033
3.838TyrLeu: 3.838 ± 0.045
0.823TyrMet: 0.823 ± 0.016
2.791TyrAsn: 2.791 ± 0.038
1.77TyrPro: 1.77 ± 0.024
1.696TyrGln: 1.696 ± 0.026
2.094TyrArg: 2.094 ± 0.027
2.87TyrSer: 2.87 ± 0.032
2.907TyrThr: 2.907 ± 0.041
2.331TyrVal: 2.331 ± 0.028
0.627TyrTrp: 0.627 ± 0.016
2.155TyrTyr: 2.155 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7808 proteins (2869664 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski