Amino acid dipeptide frequency for Hornefia porci

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
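As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that sequences use one-letter codes (as in FASTA files) rather than the three-letter codes of the table, and that any non-standard letter is pooled as X (Xaa). The function names are hypothetical and this is not necessarily the exact procedure used by Proteome-pI.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein (assumption)."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs never span two proteins: each sequence is scanned separately.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input: "B" is not a standard residue, so "AKKB" is read as "AKKX".
freqs = dipeptide_permille(["MKKLA", "AKKB"])
```

In this toy input there are seven pairs in total, two of which are KK, so `freqs["KK"]` is 2/7 expressed in per mille.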

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
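The expectation and the colour marking can be sketched in a few lines of Python. This assumes that "more than expected" means above the uniform baseline of 1/400; the page does not state the exact rule, and the `classify` helper is hypothetical. The three observed values are copied from the table below.

```python
N_STANDARD = 20

# Uniform baseline: 1 / (20 * 20) = 0.0025, i.e. 0.25 % or 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"    # shown in red on the original page
    if observed_permille < expected:
        return "under"   # shown in blue on the original page
    return "as expected"


# Observed frequencies in per mille, taken from the table.
examples = {"AlaAla": 9.459, "AspPro": 2.513, "TrpTrp": 0.092}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this rule AlaAla and AspPro come out as overrepresented and TrpTrp as underrepresented. A baseline built from the observed single amino acid frequencies would give different labels for many pairs.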

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions (for example, 9.459‰ = 0.009459).

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is frequency ± error (‰).

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.459 ± 0.173 | 1.206 ± 0.042 | 5.093 ± 0.082 | 6.847 ± 0.145 | 3.175 ± 0.071 | 7.436 ± 0.134 | 1.23 ± 0.042 | 4.714 ± 0.09 | 4.3 ± 0.106 | 7.389 ± 0.1 | 2.667 ± 0.064 | 2.338 ± 0.051 | 2.438 ± 0.061 | 2.285 ± 0.071 | 4.337 ± 0.088 | 4.421 ± 0.106 | 3.438 ± 0.071 | 7.029 ± 0.101 | 0.658 ± 0.029 | 2.601 ± 0.057 | 0.0 ± 0.0
Cys | 1.163 ± 0.042 | 0.355 ± 0.025 | 0.972 ± 0.033 | 0.953 ± 0.039 | 0.588 ± 0.03 | 1.669 ± 0.054 | 0.277 ± 0.018 | 0.978 ± 0.031 | 0.552 ± 0.028 | 1.085 ± 0.039 | 0.384 ± 0.023 | 0.483 ± 0.022 | 0.685 ± 0.035 | 0.289 ± 0.016 | 1.094 ± 0.041 | 0.894 ± 0.033 | 0.755 ± 0.036 | 1.099 ± 0.038 | 0.137 ± 0.015 | 0.552 ± 0.031 | 0.0 ± 0.0
Asp | 4.89 ± 0.1 | 0.877 ± 0.033 | 3.092 ± 0.068 | 4.65 ± 0.086 | 2.788 ± 0.062 | 5.036 ± 0.103 | 0.96 ± 0.034 | 4.298 ± 0.073 | 3.276 ± 0.088 | 4.985 ± 0.092 | 1.888 ± 0.047 | 1.953 ± 0.056 | 2.513 ± 0.062 | 1.344 ± 0.043 | 3.857 ± 0.087 | 3.335 ± 0.071 | 3.059 ± 0.067 | 3.901 ± 0.067 | 0.559 ± 0.028 | 2.605 ± 0.065 | 0.0 ± 0.0
Glu | 5.86 ± 0.112 | 0.783 ± 0.036 | 4.414 ± 0.084 | 6.543 ± 0.12 | 2.377 ± 0.055 | 4.437 ± 0.085 | 1.483 ± 0.052 | 5.651 ± 0.102 | 5.829 ± 0.098 | 6.577 ± 0.112 | 2.389 ± 0.057 | 3.631 ± 0.07 | 2.199 ± 0.055 | 2.632 ± 0.057 | 3.991 ± 0.084 | 3.314 ± 0.076 | 3.935 ± 0.088 | 4.02 ± 0.086 | 0.556 ± 0.03 | 2.923 ± 0.067 | 0.0 ± 0.0
Phe | 3.063 ± 0.067 | 0.743 ± 0.035 | 2.525 ± 0.058 | 2.29 ± 0.055 | 1.606 ± 0.053 | 3.144 ± 0.063 | 0.756 ± 0.038 | 2.385 ± 0.06 | 1.456 ± 0.044 | 3.345 ± 0.078 | 1.071 ± 0.041 | 1.446 ± 0.044 | 1.399 ± 0.045 | 0.999 ± 0.034 | 2.615 ± 0.06 | 2.688 ± 0.061 | 2.265 ± 0.065 | 2.94 ± 0.07 | 0.37 ± 0.021 | 1.461 ± 0.049 | 0.0 ± 0.0
Gly | 5.98 ± 0.106 | 1.27 ± 0.045 | 4.289 ± 0.085 | 5.18 ± 0.094 | 3.175 ± 0.075 | 5.895 ± 0.098 | 1.383 ± 0.042 | 5.998 ± 0.105 | 5.534 ± 0.108 | 6.067 ± 0.083 | 2.497 ± 0.067 | 3.085 ± 0.073 | 1.595 ± 0.045 | 2.015 ± 0.054 | 4.436 ± 0.081 | 4.469 ± 0.087 | 4.816 ± 0.092 | 5.427 ± 0.086 | 0.74 ± 0.043 | 3.17 ± 0.06 | 0.0 ± 0.0
His | 1.185 ± 0.043 | 0.307 ± 0.02 | 0.945 ± 0.034 | 1.086 ± 0.042 | 0.705 ± 0.029 | 1.435 ± 0.046 | 0.389 ± 0.033 | 1.315 ± 0.046 | 0.824 ± 0.036 | 1.387 ± 0.045 | 0.497 ± 0.026 | 0.588 ± 0.03 | 0.929 ± 0.036 | 0.472 ± 0.028 | 1.065 ± 0.039 | 0.95 ± 0.033 | 0.897 ± 0.034 | 1.152 ± 0.04 | 0.175 ± 0.017 | 0.788 ± 0.033 | 0.0 ± 0.0
Ile | 5.86 ± 0.088 | 1.28 ± 0.042 | 4.411 ± 0.079 | 4.343 ± 0.084 | 2.638 ± 0.066 | 5.045 ± 0.086 | 1.18 ± 0.036 | 4.458 ± 0.085 | 2.874 ± 0.073 | 6.12 ± 0.117 | 1.871 ± 0.056 | 2.459 ± 0.069 | 3.012 ± 0.06 | 1.832 ± 0.045 | 5.033 ± 0.093 | 4.406 ± 0.082 | 4.106 ± 0.082 | 4.787 ± 0.092 | 0.541 ± 0.024 | 2.289 ± 0.054 | 0.0 ± 0.0
Lys | 5.108 ± 0.124 | 0.617 ± 0.03 | 3.77 ± 0.085 | 5.078 ± 0.11 | 1.703 ± 0.046 | 3.974 ± 0.081 | 0.867 ± 0.035 | 3.959 ± 0.08 | 5.286 ± 0.115 | 4.739 ± 0.087 | 1.883 ± 0.048 | 2.79 ± 0.058 | 1.952 ± 0.05 | 1.845 ± 0.053 | 3.043 ± 0.059 | 3.322 ± 0.075 | 3.638 ± 0.072 | 3.757 ± 0.08 | 0.579 ± 0.029 | 2.655 ± 0.064 | 0.0 ± 0.0
Leu | 7.025 ± 0.108 | 1.415 ± 0.046 | 4.849 ± 0.086 | 5.675 ± 0.103 | 3.407 ± 0.077 | 5.7 ± 0.088 | 1.443 ± 0.047 | 5.839 ± 0.093 | 5.476 ± 0.089 | 7.561 ± 0.129 | 2.606 ± 0.059 | 3.38 ± 0.063 | 3.494 ± 0.064 | 2.444 ± 0.055 | 4.937 ± 0.101 | 5.927 ± 0.097 | 5.151 ± 0.084 | 4.972 ± 0.088 | 0.682 ± 0.031 | 3.041 ± 0.066 | 0.0 ± 0.0
Met | 2.508 ± 0.059 | 0.339 ± 0.021 | 1.855 ± 0.047 | 2.284 ± 0.054 | 1.007 ± 0.039 | 2.104 ± 0.054 | 0.463 ± 0.023 | 2.026 ± 0.055 | 2.478 ± 0.047 | 2.532 ± 0.059 | 0.94 ± 0.035 | 1.607 ± 0.049 | 1.174 ± 0.04 | 0.86 ± 0.032 | 1.601 ± 0.047 | 1.817 ± 0.049 | 1.938 ± 0.05 | 1.708 ± 0.046 | 0.201 ± 0.015 | 0.86 ± 0.03 | 0.0 ± 0.0
Asn | 2.932 ± 0.056 | 0.591 ± 0.028 | 2.049 ± 0.055 | 2.346 ± 0.065 | 1.426 ± 0.046 | 3.35 ± 0.075 | 0.686 ± 0.032 | 2.915 ± 0.059 | 2.124 ± 0.054 | 3.296 ± 0.061 | 1.11 ± 0.036 | 1.37 ± 0.049 | 1.866 ± 0.058 | 1.002 ± 0.037 | 2.454 ± 0.059 | 2.085 ± 0.045 | 2.131 ± 0.059 | 2.644 ± 0.06 | 0.369 ± 0.022 | 1.617 ± 0.049 | 0.0 ± 0.0
Pro | 3.228 ± 0.067 | 0.487 ± 0.024 | 2.645 ± 0.062 | 4.012 ± 0.08 | 1.445 ± 0.043 | 3.03 ± 0.055 | 0.612 ± 0.027 | 1.86 ± 0.055 | 1.772 ± 0.044 | 2.799 ± 0.064 | 0.917 ± 0.04 | 1.048 ± 0.037 | 0.994 ± 0.035 | 1.072 ± 0.05 | 1.543 ± 0.044 | 1.933 ± 0.062 | 1.579 ± 0.047 | 3.29 ± 0.067 | 0.33 ± 0.019 | 1.392 ± 0.041 | 0.0 ± 0.0
Gln | 2.368 ± 0.066 | 0.336 ± 0.022 | 1.383 ± 0.042 | 2.029 ± 0.055 | 1.007 ± 0.038 | 1.802 ± 0.049 | 0.474 ± 0.026 | 2.16 ± 0.059 | 2.228 ± 0.059 | 2.448 ± 0.057 | 1.026 ± 0.038 | 1.247 ± 0.037 | 0.896 ± 0.051 | 1.055 ± 0.056 | 1.586 ± 0.043 | 1.535 ± 0.042 | 1.545 ± 0.05 | 1.715 ± 0.05 | 0.239 ± 0.017 | 1.182 ± 0.042 | 0.0 ± 0.0
Arg | 3.972 ± 0.071 | 0.807 ± 0.034 | 3.404 ± 0.071 | 5.209 ± 0.094 | 2.246 ± 0.064 | 3.781 ± 0.086 | 1.026 ± 0.041 | 4.504 ± 0.081 | 4.173 ± 0.089 | 5.058 ± 0.097 | 1.91 ± 0.052 | 2.507 ± 0.061 | 1.922 ± 0.049 | 2.137 ± 0.064 | 4.375 ± 0.101 | 3.092 ± 0.065 | 3.044 ± 0.063 | 3.521 ± 0.072 | 0.529 ± 0.025 | 2.443 ± 0.058 | 0.0 ± 0.0
Ser | 4.902 ± 0.091 | 0.819 ± 0.032 | 3.674 ± 0.072 | 3.787 ± 0.066 | 2.379 ± 0.064 | 5.921 ± 0.109 | 0.972 ± 0.037 | 3.676 ± 0.068 | 2.752 ± 0.066 | 4.812 ± 0.084 | 1.714 ± 0.047 | 1.825 ± 0.056 | 2.024 ± 0.047 | 1.498 ± 0.048 | 3.856 ± 0.088 | 3.677 ± 0.096 | 2.896 ± 0.069 | 4.505 ± 0.092 | 0.58 ± 0.025 | 2.046 ± 0.053 | 0.0 ± 0.0
Thr | 5.036 ± 0.108 | 0.726 ± 0.029 | 3.495 ± 0.067 | 3.843 ± 0.074 | 2.056 ± 0.06 | 5.326 ± 0.099 | 0.872 ± 0.034 | 3.461 ± 0.071 | 2.856 ± 0.076 | 4.751 ± 0.093 | 1.46 ± 0.043 | 1.822 ± 0.061 | 2.45 ± 0.057 | 1.381 ± 0.045 | 2.603 ± 0.063 | 2.925 ± 0.063 | 2.785 ± 0.069 | 4.926 ± 0.11 | 0.505 ± 0.029 | 1.894 ± 0.068 | 0.0 ± 0.0
Val | 5.072 ± 0.078 | 1.238 ± 0.041 | 3.835 ± 0.076 | 4.215 ± 0.091 | 2.884 ± 0.071 | 4.265 ± 0.082 | 1.129 ± 0.04 | 5.362 ± 0.102 | 4.047 ± 0.093 | 6.16 ± 0.099 | 2.158 ± 0.058 | 2.766 ± 0.058 | 2.813 ± 0.058 | 1.657 ± 0.054 | 4.279 ± 0.091 | 4.695 ± 0.087 | 4.524 ± 0.131 | 4.586 ± 0.101 | 0.619 ± 0.032 | 2.508 ± 0.057 | 0.0 ± 0.0
Trp | 0.575 ± 0.029 | 0.155 ± 0.014 | 0.516 ± 0.027 | 0.568 ± 0.031 | 0.388 ± 0.023 | 0.614 ± 0.029 | 0.169 ± 0.016 | 0.625 ± 0.03 | 0.643 ± 0.031 | 0.802 ± 0.032 | 0.279 ± 0.021 | 0.485 ± 0.029 | 0.268 ± 0.02 | 0.33 ± 0.02 | 0.452 ± 0.026 | 0.534 ± 0.035 | 0.462 ± 0.028 | 0.501 ± 0.029 | 0.092 ± 0.012 | 0.371 ± 0.024 | 0.0 ± 0.0
Tyr | 2.874 ± 0.07 | 0.613 ± 0.027 | 2.653 ± 0.052 | 2.604 ± 0.065 | 1.569 ± 0.046 | 3.054 ± 0.068 | 0.72 ± 0.028 | 2.362 ± 0.057 | 2.087 ± 0.06 | 3.111 ± 0.067 | 1.033 ± 0.037 | 1.533 ± 0.052 | 1.368 ± 0.045 | 1.113 ± 0.041 | 2.504 ± 0.06 | 2.294 ± 0.068 | 2.205 ± 0.068 | 2.336 ± 0.052 | 0.366 ± 0.026 | 1.652 ± 0.051 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2432 proteins (794604 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
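A protein-level bootstrap can be sketched as follows. This is a minimal illustration, assuming that "x100" means 100 replicates, that each replicate resamples whole proteins with replacement, and that the reported error is the standard deviation across replicates. None of these details are confirmed by the page, and the function names are hypothetical.

```python
import random
import statistics


def pair_frequency_permille(proteins, pair):
    """Frequency of one dipeptide (overlapping pairs) in per mille."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins, not individual residues or pairs.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency_permille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MKKLA", "AKKLV", "MAAAK", "GKKKK"]
err = bootstrap_error(toy_proteome, "KK")
```

Resampling at the protein level keeps the residues of each protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.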

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski