Amino acid dipepetide frequency for Eurypyga helias (Sunbittern)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.682AlaAla: 5.682 ± 0.061
1.327AlaCys: 1.327 ± 0.026
3.06AlaAsp: 3.06 ± 0.031
4.622AlaGlu: 4.622 ± 0.05
2.653AlaPhe: 2.653 ± 0.034
3.866AlaGly: 3.866 ± 0.041
1.354AlaHis: 1.354 ± 0.023
3.093AlaIle: 3.093 ± 0.034
3.689AlaLys: 3.689 ± 0.036
6.482AlaLeu: 6.482 ± 0.056
1.5AlaMet: 1.5 ± 0.022
2.23AlaAsn: 2.23 ± 0.026
2.951AlaPro: 2.951 ± 0.038
2.741AlaGln: 2.741 ± 0.032
3.043AlaArg: 3.043 ± 0.038
5.219AlaSer: 5.219 ± 0.044
3.309AlaThr: 3.309 ± 0.037
5.036AlaVal: 5.036 ± 0.044
0.704AlaTrp: 0.704 ± 0.016
1.68AlaTyr: 1.68 ± 0.024
0.002AlaXaa: 0.002 ± 0.001
Cys
1.182CysAla: 1.182 ± 0.02
0.628CysCys: 0.628 ± 0.016
1.063CysAsp: 1.063 ± 0.029
1.273CysGlu: 1.273 ± 0.026
0.95CysPhe: 0.95 ± 0.019
1.455CysGly: 1.455 ± 0.026
0.637CysHis: 0.637 ± 0.016
1.119CysIle: 1.119 ± 0.023
1.343CysLys: 1.343 ± 0.025
2.168CysLeu: 2.168 ± 0.031
0.436CysMet: 0.436 ± 0.012
0.922CysAsn: 0.922 ± 0.021
1.197CysPro: 1.197 ± 0.027
1.049CysGln: 1.049 ± 0.024
1.22CysArg: 1.22 ± 0.023
1.998CysSer: 1.998 ± 0.027
1.193CysThr: 1.193 ± 0.023
1.342CysVal: 1.342 ± 0.022
0.296CysTrp: 0.296 ± 0.01
0.697CysTyr: 0.697 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
3.017AspAla: 3.017 ± 0.03
1.111AspCys: 1.111 ± 0.024
2.864AspAsp: 2.864 ± 0.04
3.746AspGlu: 3.746 ± 0.042
2.317AspPhe: 2.317 ± 0.022
3.284AspGly: 3.284 ± 0.038
1.196AspHis: 1.196 ± 0.023
3.022AspIle: 3.022 ± 0.038
2.825AspLys: 2.825 ± 0.035
4.996AspLeu: 4.996 ± 0.044
1.154AspMet: 1.154 ± 0.018
1.961AspAsn: 1.961 ± 0.03
2.662AspPro: 2.662 ± 0.032
1.841AspGln: 1.841 ± 0.025
2.388AspArg: 2.388 ± 0.036
4.108AspSer: 4.108 ± 0.045
2.551AspThr: 2.551 ± 0.029
3.347AspVal: 3.347 ± 0.033
0.673AspTrp: 0.673 ± 0.015
1.679AspTyr: 1.679 ± 0.024
0.0AspXaa: 0.0 ± 0.0
Glu
4.796GluAla: 4.796 ± 0.051
1.287GluCys: 1.287 ± 0.031
4.509GluAsp: 4.509 ± 0.042
7.817GluGlu: 7.817 ± 0.1
2.222GluPhe: 2.222 ± 0.03
3.917GluGly: 3.917 ± 0.042
1.565GluHis: 1.565 ± 0.021
3.528GluIle: 3.528 ± 0.042
5.791GluLys: 5.791 ± 0.073
6.278GluLeu: 6.278 ± 0.059
1.762GluMet: 1.762 ± 0.026
3.457GluAsn: 3.457 ± 0.04
2.518GluPro: 2.518 ± 0.031
3.157GluGln: 3.157 ± 0.041
3.992GluArg: 3.992 ± 0.055
4.473GluSer: 4.473 ± 0.041
3.586GluThr: 3.586 ± 0.04
4.359GluVal: 4.359 ± 0.04
0.753GluTrp: 0.753 ± 0.017
1.878GluTyr: 1.878 ± 0.024
0.001GluXaa: 0.001 ± 0.0
Phe
2.158PheAla: 2.158 ± 0.028
1.01PheCys: 1.01 ± 0.017
1.834PheAsp: 1.834 ± 0.03
2.153PheGlu: 2.153 ± 0.028
1.951PhePhe: 1.951 ± 0.034
2.295PheGly: 2.295 ± 0.03
1.092PheHis: 1.092 ± 0.015
2.109PheIle: 2.109 ± 0.03
2.215PheLys: 2.215 ± 0.031
4.311PheLeu: 4.311 ± 0.051
0.833PheMet: 0.833 ± 0.016
1.536PheAsn: 1.536 ± 0.025
2.018PhePro: 2.018 ± 0.029
1.849PheGln: 1.849 ± 0.025
2.031PheArg: 2.031 ± 0.03
3.471PheSer: 3.471 ± 0.034
2.314PheThr: 2.314 ± 0.032
2.485PheVal: 2.485 ± 0.03
0.543PheTrp: 0.543 ± 0.014
1.329PheTyr: 1.329 ± 0.022
0.001PheXaa: 0.001 ± 0.0
Gly
3.369GlyAla: 3.369 ± 0.041
1.232GlyCys: 1.232 ± 0.023
2.927GlyAsp: 2.927 ± 0.035
3.703GlyGlu: 3.703 ± 0.052
2.555GlyPhe: 2.555 ± 0.033
3.719GlyGly: 3.719 ± 0.045
1.501GlyHis: 1.501 ± 0.024
3.131GlyIle: 3.131 ± 0.04
3.975GlyLys: 3.975 ± 0.044
5.159GlyLeu: 5.159 ± 0.057
1.4GlyMet: 1.4 ± 0.026
2.617GlyAsn: 2.617 ± 0.032
2.622GlyPro: 2.622 ± 0.072
2.388GlyGln: 2.388 ± 0.034
3.203GlyArg: 3.203 ± 0.04
4.996GlySer: 4.996 ± 0.05
3.393GlyThr: 3.393 ± 0.033
3.483GlyVal: 3.483 ± 0.036
0.77GlyTrp: 0.77 ± 0.019
1.945GlyTyr: 1.945 ± 0.033
0.0GlyXaa: 0.0 ± 0.0
His
1.356HisAla: 1.356 ± 0.021
0.709HisCys: 0.709 ± 0.017
0.941HisAsp: 0.941 ± 0.017
1.359HisGlu: 1.359 ± 0.021
1.1HisPhe: 1.1 ± 0.017
1.486HisGly: 1.486 ± 0.025
0.869HisHis: 0.869 ± 0.019
1.348HisIle: 1.348 ± 0.021
1.399HisLys: 1.399 ± 0.023
2.8HisLeu: 2.8 ± 0.036
0.589HisMet: 0.589 ± 0.016
0.974HisAsn: 0.974 ± 0.017
1.496HisPro: 1.496 ± 0.023
1.169HisGln: 1.169 ± 0.022
1.461HisArg: 1.461 ± 0.024
2.197HisSer: 2.197 ± 0.03
1.333HisThr: 1.333 ± 0.026
1.57HisVal: 1.57 ± 0.024
0.435HisTrp: 0.435 ± 0.012
0.886HisTyr: 0.886 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
3.006IleAla: 3.006 ± 0.032
1.219IleCys: 1.219 ± 0.022
2.326IleAsp: 2.326 ± 0.024
2.869IleGlu: 2.869 ± 0.033
2.214IlePhe: 2.214 ± 0.035
2.519IleGly: 2.519 ± 0.032
1.438IleHis: 1.438 ± 0.023
2.777IleIle: 2.777 ± 0.036
3.038IleLys: 3.038 ± 0.035
4.944IleLeu: 4.944 ± 0.044
1.079IleMet: 1.079 ± 0.019
2.149IleAsn: 2.149 ± 0.032
2.901IlePro: 2.901 ± 0.035
2.409IleGln: 2.409 ± 0.033
2.617IleArg: 2.617 ± 0.03
4.102IleSer: 4.102 ± 0.038
2.9IleThr: 2.9 ± 0.034
2.957IleVal: 2.957 ± 0.038
0.597IleTrp: 0.597 ± 0.016
1.608IleTyr: 1.608 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
4.23LysAla: 4.23 ± 0.046
1.246LysCys: 1.246 ± 0.022
3.429LysAsp: 3.429 ± 0.041
5.68LysGlu: 5.68 ± 0.073
2.056LysPhe: 2.056 ± 0.028
3.373LysGly: 3.373 ± 0.05
1.63LysHis: 1.63 ± 0.025
3.245LysIle: 3.245 ± 0.036
5.504LysLys: 5.504 ± 0.072
5.852LysLeu: 5.852 ± 0.056
1.559LysMet: 1.559 ± 0.02
2.778LysAsn: 2.778 ± 0.035
3.121LysPro: 3.121 ± 0.039
2.961LysGln: 2.961 ± 0.041
3.611LysArg: 3.611 ± 0.037
4.305LysSer: 4.305 ± 0.049
3.431LysThr: 3.431 ± 0.035
3.76LysVal: 3.76 ± 0.034
0.687LysTrp: 0.687 ± 0.013
1.865LysTyr: 1.865 ± 0.025
0.001LysXaa: 0.001 ± 0.0
Leu
6.246LeuAla: 6.246 ± 0.057
2.17LeuCys: 2.17 ± 0.028
5.021LeuAsp: 5.021 ± 0.05
7.012LeuGlu: 7.012 ± 0.077
3.601LeuPhe: 3.601 ± 0.045
5.124LeuGly: 5.124 ± 0.046
2.684LeuHis: 2.684 ± 0.033
4.262LeuIle: 4.262 ± 0.042
6.503LeuLys: 6.503 ± 0.059
10.069LeuLeu: 10.069 ± 0.094
2.064LeuMet: 2.064 ± 0.032
3.865LeuAsn: 3.865 ± 0.041
5.473LeuPro: 5.473 ± 0.056
5.586LeuGln: 5.586 ± 0.061
5.186LeuArg: 5.186 ± 0.047
7.904LeuSer: 7.904 ± 0.064
5.0LeuThr: 5.0 ± 0.039
5.642LeuVal: 5.642 ± 0.053
1.074LeuTrp: 1.074 ± 0.022
2.846LeuTyr: 2.846 ± 0.032
0.001LeuXaa: 0.001 ± 0.001
Met
1.628MetAla: 1.628 ± 0.024
0.448MetCys: 0.448 ± 0.013
1.298MetAsp: 1.298 ± 0.019
1.928MetGlu: 1.928 ± 0.028
0.869MetPhe: 0.869 ± 0.018
1.248MetGly: 1.248 ± 0.021
0.502MetHis: 0.502 ± 0.014
0.983MetIle: 0.983 ± 0.019
1.631MetLys: 1.631 ± 0.023
2.095MetLeu: 2.095 ± 0.029
0.612MetMet: 0.612 ± 0.015
0.995MetAsn: 0.995 ± 0.019
1.027MetPro: 1.027 ± 0.024
1.021MetGln: 1.021 ± 0.022
1.052MetArg: 1.052 ± 0.02
1.536MetSer: 1.536 ± 0.022
1.156MetThr: 1.156 ± 0.022
1.461MetVal: 1.461 ± 0.021
0.257MetTrp: 0.257 ± 0.01
0.673MetTyr: 0.673 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.33AsnAla: 2.33 ± 0.028
0.969AsnCys: 0.969 ± 0.023
1.71AsnAsp: 1.71 ± 0.03
2.544AsnGlu: 2.544 ± 0.035
1.642AsnPhe: 1.642 ± 0.021
2.841AsnGly: 2.841 ± 0.04
1.003AsnHis: 1.003 ± 0.018
2.475AsnIle: 2.475 ± 0.033
2.571AsnLys: 2.571 ± 0.033
4.172AsnLeu: 4.172 ± 0.038
1.009AsnMet: 1.009 ± 0.02
1.836AsnAsn: 1.836 ± 0.028
2.259AsnPro: 2.259 ± 0.034
1.728AsnGln: 1.728 ± 0.024
2.137AsnArg: 2.137 ± 0.024
3.459AsnSer: 3.459 ± 0.038
2.231AsnThr: 2.231 ± 0.025
2.493AsnVal: 2.493 ± 0.031
0.503AsnTrp: 0.503 ± 0.012
1.335AsnTyr: 1.335 ± 0.02
0.0AsnXaa: 0.0 ± 0.0
Pro
3.75ProAla: 3.75 ± 0.047
1.033ProCys: 1.033 ± 0.018
2.658ProAsp: 2.658 ± 0.03
3.874ProGlu: 3.874 ± 0.045
1.938ProPhe: 1.938 ± 0.029
3.579ProGly: 3.579 ± 0.112
1.244ProHis: 1.244 ± 0.022
1.946ProIle: 1.946 ± 0.028
2.768ProLys: 2.768 ± 0.039
4.652ProLeu: 4.652 ± 0.042
0.946ProMet: 0.946 ± 0.017
1.885ProAsn: 1.885 ± 0.027
4.249ProPro: 4.249 ± 0.083
2.298ProGln: 2.298 ± 0.032
2.571ProArg: 2.571 ± 0.029
4.997ProSer: 4.997 ± 0.059
2.656ProThr: 2.656 ± 0.033
3.747ProVal: 3.747 ± 0.04
0.557ProTrp: 0.557 ± 0.013
1.48ProTyr: 1.48 ± 0.025
0.001ProXaa: 0.001 ± 0.0
Gln
3.036GlnAla: 3.036 ± 0.034
0.949GlnCys: 0.949 ± 0.021
2.217GlnAsp: 2.217 ± 0.026
3.659GlnGlu: 3.659 ± 0.044
1.466GlnPhe: 1.466 ± 0.021
2.425GlnGly: 2.425 ± 0.036
1.297GlnHis: 1.297 ± 0.025
2.185GlnIle: 2.185 ± 0.028
3.157GlnLys: 3.157 ± 0.041
4.535GlnLeu: 4.535 ± 0.051
1.121GlnMet: 1.121 ± 0.02
2.021GlnAsn: 2.021 ± 0.03
2.343GlnPro: 2.343 ± 0.037
2.893GlnGln: 2.893 ± 0.061
2.645GlnArg: 2.645 ± 0.033
3.172GlnSer: 3.172 ± 0.037
2.357GlnThr: 2.357 ± 0.029
2.728GlnVal: 2.728 ± 0.034
0.528GlnTrp: 0.528 ± 0.014
1.284GlnTyr: 1.284 ± 0.021
0.002GlnXaa: 0.002 ± 0.001
Arg
3.01ArgAla: 3.01 ± 0.03
1.126ArgCys: 1.126 ± 0.025
2.627ArgAsp: 2.627 ± 0.037
3.79ArgGlu: 3.79 ± 0.04
1.991ArgPhe: 1.991 ± 0.024
2.817ArgGly: 2.817 ± 0.043
1.537ArgHis: 1.537 ± 0.024
2.641ArgIle: 2.641 ± 0.032
4.001ArgLys: 4.001 ± 0.049
5.126ArgLeu: 5.126 ± 0.05
1.175ArgMet: 1.175 ± 0.017
2.309ArgAsn: 2.309 ± 0.025
2.414ArgPro: 2.414 ± 0.033
2.487ArgGln: 2.487 ± 0.034
3.686ArgArg: 3.686 ± 0.041
4.027ArgSer: 4.027 ± 0.059
2.69ArgThr: 2.69 ± 0.029
3.033ArgVal: 3.033 ± 0.033
0.627ArgTrp: 0.627 ± 0.018
1.651ArgTyr: 1.651 ± 0.024
0.001ArgXaa: 0.001 ± 0.0
Ser
5.205SerAla: 5.205 ± 0.045
1.781SerCys: 1.781 ± 0.025
4.065SerAsp: 4.065 ± 0.047
5.12SerGlu: 5.12 ± 0.047
3.155SerPhe: 3.155 ± 0.034
4.95SerGly: 4.95 ± 0.05
1.988SerHis: 1.988 ± 0.03
3.557SerIle: 3.557 ± 0.038
4.503SerLys: 4.503 ± 0.042
8.099SerLeu: 8.099 ± 0.062
1.606SerMet: 1.606 ± 0.023
3.159SerAsn: 3.159 ± 0.035
5.182SerPro: 5.182 ± 0.07
3.623SerGln: 3.623 ± 0.043
4.123SerArg: 4.123 ± 0.052
9.014SerSer: 9.014 ± 0.097
4.521SerThr: 4.521 ± 0.047
5.118SerVal: 5.118 ± 0.047
0.959SerTrp: 0.959 ± 0.018
2.264SerTyr: 2.264 ± 0.027
0.0SerXaa: 0.0 ± 0.0
Thr
3.844ThrAla: 3.844 ± 0.037
1.293ThrCys: 1.293 ± 0.029
2.778ThrAsp: 2.778 ± 0.035
3.8ThrGlu: 3.8 ± 0.042
2.182ThrPhe: 2.182 ± 0.029
3.396ThrGly: 3.396 ± 0.043
1.237ThrHis: 1.237 ± 0.021
2.574ThrIle: 2.574 ± 0.035
2.899ThrLys: 2.899 ± 0.03
5.181ThrLeu: 5.181 ± 0.041
1.127ThrMet: 1.127 ± 0.02
1.949ThrAsn: 1.949 ± 0.025
3.135ThrPro: 3.135 ± 0.039
2.122ThrGln: 2.122 ± 0.029
2.338ThrArg: 2.338 ± 0.027
4.632ThrSer: 4.632 ± 0.053
2.931ThrThr: 2.931 ± 0.039
4.134ThrVal: 4.134 ± 0.046
0.664ThrTrp: 0.664 ± 0.016
1.571ThrTyr: 1.571 ± 0.024
0.002ThrXaa: 0.002 ± 0.001
Val
4.123ValAla: 4.123 ± 0.04
1.591ValCys: 1.591 ± 0.031
3.224ValAsp: 3.224 ± 0.03
4.027ValGlu: 4.027 ± 0.04
2.795ValPhe: 2.795 ± 0.031
3.388ValGly: 3.388 ± 0.034
1.602ValHis: 1.602 ± 0.022
3.337ValIle: 3.337 ± 0.033
3.874ValLys: 3.874 ± 0.037
6.303ValLeu: 6.303 ± 0.054
1.446ValMet: 1.446 ± 0.022
2.57ValAsn: 2.57 ± 0.036
3.431ValPro: 3.431 ± 0.038
2.819ValGln: 2.819 ± 0.034
3.056ValArg: 3.056 ± 0.033
5.051ValSer: 5.051 ± 0.045
3.92ValThr: 3.92 ± 0.039
4.373ValVal: 4.373 ± 0.04
0.736ValTrp: 0.736 ± 0.016
1.899ValTyr: 1.899 ± 0.029
0.001ValXaa: 0.001 ± 0.0
Trp
0.654TrpAla: 0.654 ± 0.015
0.251TrpCys: 0.251 ± 0.01
0.684TrpAsp: 0.684 ± 0.018
0.808TrpGlu: 0.808 ± 0.017
0.464TrpPhe: 0.464 ± 0.012
0.633TrpGly: 0.633 ± 0.016
0.301TrpHis: 0.301 ± 0.01
0.643TrpIle: 0.643 ± 0.014
0.892TrpLys: 0.892 ± 0.016
1.161TrpLeu: 1.161 ± 0.021
0.316TrpMet: 0.316 ± 0.01
0.717TrpAsn: 0.717 ± 0.016
0.438TrpPro: 0.438 ± 0.011
0.552TrpGln: 0.552 ± 0.014
0.672TrpArg: 0.672 ± 0.015
0.905TrpSer: 0.905 ± 0.018
0.646TrpThr: 0.646 ± 0.014
0.675TrpVal: 0.675 ± 0.016
0.197TrpTrp: 0.197 ± 0.009
0.378TrpTyr: 0.378 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.623TyrAla: 1.623 ± 0.025
0.789TyrCys: 0.789 ± 0.018
1.506TyrAsp: 1.506 ± 0.021
1.86TyrGlu: 1.86 ± 0.026
1.416TyrPhe: 1.416 ± 0.025
1.773TyrGly: 1.773 ± 0.026
0.801TyrHis: 0.801 ± 0.018
1.691TyrIle: 1.691 ± 0.025
1.746TyrLys: 1.746 ± 0.025
2.949TyrLeu: 2.949 ± 0.035
0.687TyrMet: 0.687 ± 0.015
1.316TyrAsn: 1.316 ± 0.023
1.39TyrPro: 1.39 ± 0.026
1.326TyrGln: 1.326 ± 0.024
1.735TyrArg: 1.735 ± 0.025
2.424TyrSer: 2.424 ± 0.03
1.673TyrThr: 1.673 ± 0.025
1.82TyrVal: 1.82 ± 0.023
0.424TyrTrp: 0.424 ± 0.017
1.135TyrTyr: 1.135 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.002XaaAsp: 0.002 ± 0.001
0.001XaaGlu: 0.001 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.002XaaPro: 0.002 ± 0.001
0.002XaaGln: 0.002 ± 0.001
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.001
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.064XaaXaa: 0.064 ± 0.011
Statistics based on 7979 proteins (3218414 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski