Amino acid dipepetide frequency for Phaethon lepturus (White-tailed tropicbird)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.532AlaAla: 5.532 ± 0.058
1.305AlaCys: 1.305 ± 0.02
2.966AlaAsp: 2.966 ± 0.03
4.535AlaGlu: 4.535 ± 0.041
2.625AlaPhe: 2.625 ± 0.03
3.816AlaGly: 3.816 ± 0.038
1.354AlaHis: 1.354 ± 0.02
3.114AlaIle: 3.114 ± 0.032
3.749AlaLys: 3.749 ± 0.036
6.36AlaLeu: 6.36 ± 0.05
1.518AlaMet: 1.518 ± 0.022
2.253AlaAsn: 2.253 ± 0.024
2.875AlaPro: 2.875 ± 0.037
2.738AlaGln: 2.738 ± 0.032
2.847AlaArg: 2.847 ± 0.029
5.152AlaSer: 5.152 ± 0.043
3.327AlaThr: 3.327 ± 0.027
4.828AlaVal: 4.828 ± 0.041
0.697AlaTrp: 0.697 ± 0.013
1.634AlaTyr: 1.634 ± 0.02
0.001AlaXaa: 0.001 ± 0.0
Cys
1.155CysAla: 1.155 ± 0.022
0.652CysCys: 0.652 ± 0.016
1.053CysAsp: 1.053 ± 0.027
1.314CysGlu: 1.314 ± 0.031
0.934CysPhe: 0.934 ± 0.017
1.423CysGly: 1.423 ± 0.028
0.639CysHis: 0.639 ± 0.016
1.189CysIle: 1.189 ± 0.025
1.353CysLys: 1.353 ± 0.024
2.125CysLeu: 2.125 ± 0.028
0.456CysMet: 0.456 ± 0.012
0.932CysAsn: 0.932 ± 0.016
1.221CysPro: 1.221 ± 0.032
1.07CysGln: 1.07 ± 0.024
1.219CysArg: 1.219 ± 0.022
2.046CysSer: 2.046 ± 0.032
1.192CysThr: 1.192 ± 0.024
1.344CysVal: 1.344 ± 0.024
0.299CysTrp: 0.299 ± 0.009
0.687CysTyr: 0.687 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
2.917AspAla: 2.917 ± 0.027
1.106AspCys: 1.106 ± 0.021
2.915AspAsp: 2.915 ± 0.044
3.731AspGlu: 3.731 ± 0.041
2.316AspPhe: 2.316 ± 0.024
3.31AspGly: 3.31 ± 0.039
1.161AspHis: 1.161 ± 0.019
3.031AspIle: 3.031 ± 0.03
2.852AspLys: 2.852 ± 0.029
5.119AspLeu: 5.119 ± 0.043
1.199AspMet: 1.199 ± 0.017
2.015AspAsn: 2.015 ± 0.027
2.644AspPro: 2.644 ± 0.027
1.875AspGln: 1.875 ± 0.022
2.388AspArg: 2.388 ± 0.03
4.173AspSer: 4.173 ± 0.04
2.534AspThr: 2.534 ± 0.024
3.25AspVal: 3.25 ± 0.031
0.667AspTrp: 0.667 ± 0.013
1.655AspTyr: 1.655 ± 0.021
0.001AspXaa: 0.001 ± 0.0
Glu
4.658GluAla: 4.658 ± 0.04
1.345GluCys: 1.345 ± 0.037
4.569GluAsp: 4.569 ± 0.043
7.992GluGlu: 7.992 ± 0.085
2.224GluPhe: 2.224 ± 0.025
3.958GluGly: 3.958 ± 0.048
1.572GluHis: 1.572 ± 0.019
3.604GluIle: 3.604 ± 0.038
5.865GluLys: 5.865 ± 0.068
6.392GluLeu: 6.392 ± 0.058
1.813GluMet: 1.813 ± 0.023
3.534GluAsn: 3.534 ± 0.032
2.498GluPro: 2.498 ± 0.027
3.275GluGln: 3.275 ± 0.041
3.904GluArg: 3.904 ± 0.048
4.545GluSer: 4.545 ± 0.048
3.632GluThr: 3.632 ± 0.037
4.448GluVal: 4.448 ± 0.039
0.753GluTrp: 0.753 ± 0.016
1.86GluTyr: 1.86 ± 0.025
0.0GluXaa: 0.0 ± 0.0
Phe
2.147PheAla: 2.147 ± 0.022
1.018PheCys: 1.018 ± 0.016
1.847PheAsp: 1.847 ± 0.022
2.169PheGlu: 2.169 ± 0.024
1.867PhePhe: 1.867 ± 0.03
2.315PheGly: 2.315 ± 0.03
1.094PheHis: 1.094 ± 0.018
2.13PheIle: 2.13 ± 0.029
2.135PheLys: 2.135 ± 0.025
4.22PheLeu: 4.22 ± 0.046
0.803PheMet: 0.803 ± 0.015
1.549PheAsn: 1.549 ± 0.025
1.958PhePro: 1.958 ± 0.028
1.853PheGln: 1.853 ± 0.023
1.971PheArg: 1.971 ± 0.027
3.498PheSer: 3.498 ± 0.037
2.307PheThr: 2.307 ± 0.026
2.392PheVal: 2.392 ± 0.031
0.521PheTrp: 0.521 ± 0.014
1.348PheTyr: 1.348 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
3.366GlyAla: 3.366 ± 0.035
1.226GlyCys: 1.226 ± 0.024
2.945GlyAsp: 2.945 ± 0.034
3.743GlyGlu: 3.743 ± 0.043
2.483GlyPhe: 2.483 ± 0.035
3.777GlyGly: 3.777 ± 0.055
1.472GlyHis: 1.472 ± 0.024
3.144GlyIle: 3.144 ± 0.032
4.007GlyLys: 4.007 ± 0.043
5.093GlyLeu: 5.093 ± 0.049
1.398GlyMet: 1.398 ± 0.023
2.646GlyAsn: 2.646 ± 0.034
2.757GlyPro: 2.757 ± 0.083
2.492GlyGln: 2.492 ± 0.037
3.169GlyArg: 3.169 ± 0.05
4.973GlySer: 4.973 ± 0.05
3.385GlyThr: 3.385 ± 0.036
3.41GlyVal: 3.41 ± 0.037
0.75GlyTrp: 0.75 ± 0.018
1.881GlyTyr: 1.881 ± 0.027
0.001GlyXaa: 0.001 ± 0.001
His
1.344HisAla: 1.344 ± 0.018
0.719HisCys: 0.719 ± 0.015
0.946HisAsp: 0.946 ± 0.017
1.373HisGlu: 1.373 ± 0.02
1.092HisPhe: 1.092 ± 0.018
1.445HisGly: 1.445 ± 0.023
0.849HisHis: 0.849 ± 0.019
1.365HisIle: 1.365 ± 0.018
1.408HisLys: 1.408 ± 0.019
2.79HisLeu: 2.79 ± 0.024
0.606HisMet: 0.606 ± 0.013
1.0HisAsn: 1.0 ± 0.017
1.455HisPro: 1.455 ± 0.023
1.177HisGln: 1.177 ± 0.021
1.458HisArg: 1.458 ± 0.023
2.167HisSer: 2.167 ± 0.026
1.284HisThr: 1.284 ± 0.019
1.536HisVal: 1.536 ± 0.02
0.399HisTrp: 0.399 ± 0.011
0.859HisTyr: 0.859 ± 0.015
0.0HisXaa: 0.0 ± 0.0
Ile
3.056IleAla: 3.056 ± 0.031
1.242IleCys: 1.242 ± 0.022
2.389IleAsp: 2.389 ± 0.027
2.981IleGlu: 2.981 ± 0.034
2.213IlePhe: 2.213 ± 0.027
2.525IleGly: 2.525 ± 0.029
1.408IleHis: 1.408 ± 0.018
2.756IleIle: 2.756 ± 0.032
3.1IleLys: 3.1 ± 0.033
5.035IleLeu: 5.035 ± 0.048
1.109IleMet: 1.109 ± 0.017
2.224IleAsn: 2.224 ± 0.027
2.852IlePro: 2.852 ± 0.03
2.524IleGln: 2.524 ± 0.027
2.594IleArg: 2.594 ± 0.024
4.15IleSer: 4.15 ± 0.036
2.93IleThr: 2.93 ± 0.028
2.993IleVal: 2.993 ± 0.033
0.593IleTrp: 0.593 ± 0.014
1.619IleTyr: 1.619 ± 0.024
0.0IleXaa: 0.0 ± 0.0
Lys
4.083LysAla: 4.083 ± 0.034
1.253LysCys: 1.253 ± 0.022
3.593LysAsp: 3.593 ± 0.036
5.923LysGlu: 5.923 ± 0.075
1.992LysPhe: 1.992 ± 0.023
3.437LysGly: 3.437 ± 0.041
1.627LysHis: 1.627 ± 0.021
3.288LysIle: 3.288 ± 0.031
5.55LysLys: 5.55 ± 0.068
5.824LysLeu: 5.824 ± 0.057
1.585LysMet: 1.585 ± 0.021
2.898LysAsn: 2.898 ± 0.027
3.097LysPro: 3.097 ± 0.036
3.06LysGln: 3.06 ± 0.033
3.557LysArg: 3.557 ± 0.037
4.306LysSer: 4.306 ± 0.044
3.466LysThr: 3.466 ± 0.036
3.852LysVal: 3.852 ± 0.037
0.685LysTrp: 0.685 ± 0.014
1.898LysTyr: 1.898 ± 0.022
0.0LysXaa: 0.0 ± 0.0
Leu
6.154LeuAla: 6.154 ± 0.053
2.184LeuCys: 2.184 ± 0.029
4.928LeuAsp: 4.928 ± 0.046
7.1LeuGlu: 7.1 ± 0.067
3.601LeuPhe: 3.601 ± 0.042
5.127LeuGly: 5.127 ± 0.044
2.705LeuHis: 2.705 ± 0.032
4.322LeuIle: 4.322 ± 0.037
6.503LeuLys: 6.503 ± 0.051
9.974LeuLeu: 9.974 ± 0.086
2.062LeuMet: 2.062 ± 0.026
3.983LeuAsn: 3.983 ± 0.035
5.38LeuPro: 5.38 ± 0.046
5.692LeuGln: 5.692 ± 0.057
5.146LeuArg: 5.146 ± 0.047
7.77LeuSer: 7.77 ± 0.049
5.009LeuThr: 5.009 ± 0.045
5.447LeuVal: 5.447 ± 0.044
1.082LeuTrp: 1.082 ± 0.022
2.806LeuTyr: 2.806 ± 0.035
0.001LeuXaa: 0.001 ± 0.0
Met
1.628MetAla: 1.628 ± 0.021
0.444MetCys: 0.444 ± 0.012
1.327MetAsp: 1.327 ± 0.019
1.941MetGlu: 1.941 ± 0.027
0.856MetPhe: 0.856 ± 0.016
1.247MetGly: 1.247 ± 0.021
0.529MetHis: 0.529 ± 0.011
1.006MetIle: 1.006 ± 0.017
1.625MetLys: 1.625 ± 0.024
2.133MetLeu: 2.133 ± 0.027
0.637MetMet: 0.637 ± 0.014
1.007MetAsn: 1.007 ± 0.018
1.048MetPro: 1.048 ± 0.018
1.043MetGln: 1.043 ± 0.019
1.06MetArg: 1.06 ± 0.018
1.578MetSer: 1.578 ± 0.022
1.172MetThr: 1.172 ± 0.017
1.478MetVal: 1.478 ± 0.019
0.259MetTrp: 0.259 ± 0.008
0.675MetTyr: 0.675 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
2.377AsnAla: 2.377 ± 0.026
1.003AsnCys: 1.003 ± 0.021
1.758AsnAsp: 1.758 ± 0.024
2.643AsnGlu: 2.643 ± 0.031
1.685AsnPhe: 1.685 ± 0.024
2.806AsnGly: 2.806 ± 0.037
1.027AsnHis: 1.027 ± 0.015
2.602AsnIle: 2.602 ± 0.028
2.671AsnLys: 2.671 ± 0.031
4.149AsnLeu: 4.149 ± 0.035
1.046AsnMet: 1.046 ± 0.015
1.873AsnAsn: 1.873 ± 0.026
2.32AsnPro: 2.32 ± 0.03
1.754AsnGln: 1.754 ± 0.024
2.077AsnArg: 2.077 ± 0.026
3.5AsnSer: 3.5 ± 0.037
2.316AsnThr: 2.316 ± 0.024
2.524AsnVal: 2.524 ± 0.027
0.513AsnTrp: 0.513 ± 0.012
1.345AsnTyr: 1.345 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
3.633ProAla: 3.633 ± 0.044
1.034ProCys: 1.034 ± 0.026
2.615ProAsp: 2.615 ± 0.029
3.836ProGlu: 3.836 ± 0.038
1.902ProPhe: 1.902 ± 0.029
3.579ProGly: 3.579 ± 0.094
1.216ProHis: 1.216 ± 0.023
1.992ProIle: 1.992 ± 0.028
2.734ProLys: 2.734 ± 0.032
4.592ProLeu: 4.592 ± 0.041
0.969ProMet: 0.969 ± 0.017
1.941ProAsn: 1.941 ± 0.029
4.171ProPro: 4.171 ± 0.074
2.333ProGln: 2.333 ± 0.034
2.524ProArg: 2.524 ± 0.031
4.858ProSer: 4.858 ± 0.051
2.661ProThr: 2.661 ± 0.031
3.695ProVal: 3.695 ± 0.035
0.553ProTrp: 0.553 ± 0.014
1.476ProTyr: 1.476 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.053GlnAla: 3.053 ± 0.036
0.978GlnCys: 0.978 ± 0.021
2.283GlnAsp: 2.283 ± 0.026
3.747GlnGlu: 3.747 ± 0.043
1.485GlnPhe: 1.485 ± 0.02
2.507GlnGly: 2.507 ± 0.037
1.306GlnHis: 1.306 ± 0.019
2.297GlnIle: 2.297 ± 0.026
3.244GlnLys: 3.244 ± 0.043
4.647GlnLeu: 4.647 ± 0.049
1.132GlnMet: 1.132 ± 0.019
2.111GlnAsn: 2.111 ± 0.025
2.322GlnPro: 2.322 ± 0.04
3.088GlnGln: 3.088 ± 0.064
2.637GlnArg: 2.637 ± 0.026
3.228GlnSer: 3.228 ± 0.039
2.395GlnThr: 2.395 ± 0.031
2.754GlnVal: 2.754 ± 0.031
0.52GlnTrp: 0.52 ± 0.012
1.33GlnTyr: 1.33 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
2.974ArgAla: 2.974 ± 0.03
1.112ArgCys: 1.112 ± 0.028
2.579ArgAsp: 2.579 ± 0.032
3.765ArgGlu: 3.765 ± 0.046
1.955ArgPhe: 1.955 ± 0.023
2.819ArgGly: 2.819 ± 0.044
1.422ArgHis: 1.422 ± 0.02
2.609ArgIle: 2.609 ± 0.03
4.05ArgLys: 4.05 ± 0.049
4.936ArgLeu: 4.936 ± 0.045
1.175ArgMet: 1.175 ± 0.017
2.267ArgAsn: 2.267 ± 0.028
2.367ArgPro: 2.367 ± 0.03
2.471ArgGln: 2.471 ± 0.027
3.62ArgArg: 3.62 ± 0.044
3.893ArgSer: 3.893 ± 0.046
2.675ArgThr: 2.675 ± 0.031
2.941ArgVal: 2.941 ± 0.03
0.624ArgTrp: 0.624 ± 0.014
1.594ArgTyr: 1.594 ± 0.022
0.001ArgXaa: 0.001 ± 0.0
Ser
5.045SerAla: 5.045 ± 0.049
1.817SerCys: 1.817 ± 0.029
4.086SerAsp: 4.086 ± 0.038
5.135SerGlu: 5.135 ± 0.047
3.167SerPhe: 3.167 ± 0.033
4.905SerGly: 4.905 ± 0.05
2.009SerHis: 2.009 ± 0.028
3.567SerIle: 3.567 ± 0.032
4.576SerLys: 4.576 ± 0.047
7.987SerLeu: 7.987 ± 0.059
1.631SerMet: 1.631 ± 0.021
3.147SerAsn: 3.147 ± 0.032
5.1SerPro: 5.1 ± 0.062
3.686SerGln: 3.686 ± 0.045
4.014SerArg: 4.014 ± 0.043
9.102SerSer: 9.102 ± 0.108
4.544SerThr: 4.544 ± 0.039
5.082SerVal: 5.082 ± 0.04
0.981SerTrp: 0.981 ± 0.017
2.233SerTyr: 2.233 ± 0.029
0.001SerXaa: 0.001 ± 0.0
Thr
3.805ThrAla: 3.805 ± 0.034
1.306ThrCys: 1.306 ± 0.031
2.752ThrAsp: 2.752 ± 0.027
3.827ThrGlu: 3.827 ± 0.039
2.219ThrPhe: 2.219 ± 0.024
3.412ThrGly: 3.412 ± 0.036
1.174ThrHis: 1.174 ± 0.019
2.607ThrIle: 2.607 ± 0.033
2.994ThrLys: 2.994 ± 0.029
5.15ThrLeu: 5.15 ± 0.04
1.145ThrMet: 1.145 ± 0.019
1.999ThrAsn: 1.999 ± 0.024
3.066ThrPro: 3.066 ± 0.038
2.172ThrGln: 2.172 ± 0.028
2.302ThrArg: 2.302 ± 0.026
4.645ThrSer: 4.645 ± 0.047
3.039ThrThr: 3.039 ± 0.039
4.163ThrVal: 4.163 ± 0.043
0.661ThrTrp: 0.661 ± 0.013
1.586ThrTyr: 1.586 ± 0.025
0.0ThrXaa: 0.0 ± 0.0
Val
4.055ValAla: 4.055 ± 0.038
1.537ValCys: 1.537 ± 0.025
3.141ValAsp: 3.141 ± 0.03
4.067ValGlu: 4.067 ± 0.038
2.67ValPhe: 2.67 ± 0.034
3.279ValGly: 3.279 ± 0.036
1.595ValHis: 1.595 ± 0.021
3.347ValIle: 3.347 ± 0.032
3.931ValLys: 3.931 ± 0.037
6.282ValLeu: 6.282 ± 0.06
1.4ValMet: 1.4 ± 0.02
2.635ValAsn: 2.635 ± 0.033
3.389ValPro: 3.389 ± 0.031
2.866ValGln: 2.866 ± 0.028
2.991ValArg: 2.991 ± 0.026
4.962ValSer: 4.962 ± 0.04
3.858ValThr: 3.858 ± 0.039
4.291ValVal: 4.291 ± 0.045
0.724ValTrp: 0.724 ± 0.016
1.857ValTyr: 1.857 ± 0.023
0.0ValXaa: 0.0 ± 0.0
Trp
0.667TrpAla: 0.667 ± 0.014
0.239TrpCys: 0.239 ± 0.008
0.684TrpAsp: 0.684 ± 0.015
0.775TrpGlu: 0.775 ± 0.017
0.453TrpPhe: 0.453 ± 0.01
0.644TrpGly: 0.644 ± 0.018
0.313TrpHis: 0.313 ± 0.008
0.63TrpIle: 0.63 ± 0.013
0.903TrpLys: 0.903 ± 0.014
1.177TrpLeu: 1.177 ± 0.021
0.32TrpMet: 0.32 ± 0.009
0.655TrpAsn: 0.655 ± 0.015
0.428TrpPro: 0.428 ± 0.012
0.556TrpGln: 0.556 ± 0.012
0.66TrpArg: 0.66 ± 0.013
0.885TrpSer: 0.885 ± 0.02
0.64TrpThr: 0.64 ± 0.016
0.663TrpVal: 0.663 ± 0.013
0.186TrpTrp: 0.186 ± 0.008
0.391TrpTyr: 0.391 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.577TyrAla: 1.577 ± 0.021
0.784TyrCys: 0.784 ± 0.017
1.478TyrAsp: 1.478 ± 0.021
1.848TyrGlu: 1.848 ± 0.023
1.406TyrPhe: 1.406 ± 0.027
1.786TyrGly: 1.786 ± 0.023
0.801TyrHis: 0.801 ± 0.013
1.695TyrIle: 1.695 ± 0.022
1.723TyrLys: 1.723 ± 0.023
2.99TyrLeu: 2.99 ± 0.034
0.687TyrMet: 0.687 ± 0.013
1.319TyrAsn: 1.319 ± 0.019
1.366TyrPro: 1.366 ± 0.019
1.34TyrGln: 1.34 ± 0.018
1.71TyrArg: 1.71 ± 0.025
2.383TyrSer: 2.383 ± 0.027
1.659TyrThr: 1.659 ± 0.021
1.786TyrVal: 1.786 ± 0.023
0.4TyrTrp: 0.4 ± 0.012
1.117TyrTyr: 1.117 ± 0.019
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.027XaaXaa: 0.027 ± 0.007
Statistics based on 9152 proteins (3916303 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski