Amino acid dipeptide frequency for Cercocebus atys (Sooty mangabey) (Cercocebus torquatus atys)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
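The counting described above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: the function name `dipeptide_per_mille`, the one-letter alphabet, the mapping of every non-standard symbol to `X` (Xaa), the use of overlapping pairs within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before pairing.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping neighbours: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKLLA", "LLKB"]
freqs = dipeptide_per_mille(toy)
print(len(freqs))    # number of table cells (21 x 21)
print(freqs["LL"])   # LeuLeu frequency in per mille for the toy input
```

Pairs are never counted across protein boundaries in this sketch, so a proteome with N residues in P proteins contributes N - P pairs.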

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
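As a worked example of the unit conversion, the snippet below turns a few per mille values from the table into fractions and compares them with the uniform expectation of 0.25% (2.5‰). The helper names and the use of the uniform expectation as the over/under threshold are assumptions for illustration; the page does not state exactly how its colour thresholds are computed.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille (0.25 %).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
examples = {"LeuLeu": 10.68, "AlaAla": 6.786, "TrpTrp": 0.195}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```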

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.786 ± 0.032
AlaCys: 1.367 ± 0.009
AlaAsp: 2.916 ± 0.012
AlaGlu: 4.801 ± 0.022
AlaPhe: 2.62 ± 0.012
AlaGly: 4.723 ± 0.022
AlaHis: 1.552 ± 0.009
AlaIle: 2.769 ± 0.01
AlaLys: 3.46 ± 0.017
AlaLeu: 7.045 ± 0.024
AlaMet: 1.488 ± 0.009
AlaAsn: 2.014 ± 0.01
AlaPro: 4.143 ± 0.023
AlaGln: 3.244 ± 0.018
AlaArg: 3.639 ± 0.016
AlaSer: 5.71 ± 0.022
AlaThr: 3.63 ± 0.015
AlaVal: 4.682 ± 0.018
AlaTrp: 0.801 ± 0.008
AlaTyr: 1.516 ± 0.009
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.214 ± 0.008
CysCys: 0.636 ± 0.007
CysAsp: 1.014 ± 0.009
CysGlu: 1.33 ± 0.01
CysPhe: 0.831 ± 0.008
CysGly: 1.752 ± 0.019
CysHis: 0.669 ± 0.007
CysIle: 0.95 ± 0.009
CysLys: 1.176 ± 0.01
CysLeu: 2.18 ± 0.013
CysMet: 0.414 ± 0.004
CysAsn: 0.827 ± 0.009
CysPro: 1.38 ± 0.013
CysGln: 1.075 ± 0.01
CysArg: 1.299 ± 0.009
CysSer: 2.001 ± 0.014
CysThr: 1.083 ± 0.008
CysVal: 1.301 ± 0.011
CysTrp: 0.297 ± 0.004
CysTyr: 0.57 ± 0.006
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.879 ± 0.014
AspCys: 1.062 ± 0.011
AspAsp: 2.66 ± 0.016
AspGlu: 3.49 ± 0.017
AspPhe: 2.109 ± 0.013
AspGly: 3.326 ± 0.019
AspHis: 1.137 ± 0.007
AspIle: 2.612 ± 0.012
AspLys: 2.572 ± 0.011
AspLeu: 5.012 ± 0.019
AspMet: 1.113 ± 0.008
AspAsn: 1.688 ± 0.009
AspPro: 2.893 ± 0.013
AspGln: 1.841 ± 0.009
AspArg: 2.437 ± 0.012
AspSer: 4.206 ± 0.017
AspThr: 2.492 ± 0.011
AspVal: 3.088 ± 0.016
AspTrp: 0.622 ± 0.006
AspTyr: 1.489 ± 0.009
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.287 ± 0.024
GluCys: 1.515 ± 0.02
GluAsp: 4.506 ± 0.019
GluGlu: 8.082 ± 0.041
GluPhe: 2.059 ± 0.01
GluGly: 4.211 ± 0.016
GluHis: 1.525 ± 0.008
GluIle: 3.192 ± 0.018
GluLys: 5.563 ± 0.028
GluLeu: 6.518 ± 0.03
GluMet: 1.698 ± 0.01
GluAsn: 3.193 ± 0.015
GluPro: 3.284 ± 0.023
GluGln: 3.172 ± 0.018
GluArg: 4.04 ± 0.017
GluSer: 4.406 ± 0.018
GluThr: 3.44 ± 0.015
GluVal: 4.212 ± 0.02
GluTrp: 0.71 ± 0.005
GluTyr: 1.636 ± 0.013
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.95 ± 0.01
PheCys: 0.912 ± 0.007
PheAsp: 1.681 ± 0.01
PheGlu: 2.023 ± 0.01
PhePhe: 1.733 ± 0.023
PheGly: 2.183 ± 0.016
PheHis: 1.047 ± 0.007
PheIle: 1.801 ± 0.011
PheLys: 1.817 ± 0.01
PheLeu: 4.019 ± 0.019
PheMet: 0.776 ± 0.007
PheAsn: 1.352 ± 0.009
PhePro: 1.977 ± 0.011
PheGln: 1.797 ± 0.009
PheArg: 1.988 ± 0.013
PheSer: 3.395 ± 0.017
PheThr: 2.037 ± 0.011
PheVal: 2.109 ± 0.012
PheTrp: 0.488 ± 0.005
PheTyr: 1.162 ± 0.009
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.462 ± 0.022
GlyCys: 1.262 ± 0.011
GlyAsp: 3.152 ± 0.014
GlyGlu: 4.229 ± 0.024
GlyPhe: 2.379 ± 0.017
GlyGly: 5.021 ± 0.033
GlyHis: 1.682 ± 0.011
GlyIle: 2.771 ± 0.013
GlyLys: 3.885 ± 0.019
GlyLeu: 5.836 ± 0.021
GlyMet: 1.312 ± 0.009
GlyAsn: 2.331 ± 0.011
GlyPro: 4.245 ± 0.038
GlyGln: 2.76 ± 0.015
GlyArg: 3.742 ± 0.018
GlySer: 5.813 ± 0.023
GlyThr: 3.564 ± 0.015
GlyVal: 3.536 ± 0.015
GlyTrp: 0.807 ± 0.007
GlyTyr: 1.726 ± 0.011
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.329 ± 0.008
HisCys: 0.731 ± 0.007
HisAsp: 0.892 ± 0.007
HisGlu: 1.347 ± 0.008
HisPhe: 1.072 ± 0.007
HisGly: 1.542 ± 0.011
HisHis: 0.918 ± 0.009
HisIle: 1.258 ± 0.008
HisLys: 1.304 ± 0.01
HisLeu: 2.93 ± 0.015
HisMet: 0.59 ± 0.005
HisAsn: 0.877 ± 0.006
HisPro: 1.654 ± 0.01
HisGln: 1.363 ± 0.012
HisArg: 1.607 ± 0.009
HisSer: 2.32 ± 0.012
HisThr: 1.577 ± 0.015
HisVal: 1.48 ± 0.009
HisTrp: 0.353 ± 0.004
HisTyr: 0.797 ± 0.006
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.577 ± 0.011
IleCys: 1.067 ± 0.008
IleAsp: 2.038 ± 0.012
IleGlu: 2.635 ± 0.015
IlePhe: 1.881 ± 0.012
IleGly: 2.203 ± 0.011
IleHis: 1.403 ± 0.012
IleIle: 2.377 ± 0.015
IleLys: 2.68 ± 0.016
IleLeu: 4.551 ± 0.018
IleMet: 0.989 ± 0.007
IleAsn: 1.846 ± 0.011
IlePro: 2.609 ± 0.013
IleGln: 2.31 ± 0.012
IleArg: 2.374 ± 0.01
IleSer: 3.705 ± 0.016
IleThr: 2.575 ± 0.017
IleVal: 2.502 ± 0.016
IleTrp: 0.506 ± 0.005
IleTyr: 1.364 ± 0.009
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.052 ± 0.021
LysCys: 1.214 ± 0.012
LysAsp: 3.21 ± 0.019
LysGlu: 5.224 ± 0.026
LysPhe: 1.746 ± 0.01
LysGly: 3.268 ± 0.021
LysHis: 1.425 ± 0.009
LysIle: 2.855 ± 0.014
LysLys: 4.884 ± 0.031
LysLeu: 5.223 ± 0.021
LysMet: 1.451 ± 0.008
LysAsn: 2.444 ± 0.013
LysPro: 3.207 ± 0.022
LysGln: 2.649 ± 0.015
LysArg: 3.334 ± 0.015
LysSer: 3.928 ± 0.018
LysThr: 3.156 ± 0.013
LysVal: 3.511 ± 0.019
LysTrp: 0.612 ± 0.006
LysTyr: 1.585 ± 0.012
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.709 ± 0.021
LeuCys: 2.188 ± 0.014
LeuAsp: 4.718 ± 0.016
LeuGlu: 7.248 ± 0.032
LeuPhe: 3.345 ± 0.017
LeuGly: 5.826 ± 0.022
LeuHis: 2.718 ± 0.013
LeuIle: 3.857 ± 0.015
LeuLys: 5.869 ± 0.023
LeuLeu: 10.68 ± 0.035
LeuMet: 2.005 ± 0.009
LeuAsn: 3.487 ± 0.016
LeuPro: 5.98 ± 0.024
LeuGln: 5.824 ± 0.03
LeuArg: 5.893 ± 0.02
LeuSer: 7.934 ± 0.025
LeuThr: 5.071 ± 0.018
LeuVal: 5.435 ± 0.016
LeuTrp: 1.134 ± 0.008
LeuTyr: 2.532 ± 0.014
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.912 ± 0.009
MetCys: 0.401 ± 0.005
MetAsp: 1.264 ± 0.009
MetGlu: 1.89 ± 0.01
MetPhe: 0.717 ± 0.006
MetGly: 1.295 ± 0.008
MetHis: 0.476 ± 0.004
MetIle: 0.844 ± 0.006
MetLys: 1.463 ± 0.009
MetLeu: 1.957 ± 0.009
MetMet: 0.573 ± 0.005
MetAsn: 0.909 ± 0.007
MetPro: 1.094 ± 0.01
MetGln: 0.926 ± 0.007
MetArg: 1.041 ± 0.007
MetSer: 1.55 ± 0.009
MetThr: 1.125 ± 0.008
MetVal: 1.379 ± 0.008
MetTrp: 0.238 ± 0.003
MetTyr: 0.555 ± 0.005
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.065 ± 0.01
AsnCys: 0.842 ± 0.008
AsnAsp: 1.53 ± 0.009
AsnGlu: 2.252 ± 0.012
AsnPhe: 1.472 ± 0.009
AsnGly: 2.461 ± 0.012
AsnHis: 0.952 ± 0.006
AsnIle: 2.128 ± 0.011
AsnLys: 2.232 ± 0.012
AsnLeu: 3.759 ± 0.014
AsnMet: 0.906 ± 0.007
AsnAsn: 1.536 ± 0.011
AsnPro: 2.162 ± 0.012
AsnGln: 1.682 ± 0.009
AsnArg: 1.852 ± 0.012
AsnSer: 3.138 ± 0.014
AsnThr: 1.956 ± 0.012
AsnVal: 2.202 ± 0.011
AsnTrp: 0.452 ± 0.004
AsnTyr: 1.126 ± 0.008
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.864 ± 0.028
ProCys: 1.135 ± 0.011
ProAsp: 2.795 ± 0.012
ProGlu: 4.475 ± 0.021
ProPhe: 1.905 ± 0.011
ProGly: 5.256 ± 0.048
ProHis: 1.442 ± 0.009
ProIle: 1.91 ± 0.014
ProLys: 2.778 ± 0.018
ProLeu: 5.234 ± 0.02
ProMet: 1.031 ± 0.008
ProAsn: 1.82 ± 0.012
ProPro: 6.145 ± 0.043
ProGln: 2.839 ± 0.016
ProArg: 3.354 ± 0.017
ProSer: 5.703 ± 0.028
ProThr: 3.149 ± 0.019
ProVal: 3.814 ± 0.02
ProTrp: 0.698 ± 0.006
ProTyr: 1.568 ± 0.015
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.536 ± 0.018
GlnCys: 0.956 ± 0.009
GlnAsp: 2.337 ± 0.009
GlnGlu: 3.963 ± 0.02
GlnPhe: 1.387 ± 0.007
GlnGly: 2.838 ± 0.015
GlnHis: 1.318 ± 0.01
GlnIle: 1.991 ± 0.011
GlnLys: 3.014 ± 0.016
GlnLeu: 4.729 ± 0.023
GlnMet: 1.117 ± 0.008
GlnAsn: 1.87 ± 0.01
GlnPro: 2.82 ± 0.017
GlnGln: 3.165 ± 0.033
GlnArg: 2.994 ± 0.017
GlnSer: 3.159 ± 0.017
GlnThr: 2.296 ± 0.012
GlnVal: 2.817 ± 0.011
GlnTrp: 0.549 ± 0.005
GlnTyr: 1.148 ± 0.007
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.824 ± 0.015
ArgCys: 1.231 ± 0.011
ArgAsp: 2.765 ± 0.012
ArgGlu: 4.015 ± 0.017
ArgPhe: 1.841 ± 0.009
ArgGly: 3.599 ± 0.022
ArgHis: 1.557 ± 0.009
ArgIle: 2.467 ± 0.012
ArgLys: 3.705 ± 0.015
ArgLeu: 5.41 ± 0.021
ArgMet: 1.209 ± 0.008
ArgAsn: 2.132 ± 0.011
ArgPro: 3.218 ± 0.015
ArgGln: 2.642 ± 0.014
ArgArg: 4.344 ± 0.019
ArgSer: 4.248 ± 0.024
ArgThr: 2.82 ± 0.012
ArgVal: 3.164 ± 0.017
ArgTrp: 0.71 ± 0.006
ArgTyr: 1.431 ± 0.009
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.276 ± 0.019
SerCys: 1.836 ± 0.012
SerAsp: 3.899 ± 0.019
SerGlu: 5.221 ± 0.02
SerPhe: 3.029 ± 0.014
SerGly: 5.642 ± 0.026
SerHis: 2.152 ± 0.012
SerIle: 3.203 ± 0.013
SerLys: 4.094 ± 0.016
SerLeu: 8.176 ± 0.026
SerMet: 1.6 ± 0.009
SerAsn: 2.715 ± 0.014
SerPro: 6.054 ± 0.029
SerGln: 3.872 ± 0.019
SerArg: 4.547 ± 0.022
SerSer: 9.36 ± 0.045
SerThr: 4.468 ± 0.024
SerVal: 4.849 ± 0.018
SerTrp: 1.074 ± 0.008
SerTyr: 2.049 ± 0.011
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.767 ± 0.018
ThrCys: 1.29 ± 0.012
ThrAsp: 2.469 ± 0.011
ThrGlu: 3.565 ± 0.016
ThrPhe: 2.094 ± 0.012
ThrGly: 3.617 ± 0.017
ThrHis: 1.337 ± 0.01
ThrIle: 2.38 ± 0.013
ThrLys: 2.693 ± 0.017
ThrLeu: 5.287 ± 0.015
ThrMet: 1.108 ± 0.007
ThrAsn: 1.747 ± 0.009
ThrPro: 3.601 ± 0.022
ThrGln: 2.306 ± 0.012
ThrArg: 2.481 ± 0.011
ThrSer: 4.647 ± 0.024
ThrThr: 3.054 ± 0.027
ThrVal: 3.849 ± 0.019
ThrTrp: 0.702 ± 0.008
ThrTyr: 1.402 ± 0.008
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.242 ± 0.016
ValCys: 1.433 ± 0.009
ValAsp: 2.953 ± 0.014
ValGlu: 3.896 ± 0.021
ValPhe: 2.387 ± 0.012
ValGly: 3.374 ± 0.015
ValHis: 1.569 ± 0.01
ValIle: 2.903 ± 0.017
ValLys: 3.418 ± 0.019
ValLeu: 6.095 ± 0.019
ValMet: 1.312 ± 0.008
ValAsn: 2.271 ± 0.012
ValPro: 3.63 ± 0.019
ValGln: 2.732 ± 0.011
ValArg: 3.017 ± 0.015
ValSer: 4.842 ± 0.018
ValThr: 3.779 ± 0.026
ValVal: 3.933 ± 0.019
ValTrp: 0.714 ± 0.006
ValTyr: 1.611 ± 0.009
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.796 ± 0.007
TrpCys: 0.249 ± 0.004
TrpAsp: 0.65 ± 0.005
TrpGlu: 0.827 ± 0.007
TrpPhe: 0.443 ± 0.005
TrpGly: 0.733 ± 0.007
TrpHis: 0.311 ± 0.004
TrpIle: 0.541 ± 0.006
TrpLys: 0.817 ± 0.007
TrpLeu: 1.209 ± 0.009
TrpMet: 0.32 ± 0.004
TrpAsn: 0.539 ± 0.005
TrpPro: 0.534 ± 0.005
TrpGln: 0.537 ± 0.005
TrpArg: 0.735 ± 0.006
TrpSer: 0.885 ± 0.007
TrpThr: 0.678 ± 0.007
TrpVal: 0.692 ± 0.006
TrpTrp: 0.195 ± 0.003
TrpTyr: 0.334 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.382 ± 0.009
TyrCys: 0.673 ± 0.006
TyrAsp: 1.279 ± 0.008
TyrGlu: 1.735 ± 0.011
TyrPhe: 1.198 ± 0.009
TyrGly: 1.643 ± 0.011
TyrHis: 0.753 ± 0.006
TyrIle: 1.383 ± 0.009
TyrLys: 1.553 ± 0.017
TyrLeu: 2.604 ± 0.012
TyrMet: 0.598 ± 0.006
TyrAsn: 1.108 ± 0.009
TyrPro: 1.285 ± 0.008
TyrGln: 1.266 ± 0.008
TyrArg: 1.616 ± 0.01
TyrSer: 2.165 ± 0.012
TyrThr: 1.447 ± 0.008
TyrVal: 1.553 ± 0.009
TyrTrp: 0.363 ± 0.005
TyrTyr: 0.936 ± 0.006
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.035 ± 0.018
Statistics based on 45022 proteins (24628997 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
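A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual procedure: it assumes whole proteins are resampled with replacement, that the statistic is recomputed on each of the 100 resamples, and that the reported error is the standard deviation of those estimates. The names `bootstrap_sd` and `leu_leu_per_mille` and the toy sequences are hypothetical.

```python
import random
import statistics


def leu_leu_per_mille(proteins):
    """LeuLeu frequency in per mille over overlapping residue pairs."""
    pairs = 0
    hits = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            pairs += 1
            hits += (a == "L" and b == "L")
    return 1000.0 * hits / pairs if pairs else 0.0


def bootstrap_sd(proteins, statistic, n_resamples=100, seed=0):
    """Standard deviation of a statistic over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(statistic(sample))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKLLA", "LLKLL", "AGGSL", "MLLLP"]
error = bootstrap_sd(toy, leu_leu_per_mille)
print(leu_leu_per_mille(toy), "±", error)
```

Resampling proteins rather than individual residues keeps neighbouring residues together, so each resample still contains valid dipeptides.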

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski