Amino acid dipepetide frequency for Sapajus apella (Brown-capped capuchin) (Cebus apella)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.863AlaAla: 6.863 ± 0.028
1.303AlaCys: 1.303 ± 0.008
2.985AlaAsp: 2.985 ± 0.011
5.028AlaGlu: 5.028 ± 0.023
2.463AlaPhe: 2.463 ± 0.011
4.715AlaGly: 4.715 ± 0.026
1.514AlaHis: 1.514 ± 0.007
2.682AlaIle: 2.682 ± 0.01
3.587AlaLys: 3.587 ± 0.022
6.958AlaLeu: 6.958 ± 0.023
1.489AlaMet: 1.489 ± 0.008
2.005AlaAsn: 2.005 ± 0.009
4.253AlaPro: 4.253 ± 0.023
3.448AlaGln: 3.448 ± 0.016
3.717AlaArg: 3.717 ± 0.016
6.057AlaSer: 6.057 ± 0.017
3.697AlaThr: 3.697 ± 0.017
4.566AlaVal: 4.566 ± 0.017
0.765AlaTrp: 0.765 ± 0.005
1.464AlaTyr: 1.464 ± 0.01
0.003AlaXaa: 0.003 ± 0.0
Cys
1.157CysAla: 1.157 ± 0.008
0.545CysCys: 0.545 ± 0.006
0.965CysAsp: 0.965 ± 0.007
1.318CysGlu: 1.318 ± 0.013
0.758CysPhe: 0.758 ± 0.006
1.662CysGly: 1.662 ± 0.013
0.628CysHis: 0.628 ± 0.004
0.873CysIle: 0.873 ± 0.007
1.103CysLys: 1.103 ± 0.008
2.002CysLeu: 2.002 ± 0.01
0.381CysMet: 0.381 ± 0.004
0.752CysAsn: 0.752 ± 0.006
1.268CysPro: 1.268 ± 0.01
1.019CysGln: 1.019 ± 0.007
1.184CysArg: 1.184 ± 0.006
1.892CysSer: 1.892 ± 0.009
1.013CysThr: 1.013 ± 0.006
1.225CysVal: 1.225 ± 0.008
0.257CysTrp: 0.257 ± 0.003
0.522CysTyr: 0.522 ± 0.004
0.001CysXaa: 0.001 ± 0.0
Asp
2.965AspAla: 2.965 ± 0.014
0.998AspCys: 0.998 ± 0.007
2.739AspAsp: 2.739 ± 0.012
3.517AspGlu: 3.517 ± 0.014
2.063AspPhe: 2.063 ± 0.01
3.354AspGly: 3.354 ± 0.018
1.133AspHis: 1.133 ± 0.007
2.615AspIle: 2.615 ± 0.011
2.586AspLys: 2.586 ± 0.012
5.106AspLeu: 5.106 ± 0.015
1.144AspMet: 1.144 ± 0.007
1.655AspAsn: 1.655 ± 0.009
2.894AspPro: 2.894 ± 0.011
1.922AspGln: 1.922 ± 0.007
2.443AspArg: 2.443 ± 0.01
4.405AspSer: 4.405 ± 0.018
2.577AspThr: 2.577 ± 0.011
3.106AspVal: 3.106 ± 0.013
0.585AspTrp: 0.585 ± 0.005
1.463AspTyr: 1.463 ± 0.009
0.001AspXaa: 0.001 ± 0.0
Glu
5.47GluAla: 5.47 ± 0.026
1.438GluCys: 1.438 ± 0.013
4.641GluAsp: 4.641 ± 0.016
8.49GluGlu: 8.49 ± 0.037
1.969GluPhe: 1.969 ± 0.009
4.355GluGly: 4.355 ± 0.02
1.54GluHis: 1.54 ± 0.008
3.203GluIle: 3.203 ± 0.015
5.787GluLys: 5.787 ± 0.03
6.677GluLeu: 6.677 ± 0.028
1.733GluMet: 1.733 ± 0.008
3.251GluAsn: 3.251 ± 0.013
3.408GluPro: 3.408 ± 0.016
3.43GluGln: 3.43 ± 0.017
4.236GluArg: 4.236 ± 0.018
4.686GluSer: 4.686 ± 0.015
3.547GluThr: 3.547 ± 0.013
4.304GluVal: 4.304 ± 0.017
0.703GluTrp: 0.703 ± 0.005
1.6GluTyr: 1.6 ± 0.011
0.002GluXaa: 0.002 ± 0.0
Phe
1.837PheAla: 1.837 ± 0.009
0.818PheCys: 0.818 ± 0.005
1.612PheAsp: 1.612 ± 0.008
1.959PheGlu: 1.959 ± 0.009
1.397PhePhe: 1.397 ± 0.009
2.018PheGly: 2.018 ± 0.012
0.998PheHis: 0.998 ± 0.006
1.69PheIle: 1.69 ± 0.01
1.713PheLys: 1.713 ± 0.008
3.725PheLeu: 3.725 ± 0.015
0.731PheMet: 0.731 ± 0.005
1.268PheAsn: 1.268 ± 0.008
1.928PhePro: 1.928 ± 0.008
1.761PheGln: 1.761 ± 0.009
1.892PheArg: 1.892 ± 0.01
3.303PheSer: 3.303 ± 0.011
1.922PheThr: 1.922 ± 0.01
1.932PheVal: 1.932 ± 0.009
0.437PheTrp: 0.437 ± 0.004
1.048PheTyr: 1.048 ± 0.007
0.002PheXaa: 0.002 ± 0.0
Gly
4.419GlyAla: 4.419 ± 0.022
1.177GlyCys: 1.177 ± 0.007
3.131GlyAsp: 3.131 ± 0.013
4.245GlyGlu: 4.245 ± 0.021
2.22GlyPhe: 2.22 ± 0.013
4.863GlyGly: 4.863 ± 0.03
1.67GlyHis: 1.67 ± 0.009
2.615GlyIle: 2.615 ± 0.011
3.821GlyLys: 3.821 ± 0.015
5.641GlyLeu: 5.641 ± 0.019
1.262GlyMet: 1.262 ± 0.007
2.286GlyAsn: 2.286 ± 0.011
4.233GlyPro: 4.233 ± 0.035
2.818GlyGln: 2.818 ± 0.013
3.752GlyArg: 3.752 ± 0.014
5.962GlySer: 5.962 ± 0.019
3.596GlyThr: 3.596 ± 0.016
3.374GlyVal: 3.374 ± 0.016
0.757GlyTrp: 0.757 ± 0.007
1.626GlyTyr: 1.626 ± 0.01
0.003GlyXaa: 0.003 ± 0.0
His
1.346HisAla: 1.346 ± 0.008
0.68HisCys: 0.68 ± 0.005
0.879HisAsp: 0.879 ± 0.005
1.361HisGlu: 1.361 ± 0.008
1.007HisPhe: 1.007 ± 0.006
1.478HisGly: 1.478 ± 0.008
0.941HisHis: 0.941 ± 0.008
1.262HisIle: 1.262 ± 0.008
1.324HisLys: 1.324 ± 0.009
2.889HisLeu: 2.889 ± 0.012
0.567HisMet: 0.567 ± 0.004
0.843HisAsn: 0.843 ± 0.006
1.671HisPro: 1.671 ± 0.008
1.394HisGln: 1.394 ± 0.009
1.583HisArg: 1.583 ± 0.008
2.412HisSer: 2.412 ± 0.01
1.569HisThr: 1.569 ± 0.011
1.504HisVal: 1.504 ± 0.008
0.325HisTrp: 0.325 ± 0.004
0.784HisTyr: 0.784 ± 0.005
0.001HisXaa: 0.001 ± 0.0
Ile
2.519IleAla: 2.519 ± 0.01
0.978IleCys: 0.978 ± 0.006
1.994IleAsp: 1.994 ± 0.011
2.645IleGlu: 2.645 ± 0.011
1.699IlePhe: 1.699 ± 0.01
2.105IleGly: 2.105 ± 0.009
1.374IleHis: 1.374 ± 0.01
2.289IleIle: 2.289 ± 0.011
2.618IleLys: 2.618 ± 0.013
4.36IleLeu: 4.36 ± 0.016
0.939IleMet: 0.939 ± 0.006
1.797IleAsn: 1.797 ± 0.009
2.607IlePro: 2.607 ± 0.012
2.318IleGln: 2.318 ± 0.012
2.336IleArg: 2.336 ± 0.009
3.774IleSer: 3.774 ± 0.013
2.536IleThr: 2.536 ± 0.013
2.341IleVal: 2.341 ± 0.012
0.452IleTrp: 0.452 ± 0.003
1.268IleTyr: 1.268 ± 0.007
0.001IleXaa: 0.001 ± 0.0
Lys
4.199LysAla: 4.199 ± 0.018
1.136LysCys: 1.136 ± 0.01
3.321LysAsp: 3.321 ± 0.017
5.447LysGlu: 5.447 ± 0.027
1.699LysPhe: 1.699 ± 0.009
3.343LysGly: 3.343 ± 0.019
1.438LysHis: 1.438 ± 0.009
2.732LysIle: 2.732 ± 0.013
4.843LysLys: 4.843 ± 0.027
5.253LysLeu: 5.253 ± 0.019
1.45LysMet: 1.45 ± 0.008
2.431LysAsn: 2.431 ± 0.012
3.219LysPro: 3.219 ± 0.017
2.773LysGln: 2.773 ± 0.014
3.334LysArg: 3.334 ± 0.013
4.083LysSer: 4.083 ± 0.016
3.201LysThr: 3.201 ± 0.013
3.463LysVal: 3.463 ± 0.012
0.596LysTrp: 0.596 ± 0.005
1.548LysTyr: 1.548 ± 0.014
0.002LysXaa: 0.002 ± 0.0
Leu
6.622LeuAla: 6.622 ± 0.019
2.018LeuCys: 2.018 ± 0.011
4.71LeuAsp: 4.71 ± 0.016
7.524LeuGlu: 7.524 ± 0.03
3.082LeuPhe: 3.082 ± 0.014
5.595LeuGly: 5.595 ± 0.018
2.725LeuHis: 2.725 ± 0.011
3.65LeuIle: 3.65 ± 0.014
5.882LeuLys: 5.882 ± 0.024
10.303LeuLeu: 10.303 ± 0.037
1.916LeuMet: 1.916 ± 0.009
3.479LeuAsn: 3.479 ± 0.013
6.066LeuPro: 6.066 ± 0.019
6.043LeuGln: 6.043 ± 0.025
5.923LeuArg: 5.923 ± 0.02
8.091LeuSer: 8.091 ± 0.02
4.979LeuThr: 4.979 ± 0.014
5.185LeuVal: 5.185 ± 0.018
1.036LeuTrp: 1.036 ± 0.007
2.401LeuTyr: 2.401 ± 0.012
0.004LeuXaa: 0.004 ± 0.0
Met
1.801MetAla: 1.801 ± 0.008
0.376MetCys: 0.376 ± 0.004
1.222MetAsp: 1.222 ± 0.006
1.931MetGlu: 1.931 ± 0.008
0.672MetPhe: 0.672 ± 0.005
1.226MetGly: 1.226 ± 0.008
0.479MetHis: 0.479 ± 0.004
0.796MetIle: 0.796 ± 0.006
1.447MetLys: 1.447 ± 0.007
1.95MetLeu: 1.95 ± 0.009
0.562MetMet: 0.562 ± 0.005
0.905MetAsn: 0.905 ± 0.006
1.139MetPro: 1.139 ± 0.009
0.967MetGln: 0.967 ± 0.005
1.039MetArg: 1.039 ± 0.006
1.559MetSer: 1.559 ± 0.006
1.085MetThr: 1.085 ± 0.006
1.352MetVal: 1.352 ± 0.007
0.231MetTrp: 0.231 ± 0.003
0.522MetTyr: 0.522 ± 0.004
0.0MetXaa: 0.0 ± 0.0
Asn
2.071AsnAla: 2.071 ± 0.009
0.755AsnCys: 0.755 ± 0.005
1.551AsnAsp: 1.551 ± 0.008
2.253AsnGlu: 2.253 ± 0.01
1.388AsnPhe: 1.388 ± 0.008
2.391AsnGly: 2.391 ± 0.014
0.939AsnHis: 0.939 ± 0.006
2.068AsnIle: 2.068 ± 0.011
2.232AsnLys: 2.232 ± 0.011
3.702AsnLeu: 3.702 ± 0.013
0.903AsnMet: 0.903 ± 0.006
1.483AsnAsn: 1.483 ± 0.009
2.072AsnPro: 2.072 ± 0.009
1.752AsnGln: 1.752 ± 0.01
1.794AsnArg: 1.794 ± 0.008
3.229AsnSer: 3.229 ± 0.012
1.993AsnThr: 1.993 ± 0.009
2.157AsnVal: 2.157 ± 0.009
0.425AsnTrp: 0.425 ± 0.004
1.084AsnTyr: 1.084 ± 0.01
0.001AsnXaa: 0.001 ± 0.0
Pro
5.06ProAla: 5.06 ± 0.025
1.077ProCys: 1.077 ± 0.008
2.87ProAsp: 2.87 ± 0.015
4.617ProGlu: 4.617 ± 0.018
1.821ProPhe: 1.821 ± 0.009
5.302ProGly: 5.302 ± 0.043
1.462ProHis: 1.462 ± 0.007
1.917ProIle: 1.917 ± 0.01
2.915ProLys: 2.915 ± 0.019
5.256ProLeu: 5.256 ± 0.017
1.038ProMet: 1.038 ± 0.006
1.805ProAsn: 1.805 ± 0.01
6.366ProPro: 6.366 ± 0.041
2.98ProGln: 2.98 ± 0.014
3.37ProArg: 3.37 ± 0.016
6.045ProSer: 6.045 ± 0.026
3.232ProThr: 3.232 ± 0.016
3.885ProVal: 3.885 ± 0.016
0.652ProTrp: 0.652 ± 0.005
1.498ProTyr: 1.498 ± 0.012
0.004ProXaa: 0.004 ± 0.0
Gln
3.763GlnAla: 3.763 ± 0.018
0.929GlnCys: 0.929 ± 0.007
2.42GlnAsp: 2.42 ± 0.009
4.211GlnGlu: 4.211 ± 0.019
1.355GlnPhe: 1.355 ± 0.006
2.891GlnGly: 2.891 ± 0.013
1.358GlnHis: 1.358 ± 0.008
2.042GlnIle: 2.042 ± 0.009
3.065GlnLys: 3.065 ± 0.015
4.947GlnLeu: 4.947 ± 0.022
1.132GlnMet: 1.132 ± 0.006
1.927GlnAsn: 1.927 ± 0.011
3.026GlnPro: 3.026 ± 0.015
3.505GlnGln: 3.505 ± 0.027
3.102GlnArg: 3.102 ± 0.014
3.416GlnSer: 3.416 ± 0.014
2.417GlnThr: 2.417 ± 0.01
2.845GlnVal: 2.845 ± 0.01
0.546GlnTrp: 0.546 ± 0.005
1.132GlnTyr: 1.132 ± 0.007
0.002GlnXaa: 0.002 ± 0.0
Arg
3.842ArgAla: 3.842 ± 0.016
1.154ArgCys: 1.154 ± 0.009
2.773ArgAsp: 2.773 ± 0.012
4.165ArgGlu: 4.165 ± 0.017
1.758ArgPhe: 1.758 ± 0.008
3.56ArgGly: 3.56 ± 0.016
1.529ArgHis: 1.529 ± 0.008
2.392ArgIle: 2.392 ± 0.011
3.755ArgLys: 3.755 ± 0.014
5.446ArgLeu: 5.446 ± 0.018
1.175ArgMet: 1.175 ± 0.006
2.107ArgAsn: 2.107 ± 0.009
3.321ArgPro: 3.321 ± 0.014
2.784ArgGln: 2.784 ± 0.014
4.444ArgArg: 4.444 ± 0.021
4.446ArgSer: 4.446 ± 0.019
2.879ArgThr: 2.879 ± 0.012
3.139ArgVal: 3.139 ± 0.012
0.671ArgTrp: 0.671 ± 0.005
1.365ArgTyr: 1.365 ± 0.006
0.002ArgXaa: 0.002 ± 0.0
Ser
5.563SerAla: 5.563 ± 0.019
1.767SerCys: 1.767 ± 0.011
4.159SerAsp: 4.159 ± 0.022
5.602SerGlu: 5.602 ± 0.017
2.959SerPhe: 2.959 ± 0.011
5.656SerGly: 5.656 ± 0.021
2.22SerHis: 2.22 ± 0.009
3.248SerIle: 3.248 ± 0.011
4.293SerLys: 4.293 ± 0.017
8.347SerLeu: 8.347 ± 0.022
1.646SerMet: 1.646 ± 0.008
2.772SerAsn: 2.772 ± 0.011
6.481SerPro: 6.481 ± 0.03
4.166SerGln: 4.166 ± 0.015
4.702SerArg: 4.702 ± 0.017
10.162SerSer: 10.162 ± 0.044
4.687SerThr: 4.687 ± 0.02
4.93SerVal: 4.93 ± 0.015
1.024SerTrp: 1.024 ± 0.006
1.975SerTyr: 1.975 ± 0.009
0.003SerXaa: 0.003 ± 0.0
Thr
3.814ThrAla: 3.814 ± 0.013
1.219ThrCys: 1.219 ± 0.009
2.481ThrAsp: 2.481 ± 0.012
3.731ThrGlu: 3.731 ± 0.013
1.981ThrPhe: 1.981 ± 0.009
3.568ThrGly: 3.568 ± 0.016
1.312ThrHis: 1.312 ± 0.008
2.312ThrIle: 2.312 ± 0.011
2.692ThrLys: 2.692 ± 0.011
5.261ThrLeu: 5.261 ± 0.016
1.077ThrMet: 1.077 ± 0.005
1.694ThrAsn: 1.694 ± 0.009
3.852ThrPro: 3.852 ± 0.019
2.416ThrGln: 2.416 ± 0.01
2.52ThrArg: 2.52 ± 0.01
4.909ThrSer: 4.909 ± 0.022
3.084ThrThr: 3.084 ± 0.019
3.812ThrVal: 3.812 ± 0.017
0.669ThrTrp: 0.669 ± 0.006
1.354ThrTyr: 1.354 ± 0.006
0.002ThrXaa: 0.002 ± 0.0
Val
4.234ValAla: 4.234 ± 0.016
1.322ValCys: 1.322 ± 0.009
2.977ValAsp: 2.977 ± 0.012
3.909ValGlu: 3.909 ± 0.016
2.219ValPhe: 2.219 ± 0.01
3.2ValGly: 3.2 ± 0.014
1.553ValHis: 1.553 ± 0.008
2.751ValIle: 2.751 ± 0.014
3.452ValLys: 3.452 ± 0.014
5.811ValLeu: 5.811 ± 0.017
1.263ValMet: 1.263 ± 0.007
2.246ValAsn: 2.246 ± 0.011
3.679ValPro: 3.679 ± 0.017
2.79ValGln: 2.79 ± 0.011
3.023ValArg: 3.023 ± 0.016
4.909ValSer: 4.909 ± 0.016
3.707ValThr: 3.707 ± 0.019
3.821ValVal: 3.821 ± 0.017
0.66ValTrp: 0.66 ± 0.005
1.49ValTyr: 1.49 ± 0.007
0.002ValXaa: 0.002 ± 0.0
Trp
0.724TrpAla: 0.724 ± 0.005
0.221TrpCys: 0.221 ± 0.003
0.618TrpAsp: 0.618 ± 0.005
0.794TrpGlu: 0.794 ± 0.005
0.404TrpPhe: 0.404 ± 0.004
0.668TrpGly: 0.668 ± 0.006
0.296TrpHis: 0.296 ± 0.003
0.509TrpIle: 0.509 ± 0.004
0.75TrpLys: 0.75 ± 0.005
1.161TrpLeu: 1.161 ± 0.009
0.305TrpMet: 0.305 ± 0.003
0.517TrpAsn: 0.517 ± 0.005
0.497TrpPro: 0.497 ± 0.004
0.509TrpGln: 0.509 ± 0.004
0.719TrpArg: 0.719 ± 0.005
0.85TrpSer: 0.85 ± 0.007
0.637TrpThr: 0.637 ± 0.005
0.624TrpVal: 0.624 ± 0.005
0.173TrpTrp: 0.173 ± 0.002
0.319TrpTyr: 0.319 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.29TyrAla: 1.29 ± 0.007
0.611TyrCys: 0.611 ± 0.005
1.219TyrAsp: 1.219 ± 0.008
1.72TyrGlu: 1.72 ± 0.011
1.076TyrPhe: 1.076 ± 0.006
1.518TyrGly: 1.518 ± 0.009
0.713TyrHis: 0.713 ± 0.005
1.304TyrIle: 1.304 ± 0.007
1.648TyrLys: 1.648 ± 0.028
2.439TyrLeu: 2.439 ± 0.01
0.548TyrMet: 0.548 ± 0.005
1.02TyrAsn: 1.02 ± 0.006
1.187TyrPro: 1.187 ± 0.007
1.234TyrGln: 1.234 ± 0.007
1.593TyrArg: 1.593 ± 0.01
2.166TyrSer: 2.166 ± 0.011
1.4TyrThr: 1.4 ± 0.009
1.447TyrVal: 1.447 ± 0.007
0.333TyrTrp: 0.333 ± 0.004
0.846TyrTyr: 0.846 ± 0.006
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.004XaaGly: 0.004 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.003XaaLeu: 0.003 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.002XaaGln: 0.002 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.003XaaSer: 0.003 ± 0.0
0.002XaaThr: 0.002 ± 0.0
0.003XaaVal: 0.003 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.007XaaXaa: 0.007 ± 0.002
Statistics based on 49368 proteins (36383883 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski