Amino acid dipepetide frequency for Chelonia mydas (Green sea-turtle) (Chelonia agassizi)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.218AlaAla: 6.218 ± 0.044
1.38AlaCys: 1.38 ± 0.015
2.974AlaAsp: 2.974 ± 0.019
4.717AlaGlu: 4.717 ± 0.031
2.467AlaPhe: 2.467 ± 0.018
4.28AlaGly: 4.28 ± 0.031
1.45AlaHis: 1.45 ± 0.016
3.046AlaIle: 3.046 ± 0.019
3.679AlaLys: 3.679 ± 0.028
6.801AlaLeu: 6.801 ± 0.04
1.615AlaMet: 1.615 ± 0.017
2.285AlaAsn: 2.285 ± 0.016
3.643AlaPro: 3.643 ± 0.034
2.977AlaGln: 2.977 ± 0.025
3.205AlaArg: 3.205 ± 0.024
5.426AlaSer: 5.426 ± 0.031
3.497AlaThr: 3.497 ± 0.026
4.88AlaVal: 4.88 ± 0.027
0.778AlaTrp: 0.778 ± 0.01
1.563AlaTyr: 1.563 ± 0.015
0.002AlaXaa: 0.002 ± 0.0
Cys
1.248CysAla: 1.248 ± 0.017
0.73CysCys: 0.73 ± 0.012
1.028CysAsp: 1.028 ± 0.016
1.32CysGlu: 1.32 ± 0.02
0.896CysPhe: 0.896 ± 0.01
1.828CysGly: 1.828 ± 0.03
0.672CysHis: 0.672 ± 0.011
1.062CysIle: 1.062 ± 0.013
1.298CysLys: 1.298 ± 0.014
2.138CysLeu: 2.138 ± 0.022
0.5CysMet: 0.5 ± 0.01
0.907CysAsn: 0.907 ± 0.011
1.331CysPro: 1.331 ± 0.018
1.073CysGln: 1.073 ± 0.015
1.405CysArg: 1.405 ± 0.017
2.195CysSer: 2.195 ± 0.022
1.28CysThr: 1.28 ± 0.015
1.266CysVal: 1.266 ± 0.017
0.306CysTrp: 0.306 ± 0.007
0.632CysTyr: 0.632 ± 0.009
0.001CysXaa: 0.001 ± 0.0
Asp
2.886AspAla: 2.886 ± 0.02
1.128AspCys: 1.128 ± 0.018
2.633AspAsp: 2.633 ± 0.021
3.521AspGlu: 3.521 ± 0.029
2.118AspPhe: 2.118 ± 0.017
3.112AspGly: 3.112 ± 0.027
1.079AspHis: 1.079 ± 0.011
2.74AspIle: 2.74 ± 0.023
2.642AspLys: 2.642 ± 0.02
5.021AspLeu: 5.021 ± 0.031
1.19AspMet: 1.19 ± 0.012
1.812AspAsn: 1.812 ± 0.018
2.801AspPro: 2.801 ± 0.02
1.814AspGln: 1.814 ± 0.015
2.462AspArg: 2.462 ± 0.018
4.263AspSer: 4.263 ± 0.027
2.594AspThr: 2.594 ± 0.021
3.061AspVal: 3.061 ± 0.024
0.631AspTrp: 0.631 ± 0.009
1.501AspTyr: 1.501 ± 0.016
0.001AspXaa: 0.001 ± 0.0
Glu
4.795GluAla: 4.795 ± 0.029
1.508GluCys: 1.508 ± 0.033
4.289GluAsp: 4.289 ± 0.029
8.02GluGlu: 8.02 ± 0.06
2.063GluPhe: 2.063 ± 0.017
4.043GluGly: 4.043 ± 0.028
1.541GluHis: 1.541 ± 0.013
3.399GluIle: 3.399 ± 0.03
5.483GluLys: 5.483 ± 0.047
6.644GluLeu: 6.644 ± 0.043
1.797GluMet: 1.797 ± 0.014
3.251GluAsn: 3.251 ± 0.022
2.9GluPro: 2.9 ± 0.028
3.285GluGln: 3.285 ± 0.025
4.207GluArg: 4.207 ± 0.033
4.647GluSer: 4.647 ± 0.031
3.577GluThr: 3.577 ± 0.023
4.136GluVal: 4.136 ± 0.029
0.744GluTrp: 0.744 ± 0.01
1.654GluTyr: 1.654 ± 0.02
0.001GluXaa: 0.001 ± 0.0
Phe
2.021PheAla: 2.021 ± 0.019
0.958PheCys: 0.958 ± 0.012
1.686PheAsp: 1.686 ± 0.014
1.936PheGlu: 1.936 ± 0.019
1.506PhePhe: 1.506 ± 0.017
2.185PheGly: 2.185 ± 0.021
1.038PheHis: 1.038 ± 0.015
1.942PheIle: 1.942 ± 0.017
1.838PheLys: 1.838 ± 0.018
3.893PheLeu: 3.893 ± 0.029
0.798PheMet: 0.798 ± 0.011
1.452PheAsn: 1.452 ± 0.014
1.923PhePro: 1.923 ± 0.018
1.792PheGln: 1.792 ± 0.016
1.882PheArg: 1.882 ± 0.019
3.455PheSer: 3.455 ± 0.027
2.162PheThr: 2.162 ± 0.017
2.164PheVal: 2.164 ± 0.019
0.486PheTrp: 0.486 ± 0.008
1.251PheTyr: 1.251 ± 0.013
0.0PheXaa: 0.0 ± 0.0
Gly
4.011GlyAla: 4.011 ± 0.034
1.253GlyCys: 1.253 ± 0.013
3.021GlyAsp: 3.021 ± 0.023
4.325GlyGlu: 4.325 ± 0.036
2.322GlyPhe: 2.322 ± 0.02
4.145GlyGly: 4.145 ± 0.04
1.579GlyHis: 1.579 ± 0.017
2.905GlyIle: 2.905 ± 0.02
4.14GlyLys: 4.14 ± 0.032
5.331GlyLeu: 5.331 ± 0.035
1.371GlyMet: 1.371 ± 0.014
2.533GlyAsn: 2.533 ± 0.019
3.102GlyPro: 3.102 ± 0.042
2.677GlyGln: 2.677 ± 0.02
3.298GlyArg: 3.298 ± 0.027
5.487GlySer: 5.487 ± 0.034
3.583GlyThr: 3.583 ± 0.023
3.371GlyVal: 3.371 ± 0.023
0.742GlyTrp: 0.742 ± 0.012
1.806GlyTyr: 1.806 ± 0.016
0.002GlyXaa: 0.002 ± 0.0
His
1.381HisAla: 1.381 ± 0.018
0.751HisCys: 0.751 ± 0.012
0.901HisAsp: 0.901 ± 0.01
1.33HisGlu: 1.33 ± 0.014
1.081HisPhe: 1.081 ± 0.013
1.497HisGly: 1.497 ± 0.015
0.826HisHis: 0.826 ± 0.012
1.261HisIle: 1.261 ± 0.014
1.371HisLys: 1.371 ± 0.013
2.785HisLeu: 2.785 ± 0.021
0.597HisMet: 0.597 ± 0.009
0.95HisAsn: 0.95 ± 0.01
1.54HisPro: 1.54 ± 0.017
1.501HisGln: 1.501 ± 0.021
1.6HisArg: 1.6 ± 0.018
2.311HisSer: 2.311 ± 0.021
1.644HisThr: 1.644 ± 0.024
1.463HisVal: 1.463 ± 0.015
0.373HisTrp: 0.373 ± 0.007
0.822HisTyr: 0.822 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
2.916IleAla: 2.916 ± 0.021
1.179IleCys: 1.179 ± 0.013
2.184IleAsp: 2.184 ± 0.02
2.636IleGlu: 2.636 ± 0.024
1.909IlePhe: 1.909 ± 0.019
2.375IleGly: 2.375 ± 0.021
1.481IleHis: 1.481 ± 0.02
2.501IleIle: 2.501 ± 0.023
2.799IleLys: 2.799 ± 0.022
4.849IleLeu: 4.849 ± 0.027
1.045IleMet: 1.045 ± 0.011
1.996IleAsn: 1.996 ± 0.017
2.812IlePro: 2.812 ± 0.02
2.401IleGln: 2.401 ± 0.019
2.457IleArg: 2.457 ± 0.018
4.09IleSer: 4.09 ± 0.028
2.798IleThr: 2.798 ± 0.029
2.757IleVal: 2.757 ± 0.021
0.548IleTrp: 0.548 ± 0.008
1.466IleTyr: 1.466 ± 0.013
0.001IleXaa: 0.001 ± 0.0
Lys
4.065LysAla: 4.065 ± 0.027
1.291LysCys: 1.291 ± 0.019
3.218LysAsp: 3.218 ± 0.027
5.491LysGlu: 5.491 ± 0.046
1.688LysPhe: 1.688 ± 0.015
3.44LysGly: 3.44 ± 0.033
1.539LysHis: 1.539 ± 0.015
2.959LysIle: 2.959 ± 0.024
4.801LysLys: 4.801 ± 0.05
5.404LysLeu: 5.404 ± 0.03
1.526LysMet: 1.526 ± 0.016
2.515LysAsn: 2.515 ± 0.02
3.066LysPro: 3.066 ± 0.029
2.885LysGln: 2.885 ± 0.022
3.493LysArg: 3.493 ± 0.025
4.26LysSer: 4.26 ± 0.031
3.236LysThr: 3.236 ± 0.026
3.55LysVal: 3.55 ± 0.03
0.637LysTrp: 0.637 ± 0.011
1.671LysTyr: 1.671 ± 0.02
0.001LysXaa: 0.001 ± 0.0
Leu
6.466LeuAla: 6.466 ± 0.04
2.196LeuCys: 2.196 ± 0.018
4.714LeuAsp: 4.714 ± 0.025
6.854LeuGlu: 6.854 ± 0.044
3.448LeuPhe: 3.448 ± 0.027
5.307LeuGly: 5.307 ± 0.035
2.811LeuHis: 2.811 ± 0.024
4.168LeuIle: 4.168 ± 0.025
5.968LeuLys: 5.968 ± 0.034
10.118LeuLeu: 10.118 ± 0.065
2.06LeuMet: 2.06 ± 0.017
3.643LeuAsn: 3.643 ± 0.026
5.666LeuPro: 5.666 ± 0.042
5.763LeuGln: 5.763 ± 0.038
5.412LeuArg: 5.412 ± 0.032
7.945LeuSer: 7.945 ± 0.039
5.101LeuThr: 5.101 ± 0.029
5.436LeuVal: 5.436 ± 0.032
1.122LeuTrp: 1.122 ± 0.013
2.583LeuTyr: 2.583 ± 0.022
0.003LeuXaa: 0.003 ± 0.001
Met
1.855MetAla: 1.855 ± 0.016
0.426MetCys: 0.426 ± 0.008
1.337MetAsp: 1.337 ± 0.014
2.023MetGlu: 2.023 ± 0.017
0.766MetPhe: 0.766 ± 0.01
1.389MetGly: 1.389 ± 0.015
0.524MetHis: 0.524 ± 0.009
0.923MetIle: 0.923 ± 0.011
1.544MetLys: 1.544 ± 0.014
2.099MetLeu: 2.099 ± 0.019
0.616MetMet: 0.616 ± 0.01
0.931MetAsn: 0.931 ± 0.013
1.143MetPro: 1.143 ± 0.015
1.113MetGln: 1.113 ± 0.012
1.062MetArg: 1.062 ± 0.013
1.616MetSer: 1.616 ± 0.013
1.119MetThr: 1.119 ± 0.013
1.446MetVal: 1.446 ± 0.014
0.245MetTrp: 0.245 ± 0.005
0.663MetTyr: 0.663 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
2.299AsnAla: 2.299 ± 0.016
0.954AsnCys: 0.954 ± 0.013
1.586AsnAsp: 1.586 ± 0.015
2.414AsnGlu: 2.414 ± 0.017
1.502AsnPhe: 1.502 ± 0.015
2.65AsnGly: 2.65 ± 0.024
0.979AsnHis: 0.979 ± 0.012
2.275AsnIle: 2.275 ± 0.018
2.363AsnLys: 2.363 ± 0.019
3.863AsnLeu: 3.863 ± 0.027
0.972AsnMet: 0.972 ± 0.011
1.678AsnAsn: 1.678 ± 0.018
2.259AsnPro: 2.259 ± 0.021
1.76AsnGln: 1.76 ± 0.017
2.032AsnArg: 2.032 ± 0.016
3.372AsnSer: 3.372 ± 0.024
2.152AsnThr: 2.152 ± 0.018
2.36AsnVal: 2.36 ± 0.02
0.47AsnTrp: 0.47 ± 0.007
1.204AsnTyr: 1.204 ± 0.013
0.001AsnXaa: 0.001 ± 0.0
Pro
4.325ProAla: 4.325 ± 0.037
1.114ProCys: 1.114 ± 0.015
2.727ProAsp: 2.727 ± 0.025
4.065ProGlu: 4.065 ± 0.027
1.918ProPhe: 1.918 ± 0.018
4.057ProGly: 4.057 ± 0.052
1.385ProHis: 1.385 ± 0.018
2.04ProIle: 2.04 ± 0.021
2.766ProLys: 2.766 ± 0.032
4.967ProLeu: 4.967 ± 0.039
1.086ProMet: 1.086 ± 0.012
1.884ProAsn: 1.884 ± 0.02
4.742ProPro: 4.742 ± 0.052
2.544ProGln: 2.544 ± 0.024
2.688ProArg: 2.688 ± 0.023
5.126ProSer: 5.126 ± 0.032
2.881ProThr: 2.881 ± 0.021
3.785ProVal: 3.785 ± 0.027
0.627ProTrp: 0.627 ± 0.01
1.637ProTyr: 1.637 ± 0.021
0.002ProXaa: 0.002 ± 0.0
Gln
3.24GlnAla: 3.24 ± 0.023
1.126GlnCys: 1.126 ± 0.016
2.255GlnAsp: 2.255 ± 0.017
3.889GlnGlu: 3.889 ± 0.028
1.439GlnPhe: 1.439 ± 0.013
2.73GlnGly: 2.73 ± 0.02
1.384GlnHis: 1.384 ± 0.013
2.19GlnIle: 2.19 ± 0.018
3.074GlnLys: 3.074 ± 0.024
4.743GlnLeu: 4.743 ± 0.033
1.14GlnMet: 1.14 ± 0.013
2.033GlnAsn: 2.033 ± 0.017
2.52GlnPro: 2.52 ± 0.024
3.113GlnGln: 3.113 ± 0.034
3.014GlnArg: 3.014 ± 0.029
3.433GlnSer: 3.433 ± 0.027
2.508GlnThr: 2.508 ± 0.024
2.676GlnVal: 2.676 ± 0.019
0.563GlnTrp: 0.563 ± 0.008
1.248GlnTyr: 1.248 ± 0.014
0.001GlnXaa: 0.001 ± 0.0
Arg
3.445ArgAla: 3.445 ± 0.026
1.223ArgCys: 1.223 ± 0.016
2.757ArgAsp: 2.757 ± 0.021
3.863ArgGlu: 3.863 ± 0.029
1.877ArgPhe: 1.877 ± 0.018
3.184ArgGly: 3.184 ± 0.03
1.519ArgHis: 1.519 ± 0.017
2.632ArgIle: 2.632 ± 0.021
3.739ArgLys: 3.739 ± 0.029
4.78ArgLeu: 4.78 ± 0.03
1.205ArgMet: 1.205 ± 0.014
2.179ArgAsn: 2.179 ± 0.017
2.724ArgPro: 2.724 ± 0.024
2.599ArgGln: 2.599 ± 0.019
3.794ArgArg: 3.794 ± 0.034
4.305ArgSer: 4.305 ± 0.034
2.852ArgThr: 2.852 ± 0.022
2.999ArgVal: 2.999 ± 0.022
0.648ArgTrp: 0.648 ± 0.009
1.543ArgTyr: 1.543 ± 0.016
0.002ArgXaa: 0.002 ± 0.0
Ser
5.372SerAla: 5.372 ± 0.032
1.981SerCys: 1.981 ± 0.017
4.011SerAsp: 4.011 ± 0.031
5.35SerGlu: 5.35 ± 0.032
3.191SerPhe: 3.191 ± 0.024
5.295SerGly: 5.295 ± 0.028
2.178SerHis: 2.178 ± 0.015
3.534SerIle: 3.534 ± 0.024
4.314SerLys: 4.314 ± 0.031
7.986SerLeu: 7.986 ± 0.037
1.726SerMet: 1.726 ± 0.016
2.983SerAsn: 2.983 ± 0.025
5.512SerPro: 5.512 ± 0.045
4.041SerGln: 4.041 ± 0.029
4.236SerArg: 4.236 ± 0.028
9.442SerSer: 9.442 ± 0.062
4.841SerThr: 4.841 ± 0.031
5.092SerVal: 5.092 ± 0.026
1.01SerTrp: 1.01 ± 0.014
2.21SerTyr: 2.21 ± 0.019
0.002SerXaa: 0.002 ± 0.001
Thr
3.981ThrAla: 3.981 ± 0.025
1.451ThrCys: 1.451 ± 0.02
2.689ThrAsp: 2.689 ± 0.02
3.813ThrGlu: 3.813 ± 0.027
2.138ThrPhe: 2.138 ± 0.018
3.765ThrGly: 3.765 ± 0.032
1.378ThrHis: 1.378 ± 0.018
2.57ThrIle: 2.57 ± 0.021
2.766ThrLys: 2.766 ± 0.023
5.37ThrLeu: 5.37 ± 0.03
1.198ThrMet: 1.198 ± 0.012
1.901ThrAsn: 1.901 ± 0.017
3.378ThrPro: 3.378 ± 0.025
2.288ThrGln: 2.288 ± 0.02
2.406ThrArg: 2.406 ± 0.019
4.767ThrSer: 4.767 ± 0.04
3.072ThrThr: 3.072 ± 0.032
4.021ThrVal: 4.021 ± 0.037
0.664ThrTrp: 0.664 ± 0.011
1.476ThrTyr: 1.476 ± 0.014
0.002ThrXaa: 0.002 ± 0.0
Val
4.11ValAla: 4.11 ± 0.024
1.454ValCys: 1.454 ± 0.016
2.973ValAsp: 2.973 ± 0.023
3.879ValGlu: 3.879 ± 0.027
2.34ValPhe: 2.34 ± 0.018
3.246ValGly: 3.246 ± 0.023
1.542ValHis: 1.542 ± 0.015
2.961ValIle: 2.961 ± 0.022
3.662ValLys: 3.662 ± 0.03
6.152ValLeu: 6.152 ± 0.034
1.436ValMet: 1.436 ± 0.014
2.447ValAsn: 2.447 ± 0.022
3.512ValPro: 3.512 ± 0.029
2.805ValGln: 2.805 ± 0.02
2.898ValArg: 2.898 ± 0.019
4.957ValSer: 4.957 ± 0.033
3.912ValThr: 3.912 ± 0.035
3.943ValVal: 3.943 ± 0.027
0.756ValTrp: 0.756 ± 0.011
1.726ValTyr: 1.726 ± 0.016
0.001ValXaa: 0.001 ± 0.0
Trp
0.725TrpAla: 0.725 ± 0.01
0.248TrpCys: 0.248 ± 0.006
0.649TrpAsp: 0.649 ± 0.01
0.814TrpGlu: 0.814 ± 0.01
0.45TrpPhe: 0.45 ± 0.009
0.751TrpGly: 0.751 ± 0.011
0.314TrpHis: 0.314 ± 0.006
0.59TrpIle: 0.59 ± 0.009
0.802TrpLys: 0.802 ± 0.011
1.185TrpLeu: 1.185 ± 0.015
0.315TrpMet: 0.315 ± 0.006
0.554TrpAsn: 0.554 ± 0.009
0.484TrpPro: 0.484 ± 0.007
0.561TrpGln: 0.561 ± 0.009
0.677TrpArg: 0.677 ± 0.011
0.901TrpSer: 0.901 ± 0.012
0.664TrpThr: 0.664 ± 0.01
0.683TrpVal: 0.683 ± 0.009
0.185TrpTrp: 0.185 ± 0.005
0.365TrpTyr: 0.365 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.52TyrAla: 1.52 ± 0.015
0.762TyrCys: 0.762 ± 0.01
1.375TyrAsp: 1.375 ± 0.016
1.726TyrGlu: 1.726 ± 0.017
1.25TyrPhe: 1.25 ± 0.012
1.721TyrGly: 1.721 ± 0.018
0.766TyrHis: 0.766 ± 0.01
1.501TyrIle: 1.501 ± 0.015
1.686TyrLys: 1.686 ± 0.021
2.655TyrLeu: 2.655 ± 0.02
0.646TyrMet: 0.646 ± 0.009
1.211TyrAsn: 1.211 ± 0.015
1.306TyrPro: 1.306 ± 0.014
1.324TyrGln: 1.324 ± 0.014
1.638TyrArg: 1.638 ± 0.017
2.308TyrSer: 2.308 ± 0.018
1.621TyrThr: 1.621 ± 0.018
1.625TyrVal: 1.625 ± 0.014
0.379TyrTrp: 0.379 ± 0.008
0.995TyrTyr: 0.995 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.003XaaGly: 0.003 ± 0.001
0.001XaaHis: 0.001 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.001
0.001XaaGln: 0.001 ± 0.0
0.003XaaArg: 0.003 ± 0.001
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.32XaaXaa: 0.32 ± 0.035
Statistics based on 18960 proteins (9158208 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski