Amino acid dipepetide frequency for Scyliorhinus torazame (Cloudy catshark)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.436AlaAla: 6.436 ± 0.085
1.206AlaCys: 1.206 ± 0.014
3.114AlaAsp: 3.114 ± 0.023
4.958AlaGlu: 4.958 ± 0.057
2.272AlaPhe: 2.272 ± 0.018
4.104AlaGly: 4.104 ± 0.037
1.37AlaHis: 1.37 ± 0.019
3.06AlaIle: 3.06 ± 0.022
3.527AlaLys: 3.527 ± 0.026
6.06AlaLeu: 6.06 ± 0.044
1.57AlaMet: 1.57 ± 0.025
2.377AlaAsn: 2.377 ± 0.019
2.821AlaPro: 2.821 ± 0.028
2.805AlaGln: 2.805 ± 0.029
3.144AlaArg: 3.144 ± 0.032
4.909AlaSer: 4.909 ± 0.034
3.486AlaThr: 3.486 ± 0.023
4.899AlaVal: 4.899 ± 0.043
0.653AlaTrp: 0.653 ± 0.012
1.423AlaTyr: 1.423 ± 0.017
0.0AlaXaa: 0.0 ± 0.0
Cys
1.18CysAla: 1.18 ± 0.015
0.611CysCys: 0.611 ± 0.011
1.21CysAsp: 1.21 ± 0.026
1.474CysGlu: 1.474 ± 0.029
0.827CysPhe: 0.827 ± 0.011
1.45CysGly: 1.45 ± 0.021
0.594CysHis: 0.594 ± 0.01
1.092CysIle: 1.092 ± 0.013
1.241CysLys: 1.241 ± 0.014
2.01CysLeu: 2.01 ± 0.02
0.441CysMet: 0.441 ± 0.007
0.962CysAsn: 0.962 ± 0.013
1.139CysPro: 1.139 ± 0.017
1.023CysGln: 1.023 ± 0.014
1.254CysArg: 1.254 ± 0.016
1.939CysSer: 1.939 ± 0.019
1.232CysThr: 1.232 ± 0.016
1.419CysVal: 1.419 ± 0.021
0.279CysTrp: 0.279 ± 0.006
0.609CysTyr: 0.609 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
3.035AspAla: 3.035 ± 0.03
1.214AspCys: 1.214 ± 0.017
3.248AspAsp: 3.248 ± 0.042
3.909AspGlu: 3.909 ± 0.036
2.029AspPhe: 2.029 ± 0.017
3.647AspGly: 3.647 ± 0.044
1.25AspHis: 1.25 ± 0.018
3.123AspIle: 3.123 ± 0.031
2.772AspLys: 2.772 ± 0.024
5.04AspLeu: 5.04 ± 0.034
1.222AspMet: 1.222 ± 0.016
2.165AspAsn: 2.165 ± 0.023
2.783AspPro: 2.783 ± 0.035
2.101AspGln: 2.101 ± 0.022
2.957AspArg: 2.957 ± 0.043
4.47AspSer: 4.47 ± 0.043
3.118AspThr: 3.118 ± 0.052
3.61AspVal: 3.61 ± 0.049
0.699AspTrp: 0.699 ± 0.01
1.657AspTyr: 1.657 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
4.531GluAla: 4.531 ± 0.036
1.413GluCys: 1.413 ± 0.02
4.495GluAsp: 4.495 ± 0.038
7.532GluGlu: 7.532 ± 0.075
2.172GluPhe: 2.172 ± 0.023
4.004GluGly: 4.004 ± 0.04
1.714GluHis: 1.714 ± 0.021
3.774GluIle: 3.774 ± 0.034
5.263GluLys: 5.263 ± 0.044
6.482GluLeu: 6.482 ± 0.052
2.005GluMet: 2.005 ± 0.022
3.464GluAsn: 3.464 ± 0.024
2.628GluPro: 2.628 ± 0.026
3.562GluGln: 3.562 ± 0.033
5.106GluArg: 5.106 ± 0.067
4.917GluSer: 4.917 ± 0.04
4.079GluThr: 4.079 ± 0.042
4.246GluVal: 4.246 ± 0.035
0.754GluTrp: 0.754 ± 0.011
1.869GluTyr: 1.869 ± 0.026
0.0GluXaa: 0.0 ± 0.0
Phe
2.02PheAla: 2.02 ± 0.017
0.872PheCys: 0.872 ± 0.011
1.886PheAsp: 1.886 ± 0.02
2.105PheGlu: 2.105 ± 0.02
1.396PhePhe: 1.396 ± 0.015
2.18PheGly: 2.18 ± 0.019
0.957PheHis: 0.957 ± 0.013
1.951PheIle: 1.951 ± 0.017
1.919PheLys: 1.919 ± 0.015
3.627PheLeu: 3.627 ± 0.031
0.761PheMet: 0.761 ± 0.014
1.557PheAsn: 1.557 ± 0.016
1.742PhePro: 1.742 ± 0.015
1.7PheGln: 1.7 ± 0.015
1.797PheArg: 1.797 ± 0.019
2.98PheSer: 2.98 ± 0.021
2.159PheThr: 2.159 ± 0.023
2.238PheVal: 2.238 ± 0.021
0.448PheTrp: 0.448 ± 0.007
1.146PheTyr: 1.146 ± 0.012
0.0PheXaa: 0.0 ± 0.0
Gly
3.7GlyAla: 3.7 ± 0.033
1.178GlyCys: 1.178 ± 0.014
3.284GlyAsp: 3.284 ± 0.044
4.136GlyGlu: 4.136 ± 0.041
2.283GlyPhe: 2.283 ± 0.022
3.991GlyGly: 3.991 ± 0.039
1.478GlyHis: 1.478 ± 0.018
3.245GlyIle: 3.245 ± 0.03
3.832GlyLys: 3.832 ± 0.025
5.169GlyLeu: 5.169 ± 0.045
1.442GlyMet: 1.442 ± 0.016
2.832GlyAsn: 2.832 ± 0.026
2.705GlyPro: 2.705 ± 0.049
2.655GlyGln: 2.655 ± 0.027
3.348GlyArg: 3.348 ± 0.028
5.124GlySer: 5.124 ± 0.031
3.567GlyThr: 3.567 ± 0.035
3.689GlyVal: 3.689 ± 0.043
0.717GlyTrp: 0.717 ± 0.011
1.804GlyTyr: 1.804 ± 0.021
0.0GlyXaa: 0.0 ± 0.0
His
1.303HisAla: 1.303 ± 0.021
0.758HisCys: 0.758 ± 0.012
0.996HisAsp: 0.996 ± 0.015
1.31HisGlu: 1.31 ± 0.016
1.042HisPhe: 1.042 ± 0.012
1.435HisGly: 1.435 ± 0.016
0.833HisHis: 0.833 ± 0.013
1.329HisIle: 1.329 ± 0.014
1.319HisLys: 1.319 ± 0.012
2.602HisLeu: 2.602 ± 0.024
0.568HisMet: 0.568 ± 0.011
1.007HisAsn: 1.007 ± 0.012
1.371HisPro: 1.371 ± 0.016
1.278HisGln: 1.278 ± 0.013
1.482HisArg: 1.482 ± 0.017
2.205HisSer: 2.205 ± 0.021
1.312HisThr: 1.312 ± 0.016
1.489HisVal: 1.489 ± 0.016
0.346HisTrp: 0.346 ± 0.006
0.834HisTyr: 0.834 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
3.057IleAla: 3.057 ± 0.03
1.219IleCys: 1.219 ± 0.013
2.71IleAsp: 2.71 ± 0.037
3.197IleGlu: 3.197 ± 0.032
1.988IlePhe: 1.988 ± 0.018
2.75IleGly: 2.75 ± 0.023
1.337IleHis: 1.337 ± 0.016
2.837IleIle: 2.837 ± 0.023
2.957IleLys: 2.957 ± 0.021
4.921IleLeu: 4.921 ± 0.032
1.128IleMet: 1.128 ± 0.014
2.335IleAsn: 2.335 ± 0.022
2.866IlePro: 2.866 ± 0.024
2.557IleGln: 2.557 ± 0.022
2.601IleArg: 2.601 ± 0.028
4.197IleSer: 4.197 ± 0.033
3.069IleThr: 3.069 ± 0.03
3.08IleVal: 3.08 ± 0.027
0.578IleTrp: 0.578 ± 0.01
1.498IleTyr: 1.498 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
3.749LysAla: 3.749 ± 0.023
1.214LysCys: 1.214 ± 0.015
3.378LysAsp: 3.378 ± 0.028
5.398LysGlu: 5.398 ± 0.043
1.759LysPhe: 1.759 ± 0.015
3.309LysGly: 3.309 ± 0.028
1.522LysHis: 1.522 ± 0.017
3.109LysIle: 3.109 ± 0.024
4.679LysLys: 4.679 ± 0.037
5.641LysLeu: 5.641 ± 0.033
1.653LysMet: 1.653 ± 0.015
2.64LysAsn: 2.64 ± 0.02
2.809LysPro: 2.809 ± 0.024
2.936LysGln: 2.936 ± 0.025
3.558LysArg: 3.558 ± 0.023
4.175LysSer: 4.175 ± 0.03
3.302LysThr: 3.302 ± 0.023
3.727LysVal: 3.727 ± 0.027
0.648LysTrp: 0.648 ± 0.009
1.714LysTyr: 1.714 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
5.836LeuAla: 5.836 ± 0.04
2.095LeuCys: 2.095 ± 0.027
4.987LeuAsp: 4.987 ± 0.044
6.631LeuGlu: 6.631 ± 0.042
3.255LeuPhe: 3.255 ± 0.025
5.051LeuGly: 5.051 ± 0.039
2.619LeuHis: 2.619 ± 0.022
4.365LeuIle: 4.365 ± 0.042
6.048LeuLys: 6.048 ± 0.034
9.627LeuLeu: 9.627 ± 0.078
2.153LeuMet: 2.153 ± 0.021
4.103LeuAsn: 4.103 ± 0.03
4.801LeuPro: 4.801 ± 0.031
5.456LeuGln: 5.456 ± 0.041
5.07LeuArg: 5.07 ± 0.038
7.422LeuSer: 7.422 ± 0.041
5.273LeuThr: 5.273 ± 0.046
5.33LeuVal: 5.33 ± 0.044
0.989LeuTrp: 0.989 ± 0.014
2.648LeuTyr: 2.648 ± 0.031
0.0LeuXaa: 0.0 ± 0.0
Met
1.767MetAla: 1.767 ± 0.016
0.486MetCys: 0.486 ± 0.01
1.493MetAsp: 1.493 ± 0.022
2.05MetGlu: 2.05 ± 0.019
0.832MetPhe: 0.832 ± 0.01
1.346MetGly: 1.346 ± 0.021
0.52MetHis: 0.52 ± 0.009
1.043MetIle: 1.043 ± 0.012
1.653MetLys: 1.653 ± 0.013
2.095MetLeu: 2.095 ± 0.02
0.691MetMet: 0.691 ± 0.011
1.049MetAsn: 1.049 ± 0.013
1.052MetPro: 1.052 ± 0.016
1.061MetGln: 1.061 ± 0.011
1.145MetArg: 1.145 ± 0.017
1.728MetSer: 1.728 ± 0.018
1.251MetThr: 1.251 ± 0.018
1.548MetVal: 1.548 ± 0.021
0.257MetTrp: 0.257 ± 0.005
0.681MetTyr: 0.681 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
2.515AsnAla: 2.515 ± 0.024
1.034AsnCys: 1.034 ± 0.014
2.048AsnAsp: 2.048 ± 0.021
2.864AsnGlu: 2.864 ± 0.027
1.538AsnPhe: 1.538 ± 0.014
2.953AsnGly: 2.953 ± 0.028
1.015AsnHis: 1.015 ± 0.012
2.589AsnIle: 2.589 ± 0.016
2.624AsnLys: 2.624 ± 0.022
4.155AsnLeu: 4.155 ± 0.028
1.085AsnMet: 1.085 ± 0.013
2.016AsnAsn: 2.016 ± 0.023
2.315AsnPro: 2.315 ± 0.025
1.926AsnGln: 1.926 ± 0.018
2.154AsnArg: 2.154 ± 0.016
3.55AsnSer: 3.55 ± 0.023
2.485AsnThr: 2.485 ± 0.025
2.795AsnVal: 2.795 ± 0.025
0.508AsnTrp: 0.508 ± 0.008
1.305AsnTyr: 1.305 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
3.456ProAla: 3.456 ± 0.03
0.954ProCys: 0.954 ± 0.012
2.739ProAsp: 2.739 ± 0.031
3.769ProGlu: 3.769 ± 0.035
1.696ProPhe: 1.696 ± 0.013
3.707ProGly: 3.707 ± 0.063
1.191ProHis: 1.191 ± 0.015
2.094ProIle: 2.094 ± 0.02
2.523ProLys: 2.523 ± 0.023
4.265ProLeu: 4.265 ± 0.034
0.951ProMet: 0.951 ± 0.011
1.982ProAsn: 1.982 ± 0.021
3.678ProPro: 3.678 ± 0.045
2.16ProGln: 2.16 ± 0.02
2.457ProArg: 2.457 ± 0.019
4.394ProSer: 4.394 ± 0.035
2.686ProThr: 2.686 ± 0.025
3.558ProVal: 3.558 ± 0.031
0.501ProTrp: 0.501 ± 0.008
1.257ProTyr: 1.257 ± 0.015
0.0ProXaa: 0.0 ± 0.0
Gln
2.943GlnAla: 2.943 ± 0.026
1.056GlnCys: 1.056 ± 0.023
2.422GlnAsp: 2.422 ± 0.035
3.605GlnGlu: 3.605 ± 0.031
1.471GlnPhe: 1.471 ± 0.016
2.493GlnGly: 2.493 ± 0.026
1.315GlnHis: 1.315 ± 0.016
2.348GlnIle: 2.348 ± 0.019
2.962GlnLys: 2.962 ± 0.023
4.637GlnLeu: 4.637 ± 0.034
1.198GlnMet: 1.198 ± 0.012
2.178GlnAsn: 2.178 ± 0.017
2.227GlnPro: 2.227 ± 0.021
3.239GlnGln: 3.239 ± 0.037
3.046GlnArg: 3.046 ± 0.034
3.53GlnSer: 3.53 ± 0.028
2.586GlnThr: 2.586 ± 0.022
2.647GlnVal: 2.647 ± 0.021
0.542GlnTrp: 0.542 ± 0.009
1.363GlnTyr: 1.363 ± 0.018
0.0GlnXaa: 0.0 ± 0.0
Arg
3.561ArgAla: 3.561 ± 0.045
1.13ArgCys: 1.13 ± 0.015
3.014ArgAsp: 3.014 ± 0.035
4.469ArgGlu: 4.469 ± 0.057
1.86ArgPhe: 1.86 ± 0.017
3.03ArgGly: 3.03 ± 0.033
1.436ArgHis: 1.436 ± 0.018
2.689ArgIle: 2.689 ± 0.027
3.846ArgLys: 3.846 ± 0.024
4.881ArgLeu: 4.881 ± 0.036
1.322ArgMet: 1.322 ± 0.014
2.504ArgAsn: 2.504 ± 0.021
2.419ArgPro: 2.419 ± 0.024
2.651ArgGln: 2.651 ± 0.027
3.744ArgArg: 3.744 ± 0.035
4.171ArgSer: 4.171 ± 0.041
2.813ArgThr: 2.813 ± 0.029
3.066ArgVal: 3.066 ± 0.031
0.632ArgTrp: 0.632 ± 0.009
1.536ArgTyr: 1.536 ± 0.016
0.0ArgXaa: 0.0 ± 0.0
Ser
5.041SerAla: 5.041 ± 0.034
1.79SerCys: 1.79 ± 0.022
4.427SerAsp: 4.427 ± 0.037
5.589SerGlu: 5.589 ± 0.05
2.828SerPhe: 2.828 ± 0.022
5.211SerGly: 5.211 ± 0.04
1.891SerHis: 1.891 ± 0.017
3.704SerIle: 3.704 ± 0.022
4.434SerLys: 4.434 ± 0.027
7.411SerLeu: 7.411 ± 0.042
1.729SerMet: 1.729 ± 0.017
3.401SerAsn: 3.401 ± 0.026
4.657SerPro: 4.657 ± 0.044
3.585SerGln: 3.585 ± 0.025
4.104SerArg: 4.104 ± 0.03
8.132SerSer: 8.132 ± 0.062
4.653SerThr: 4.653 ± 0.037
5.17SerVal: 5.17 ± 0.034
0.897SerTrp: 0.897 ± 0.011
2.002SerTyr: 2.002 ± 0.019
0.0SerXaa: 0.0 ± 0.0
Thr
3.972ThrAla: 3.972 ± 0.033
1.29ThrCys: 1.29 ± 0.021
3.166ThrAsp: 3.166 ± 0.033
4.419ThrGlu: 4.419 ± 0.039
2.129ThrPhe: 2.129 ± 0.018
3.838ThrGly: 3.838 ± 0.041
1.198ThrHis: 1.198 ± 0.015
2.91ThrIle: 2.91 ± 0.025
2.955ThrLys: 2.955 ± 0.026
5.305ThrLeu: 5.305 ± 0.039
1.3ThrMet: 1.3 ± 0.016
2.2ThrAsn: 2.2 ± 0.018
3.048ThrPro: 3.048 ± 0.03
2.34ThrGln: 2.34 ± 0.036
2.447ThrArg: 2.447 ± 0.023
4.617ThrSer: 4.617 ± 0.031
3.185ThrThr: 3.185 ± 0.032
4.41ThrVal: 4.41 ± 0.046
0.658ThrTrp: 0.658 ± 0.013
1.417ThrTyr: 1.417 ± 0.014
0.0ThrXaa: 0.0 ± 0.0
Val
4.023ValAla: 4.023 ± 0.035
1.489ValCys: 1.489 ± 0.02
3.352ValAsp: 3.352 ± 0.038
4.202ValGlu: 4.202 ± 0.035
2.378ValPhe: 2.378 ± 0.029
3.484ValGly: 3.484 ± 0.04
1.543ValHis: 1.543 ± 0.017
3.38ValIle: 3.38 ± 0.038
3.963ValLys: 3.963 ± 0.031
6.013ValLeu: 6.013 ± 0.049
1.556ValMet: 1.556 ± 0.024
2.799ValAsn: 2.799 ± 0.026
3.22ValPro: 3.22 ± 0.03
3.007ValGln: 3.007 ± 0.028
3.047ValArg: 3.047 ± 0.024
4.985ValSer: 4.985 ± 0.043
4.191ValThr: 4.191 ± 0.042
4.048ValVal: 4.048 ± 0.038
0.709ValTrp: 0.709 ± 0.011
1.776ValTyr: 1.776 ± 0.024
0.0ValXaa: 0.0 ± 0.0
Trp
0.656TrpAla: 0.656 ± 0.011
0.245TrpCys: 0.245 ± 0.005
0.654TrpAsp: 0.654 ± 0.014
0.751TrpGlu: 0.751 ± 0.01
0.441TrpPhe: 0.441 ± 0.008
0.58TrpGly: 0.58 ± 0.01
0.274TrpHis: 0.274 ± 0.005
0.663TrpIle: 0.663 ± 0.011
0.843TrpLys: 0.843 ± 0.01
1.095TrpLeu: 1.095 ± 0.014
0.32TrpMet: 0.32 ± 0.007
0.61TrpAsn: 0.61 ± 0.008
0.394TrpPro: 0.394 ± 0.007
0.491TrpGln: 0.491 ± 0.007
0.676TrpArg: 0.676 ± 0.011
0.871TrpSer: 0.871 ± 0.014
0.663TrpThr: 0.663 ± 0.011
0.625TrpVal: 0.625 ± 0.009
0.179TrpTrp: 0.179 ± 0.005
0.35TrpTyr: 0.35 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.415TyrAla: 1.415 ± 0.024
0.733TyrCys: 0.733 ± 0.01
1.424TyrAsp: 1.424 ± 0.025
1.643TyrGlu: 1.643 ± 0.02
1.246TyrPhe: 1.246 ± 0.013
1.616TyrGly: 1.616 ± 0.02
0.763TyrHis: 0.763 ± 0.012
1.575TyrIle: 1.575 ± 0.018
1.561TyrLys: 1.561 ± 0.015
2.727TyrLeu: 2.727 ± 0.025
0.653TyrMet: 0.653 ± 0.01
1.292TyrAsn: 1.292 ± 0.014
1.343TyrPro: 1.343 ± 0.024
1.274TyrGln: 1.274 ± 0.013
1.634TyrArg: 1.634 ± 0.018
2.342TyrSer: 2.342 ± 0.023
1.693TyrThr: 1.693 ± 0.024
1.57TyrVal: 1.57 ± 0.018
0.385TyrTrp: 0.385 ± 0.008
1.047TyrTyr: 1.047 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27602 proteins (10337261 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski