Amino acid dipeptide frequency for Alternaria arborescens victorivirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
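The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: the function names are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), dipeptides are assumed to be overlapping windows that do not span two proteins, and any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids
UNIFORM_PER_MILLE = 1000.0 / 400        # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and
    normalised by the total number of pairs across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"
```

For example, `classify(18.08)` labels the AlaAla value from the table as overrepresented, while `classify(1.247)` labels AlaCys as underrepresented.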

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
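As a quick illustration of the unit conversion, the following sketch turns a per mille value into a fraction or a percentage. The helper names are hypothetical and only serve this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0
```

With these helpers, the AlaAla entry of 18.08‰ corresponds to a fraction of about 0.01808, or about 1.808%.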

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
18.08AlaAla: 18.08 ± 8.264
1.247AlaCys: 1.247 ± 0.027
3.741AlaAsp: 3.741 ± 0.91
4.364AlaGlu: 4.364 ± 1.338
1.247AlaPhe: 1.247 ± 0.027
12.469AlaGly: 12.469 ± 3.586
5.611AlaHis: 5.611 ± 2.193
3.741AlaIle: 3.741 ± 1.738
4.364AlaLys: 4.364 ± 1.146
11.845AlaLeu: 11.845 ± 0.674
1.247AlaMet: 1.247 ± 0.027
6.234AlaAsn: 6.234 ± 1.793
8.728AlaPro: 8.728 ± 1.848
1.87AlaGln: 1.87 ± 1.283
8.105AlaArg: 8.105 ± 1.064
10.599AlaSer: 10.599 ± 0.647
4.988AlaThr: 4.988 ± 2.375
5.611AlaVal: 5.611 ± 0.291
2.494AlaTrp: 2.494 ± 0.055
1.87AlaTyr: 1.87 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
0.623CysAla: 0.623 ± 0.4
0.0CysCys: 0.0 ± 0.0
1.247CysAsp: 1.247 ± 0.027
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.623CysLeu: 0.623 ± 0.4
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.623CysPro: 0.623 ± 0.4
0.0CysGln: 0.0 ± 0.0
0.623CysArg: 0.623 ± 0.4
0.623CysSer: 0.623 ± 0.4
0.623CysThr: 0.623 ± 0.428
1.87CysVal: 1.87 ± 0.373
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.234AspAla: 6.234 ± 0.137
0.0AspCys: 0.0 ± 0.0
6.234AspAsp: 6.234 ± 1.519
3.117AspGlu: 3.117 ± 1.311
1.247AspPhe: 1.247 ± 0.027
2.494AspGly: 2.494 ± 0.883
1.247AspHis: 1.247 ± 0.027
1.87AspIle: 1.87 ± 0.373
0.623AspLys: 0.623 ± 0.4
4.364AspLeu: 4.364 ± 0.51
0.623AspMet: 0.623 ± 0.4
1.87AspAsn: 1.87 ± 0.373
2.494AspPro: 2.494 ± 0.883
1.247AspGln: 1.247 ± 0.027
4.364AspArg: 4.364 ± 0.318
2.494AspSer: 2.494 ± 0.055
4.364AspThr: 4.364 ± 0.51
6.234AspVal: 6.234 ± 0.137
1.247AspTrp: 1.247 ± 0.027
2.494AspTyr: 2.494 ± 0.773
0.0AspXaa: 0.0 ± 0.0
Glu
8.105GluAla: 8.105 ± 1.42
0.0GluCys: 0.0 ± 0.0
1.87GluAsp: 1.87 ± 1.201
3.117GluGlu: 3.117 ± 1.174
2.494GluPhe: 2.494 ± 0.773
3.741GluGly: 3.741 ± 2.566
3.117GluHis: 3.117 ± 0.346
0.0GluIle: 0.0 ± 0.0
0.623GluLys: 0.623 ± 0.4
6.234GluLeu: 6.234 ± 1.519
0.623GluMet: 0.623 ± 0.4
0.0GluAsn: 0.0 ± 0.0
1.247GluPro: 1.247 ± 0.855
1.247GluGln: 1.247 ± 0.027
3.117GluArg: 3.117 ± 0.482
2.494GluSer: 2.494 ± 1.602
2.494GluThr: 2.494 ± 0.773
2.494GluVal: 2.494 ± 0.055
1.247GluTrp: 1.247 ± 0.801
2.494GluTyr: 2.494 ± 1.602
0.0GluXaa: 0.0 ± 0.0
Phe
2.494PheAla: 2.494 ± 0.883
0.623PheCys: 0.623 ± 0.428
1.87PheAsp: 1.87 ± 0.455
1.87PheGlu: 1.87 ± 0.455
0.623PhePhe: 0.623 ± 0.4
2.494PheGly: 2.494 ± 0.883
0.0PheHis: 0.0 ± 0.0
1.247PheIle: 1.247 ± 0.855
1.247PheLys: 1.247 ± 0.027
3.741PheLeu: 3.741 ± 0.746
0.0PheMet: 0.0 ± 0.0
1.247PheAsn: 1.247 ± 0.801
0.623PhePro: 0.623 ± 0.4
0.623PheGln: 0.623 ± 0.4
0.623PheArg: 0.623 ± 0.428
3.117PheSer: 3.117 ± 1.311
1.247PheThr: 1.247 ± 0.027
0.0PheVal: 0.0 ± 0.0
0.623PheTrp: 0.623 ± 0.428
0.623PheTyr: 0.623 ± 0.428
0.0PheXaa: 0.0 ± 0.0
Gly
9.975GlyAla: 9.975 ± 4.36
0.623GlyCys: 0.623 ± 0.4
7.481GlyAsp: 7.481 ± 1.82
5.611GlyGlu: 5.611 ± 0.537
2.494GlyPhe: 2.494 ± 0.883
9.975GlyGly: 9.975 ± 4.36
3.117GlyHis: 3.117 ± 0.346
3.117GlyIle: 3.117 ± 1.311
4.364GlyLys: 4.364 ± 2.803
6.234GlyLeu: 6.234 ± 0.137
1.247GlyMet: 1.247 ± 0.027
1.87GlyAsn: 1.87 ± 0.373
3.117GlyPro: 3.117 ± 1.311
1.247GlyGln: 1.247 ± 0.027
6.858GlyArg: 6.858 ± 2.221
3.117GlySer: 3.117 ± 1.311
6.234GlyThr: 6.234 ± 1.793
8.105GlyVal: 8.105 ± 0.592
0.623GlyTrp: 0.623 ± 0.4
2.494GlyTyr: 2.494 ± 0.773
0.0GlyXaa: 0.0 ± 0.0
His
2.494HisAla: 2.494 ± 0.883
0.623HisCys: 0.623 ± 0.428
0.623HisAsp: 0.623 ± 0.428
0.623HisGlu: 0.623 ± 0.428
0.623HisPhe: 0.623 ± 0.428
3.741HisGly: 3.741 ± 0.91
1.87HisHis: 1.87 ± 0.455
1.87HisIle: 1.87 ± 0.373
0.623HisLys: 0.623 ± 0.428
2.494HisLeu: 2.494 ± 0.773
0.623HisMet: 0.623 ± 0.428
0.623HisAsn: 0.623 ± 0.428
0.623HisPro: 0.623 ± 0.4
1.87HisGln: 1.87 ± 0.455
2.494HisArg: 2.494 ± 0.773
1.87HisSer: 1.87 ± 0.455
1.87HisThr: 1.87 ± 0.373
1.87HisVal: 1.87 ± 0.373
0.623HisTrp: 0.623 ± 0.4
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.741IleAla: 3.741 ± 0.91
0.0IleCys: 0.0 ± 0.0
3.117IleAsp: 3.117 ± 0.346
1.247IleGlu: 1.247 ± 0.027
3.117IlePhe: 3.117 ± 1.311
2.494IleGly: 2.494 ± 0.883
0.623IleHis: 0.623 ± 0.4
1.247IleIle: 1.247 ± 0.027
0.623IleLys: 0.623 ± 0.4
1.247IleLeu: 1.247 ± 0.027
0.623IleMet: 0.623 ± 0.4
3.117IleAsn: 3.117 ± 0.482
3.741IlePro: 3.741 ± 1.738
1.247IleGln: 1.247 ± 0.855
1.87IleArg: 1.87 ± 1.201
1.247IleSer: 1.247 ± 0.027
2.494IleThr: 2.494 ± 0.055
1.87IleVal: 1.87 ± 0.455
0.0IleTrp: 0.0 ± 0.0
1.87IleTyr: 1.87 ± 1.201
0.0IleXaa: 0.0 ± 0.0
Lys
1.247LysAla: 1.247 ± 0.801
0.0LysCys: 0.0 ± 0.0
1.87LysAsp: 1.87 ± 1.201
0.623LysGlu: 0.623 ± 0.4
0.0LysPhe: 0.0 ± 0.0
1.87LysGly: 1.87 ± 0.373
1.247LysHis: 1.247 ± 0.027
0.623LysIle: 0.623 ± 0.4
1.247LysLys: 1.247 ± 0.801
3.741LysLeu: 3.741 ± 1.574
0.623LysMet: 0.623 ± 0.428
2.494LysAsn: 2.494 ± 1.602
1.247LysPro: 1.247 ± 0.801
0.623LysGln: 0.623 ± 0.4
1.247LysArg: 1.247 ± 0.801
1.247LysSer: 1.247 ± 0.801
2.494LysThr: 2.494 ± 0.055
1.87LysVal: 1.87 ± 1.201
1.247LysTrp: 1.247 ± 0.801
1.247LysTyr: 1.247 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
11.845LeuAla: 11.845 ± 0.674
1.247LeuCys: 1.247 ± 0.801
3.741LeuAsp: 3.741 ± 0.082
6.234LeuGlu: 6.234 ± 3.176
1.247LeuPhe: 1.247 ± 0.801
9.352LeuGly: 9.352 ± 1.037
1.87LeuHis: 1.87 ± 0.373
3.117LeuIle: 3.117 ± 0.346
2.494LeuLys: 2.494 ± 0.773
8.105LeuLeu: 8.105 ± 4.377
2.494LeuMet: 2.494 ± 1.602
3.741LeuAsn: 3.741 ± 0.746
2.494LeuPro: 2.494 ± 0.055
0.0LeuGln: 0.0 ± 0.0
9.975LeuArg: 9.975 ± 0.219
6.858LeuSer: 6.858 ± 1.92
6.234LeuThr: 6.234 ± 1.519
5.611LeuVal: 5.611 ± 1.947
1.247LeuTrp: 1.247 ± 0.027
2.494LeuTyr: 2.494 ± 0.055
0.0LeuXaa: 0.0 ± 0.0
Met
3.117MetAla: 3.117 ± 1.174
0.0MetCys: 0.0 ± 0.0
2.494MetAsp: 2.494 ± 0.055
0.623MetGlu: 0.623 ± 0.428
0.0MetPhe: 0.0 ± 0.0
0.623MetGly: 0.623 ± 0.4
0.0MetHis: 0.0 ± 0.0
0.623MetIle: 0.623 ± 0.4
0.623MetLys: 0.623 ± 0.428
1.87MetLeu: 1.87 ± 1.201
0.623MetMet: 0.623 ± 0.428
1.247MetAsn: 1.247 ± 0.027
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
3.117MetArg: 3.117 ± 0.346
2.494MetSer: 2.494 ± 0.773
0.0MetThr: 0.0 ± 0.0
1.247MetVal: 1.247 ± 0.027
0.0MetTrp: 0.0 ± 0.0
0.623MetTyr: 0.623 ± 0.4
0.0MetXaa: 0.0 ± 0.0
Asn
2.494AsnAla: 2.494 ± 0.055
0.623AsnCys: 0.623 ± 0.4
1.87AsnAsp: 1.87 ± 0.455
0.623AsnGlu: 0.623 ± 0.428
1.87AsnPhe: 1.87 ± 1.283
3.117AsnGly: 3.117 ± 1.174
0.623AsnHis: 0.623 ± 0.428
0.623AsnIle: 0.623 ± 0.428
1.247AsnLys: 1.247 ± 0.801
2.494AsnLeu: 2.494 ± 0.773
0.623AsnMet: 0.623 ± 0.428
0.623AsnAsn: 0.623 ± 0.428
3.117AsnPro: 3.117 ± 1.174
1.87AsnGln: 1.87 ± 1.283
2.494AsnArg: 2.494 ± 1.602
4.364AsnSer: 4.364 ± 0.318
1.87AsnThr: 1.87 ± 0.455
3.117AsnVal: 3.117 ± 0.482
1.247AsnTrp: 1.247 ± 0.801
1.247AsnTyr: 1.247 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
7.481ProAla: 7.481 ± 3.477
0.0ProCys: 0.0 ± 0.0
1.247ProAsp: 1.247 ± 0.027
4.364ProGlu: 4.364 ± 1.146
0.623ProPhe: 0.623 ± 0.4
3.117ProGly: 3.117 ± 0.482
1.87ProHis: 1.87 ± 0.455
1.87ProIle: 1.87 ± 0.455
0.623ProLys: 0.623 ± 0.4
7.481ProLeu: 7.481 ± 0.664
0.0ProMet: 0.0 ± 0.0
0.0ProAsn: 0.0 ± 0.0
8.105ProPro: 8.105 ± 4.733
3.117ProGln: 3.117 ± 0.482
1.87ProArg: 1.87 ± 0.373
2.494ProSer: 2.494 ± 0.773
7.481ProThr: 7.481 ± 1.82
5.611ProVal: 5.611 ± 0.537
0.0ProTrp: 0.0 ± 0.0
3.741ProTyr: 3.741 ± 0.082
0.0ProXaa: 0.0 ± 0.0
Gln
3.117GlnAla: 3.117 ± 0.346
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
0.0GlnGlu: 0.0 ± 0.0
1.247GlnPhe: 1.247 ± 0.027
1.87GlnGly: 1.87 ± 1.283
0.623GlnHis: 0.623 ± 0.4
0.623GlnIle: 0.623 ± 0.4
0.623GlnLys: 0.623 ± 0.4
0.623GlnLeu: 0.623 ± 0.4
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.494GlnPro: 2.494 ± 0.883
0.0GlnGln: 0.0 ± 0.0
0.0GlnArg: 0.0 ± 0.0
3.741GlnSer: 3.741 ± 0.082
0.623GlnThr: 0.623 ± 0.428
3.741GlnVal: 3.741 ± 0.91
1.247GlnTrp: 1.247 ± 0.855
1.247GlnTyr: 1.247 ± 0.855
0.0GlnXaa: 0.0 ± 0.0
Arg
9.975ArgAla: 9.975 ± 0.609
0.0ArgCys: 0.0 ± 0.0
3.117ArgAsp: 3.117 ± 1.174
2.494ArgGlu: 2.494 ± 0.883
2.494ArgPhe: 2.494 ± 1.711
6.234ArgGly: 6.234 ± 2.621
2.494ArgHis: 2.494 ± 0.883
4.988ArgIle: 4.988 ± 0.719
1.247ArgLys: 1.247 ± 0.801
4.364ArgLeu: 4.364 ± 0.318
3.117ArgMet: 3.117 ± 2.002
1.87ArgAsn: 1.87 ± 0.373
1.247ArgPro: 1.247 ± 0.801
1.87ArgGln: 1.87 ± 0.373
4.988ArgArg: 4.988 ± 0.938
5.611ArgSer: 5.611 ± 1.119
4.364ArgThr: 4.364 ± 0.318
4.988ArgVal: 4.988 ± 0.109
1.247ArgTrp: 1.247 ± 0.801
4.988ArgTyr: 4.988 ± 0.719
0.0ArgXaa: 0.0 ± 0.0
Ser
6.234SerAla: 6.234 ± 0.137
0.0SerCys: 0.0 ± 0.0
1.247SerAsp: 1.247 ± 0.855
2.494SerGlu: 2.494 ± 1.602
1.247SerPhe: 1.247 ± 0.801
9.352SerGly: 9.352 ± 0.209
0.623SerHis: 0.623 ± 0.4
3.117SerIle: 3.117 ± 0.346
1.247SerLys: 1.247 ± 0.801
6.234SerLeu: 6.234 ± 0.691
1.247SerMet: 1.247 ± 0.801
3.741SerAsn: 3.741 ± 0.91
6.858SerPro: 6.858 ± 1.393
0.623SerGln: 0.623 ± 0.428
4.364SerArg: 4.364 ± 1.975
3.117SerSer: 3.117 ± 0.346
2.494SerThr: 2.494 ± 0.883
8.105SerVal: 8.105 ± 0.236
0.623SerTrp: 0.623 ± 0.4
6.858SerTyr: 6.858 ± 1.92
0.0SerXaa: 0.0 ± 0.0
Thr
6.234ThrAla: 6.234 ± 2.621
0.623ThrCys: 0.623 ± 0.4
6.234ThrAsp: 6.234 ± 0.137
3.741ThrGlu: 3.741 ± 2.402
1.247ThrPhe: 1.247 ± 0.027
2.494ThrGly: 2.494 ± 0.883
0.623ThrHis: 0.623 ± 0.4
2.494ThrIle: 2.494 ± 0.883
0.623ThrLys: 0.623 ± 0.4
2.494ThrLeu: 2.494 ± 0.055
0.623ThrMet: 0.623 ± 0.428
3.117ThrAsn: 3.117 ± 1.311
4.988ThrPro: 4.988 ± 1.547
1.87ThrGln: 1.87 ± 0.455
4.364ThrArg: 4.364 ± 1.338
4.988ThrSer: 4.988 ± 0.719
4.988ThrThr: 4.988 ± 0.109
9.352ThrVal: 9.352 ± 1.447
1.247ThrTrp: 1.247 ± 0.855
2.494ThrTyr: 2.494 ± 1.602
0.0ThrXaa: 0.0 ± 0.0
Val
8.728ValAla: 8.728 ± 1.02
0.623ValCys: 0.623 ± 0.4
4.364ValAsp: 4.364 ± 1.338
3.741ValGlu: 3.741 ± 1.574
2.494ValPhe: 2.494 ± 0.883
6.858ValGly: 6.858 ± 0.264
1.247ValHis: 1.247 ± 0.855
3.117ValIle: 3.117 ± 0.482
0.623ValLys: 0.623 ± 0.428
7.481ValLeu: 7.481 ± 3.148
2.494ValMet: 2.494 ± 0.359
3.117ValAsn: 3.117 ± 1.174
5.611ValPro: 5.611 ± 1.365
1.247ValGln: 1.247 ± 0.801
6.234ValArg: 6.234 ± 0.137
8.105ValSer: 8.105 ± 1.064
8.105ValThr: 8.105 ± 2.248
4.988ValVal: 4.988 ± 0.719
0.0ValTrp: 0.0 ± 0.0
1.87ValTyr: 1.87 ± 0.373
0.0ValXaa: 0.0 ± 0.0
Trp
2.494TrpAla: 2.494 ± 1.602
0.0TrpCys: 0.0 ± 0.0
1.247TrpAsp: 1.247 ± 0.027
0.623TrpGlu: 0.623 ± 0.428
0.0TrpPhe: 0.0 ± 0.0
1.247TrpGly: 1.247 ± 0.801
0.623TrpHis: 0.623 ± 0.428
0.623TrpIle: 0.623 ± 0.428
0.623TrpLys: 0.623 ± 0.4
1.247TrpLeu: 1.247 ± 0.801
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.623TrpGln: 0.623 ± 0.4
2.494TrpArg: 2.494 ± 0.773
0.623TrpSer: 0.623 ± 0.428
0.0TrpThr: 0.0 ± 0.0
2.494TrpVal: 2.494 ± 0.883
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.741TyrAla: 3.741 ± 0.082
0.623TyrCys: 0.623 ± 0.4
1.247TyrAsp: 1.247 ± 0.801
1.247TyrGlu: 1.247 ± 0.027
0.623TyrPhe: 0.623 ± 0.428
4.364TyrGly: 4.364 ± 0.51
0.623TyrHis: 0.623 ± 0.4
0.623TyrIle: 0.623 ± 0.4
3.117TyrLys: 3.117 ± 2.002
7.481TyrLeu: 7.481 ± 2.32
1.87TyrMet: 1.87 ± 0.455
1.247TyrAsn: 1.247 ± 0.801
3.741TyrPro: 3.741 ± 0.746
0.623TyrGln: 0.623 ± 0.428
2.494TyrArg: 2.494 ± 0.883
0.623TyrSer: 0.623 ± 0.4
1.87TyrThr: 1.87 ± 0.373
2.494TyrVal: 2.494 ± 1.602
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1605 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
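One way to read the note above is that whole proteins are resampled with replacement and the frequency is recomputed for each replicate. The sketch below follows that reading. It is an assumption, not the database's documented procedure: the function names, the use of the standard deviation across replicates as the error, and the fixed random seed are all choices made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the error is the standard deviation of the frequency
    across replicates; 100 replicates mirrors the "x100" in the note.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)
```

With only two proteins in the proteome, each replicate can take just a few distinct compositions, so error estimates obtained this way are coarse.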

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski