Amino acid dipepetide frequency for Grapevine rupestris stem pitting-associated virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.115AlaAla: 2.115 ± 0.7
1.057AlaCys: 1.057 ± 0.578
1.41AlaAsp: 1.41 ± 0.771
2.82AlaGlu: 2.82 ± 2.117
4.935AlaPhe: 4.935 ± 2.379
4.935AlaGly: 4.935 ± 0.951
1.762AlaHis: 1.762 ± 2.31
4.23AlaIle: 4.23 ± 1.399
5.992AlaLys: 5.992 ± 2.028
10.575AlaLeu: 10.575 ± 2.169
0.705AlaMet: 0.705 ± 0.763
2.82AlaAsn: 2.82 ± 1.691
2.82AlaPro: 2.82 ± 1.309
1.057AlaGln: 1.057 ± 0.684
3.172AlaArg: 3.172 ± 2.052
3.877AlaSer: 3.877 ± 1.415
2.82AlaThr: 2.82 ± 1.246
3.525AlaVal: 3.525 ± 2.798
0.0AlaTrp: 0.0 ± 0.0
1.057AlaTyr: 1.057 ± 0.578
0.0AlaXaa: 0.0 ± 0.0
Cys
2.115CysAla: 2.115 ± 0.857
0.0CysCys: 0.0 ± 0.0
1.057CysAsp: 1.057 ± 0.684
1.41CysGlu: 1.41 ± 0.793
2.82CysPhe: 2.82 ± 1.246
3.172CysGly: 3.172 ± 1.318
1.41CysHis: 1.41 ± 0.805
1.057CysIle: 1.057 ± 0.578
1.057CysLys: 1.057 ± 0.83
2.82CysLeu: 2.82 ± 1.327
1.057CysMet: 1.057 ± 1.262
1.057CysAsn: 1.057 ± 0.848
0.0CysPro: 0.0 ± 0.0
1.057CysGln: 1.057 ± 0.578
2.115CysArg: 2.115 ± 1.027
2.82CysSer: 2.82 ± 0.974
1.057CysThr: 1.057 ± 0.578
0.705CysVal: 0.705 ± 0.386
0.0CysTrp: 0.0 ± 0.0
0.352CysTyr: 0.352 ± 0.193
0.0CysXaa: 0.0 ± 0.0
Asp
1.762AspAla: 1.762 ± 0.964
1.41AspCys: 1.41 ± 0.771
3.172AspAsp: 3.172 ± 1.735
4.935AspGlu: 4.935 ± 1.852
3.525AspPhe: 3.525 ± 0.625
2.115AspGly: 2.115 ± 1.027
0.352AspHis: 0.352 ± 0.193
3.172AspIle: 3.172 ± 1.159
1.057AspLys: 1.057 ± 0.83
5.992AspLeu: 5.992 ± 1.755
1.057AspMet: 1.057 ± 0.702
1.41AspAsn: 1.41 ± 0.805
1.762AspPro: 1.762 ± 1.728
0.352AspGln: 0.352 ± 0.193
2.82AspArg: 2.82 ± 1.066
3.877AspSer: 3.877 ± 1.346
0.352AspThr: 0.352 ± 0.193
3.877AspVal: 3.877 ± 1.531
0.352AspTrp: 0.352 ± 0.193
2.115AspTyr: 2.115 ± 1.15
0.0AspXaa: 0.0 ± 0.0
Glu
6.345GluAla: 6.345 ± 1.075
1.057GluCys: 1.057 ± 0.578
2.82GluAsp: 2.82 ± 1.587
6.345GluGlu: 6.345 ± 2.902
2.115GluPhe: 2.115 ± 1.66
4.582GluGly: 4.582 ± 1.616
1.762GluHis: 1.762 ± 0.964
2.467GluIle: 2.467 ± 1.711
3.525GluLys: 3.525 ± 0.625
5.287GluLeu: 5.287 ± 1.031
3.877GluMet: 3.877 ± 1.335
4.23GluAsn: 4.23 ± 1.876
1.762GluPro: 1.762 ± 0.68
1.057GluGln: 1.057 ± 0.578
1.762GluArg: 1.762 ± 0.964
6.697GluSer: 6.697 ± 2.492
1.41GluThr: 1.41 ± 1.526
8.107GluVal: 8.107 ± 1.393
1.057GluTrp: 1.057 ± 0.578
1.057GluTyr: 1.057 ± 0.83
0.0GluXaa: 0.0 ± 0.0
Phe
3.172PheAla: 3.172 ± 1.159
1.762PheCys: 1.762 ± 1.437
3.525PheAsp: 3.525 ± 1.007
5.64PheGlu: 5.64 ± 1.138
2.82PhePhe: 2.82 ± 1.006
3.877PheGly: 3.877 ± 3.223
2.115PheHis: 2.115 ± 1.157
3.172PheIle: 3.172 ± 2.274
4.23PheLys: 4.23 ± 1.713
9.165PheLeu: 9.165 ± 3.511
1.057PheMet: 1.057 ± 0.578
3.525PheAsn: 3.525 ± 0.625
2.115PhePro: 2.115 ± 2.033
1.057PheGln: 1.057 ± 0.848
1.762PheArg: 1.762 ± 0.964
7.755PheSer: 7.755 ± 2.172
3.172PheThr: 3.172 ± 1.584
4.582PheVal: 4.582 ± 1.201
0.352PheTrp: 0.352 ± 0.193
1.057PheTyr: 1.057 ± 0.83
0.0PheXaa: 0.0 ± 0.0
Gly
2.82GlyAla: 2.82 ± 1.309
1.762GlyCys: 1.762 ± 1.401
2.82GlyAsp: 2.82 ± 1.052
4.582GlyGlu: 4.582 ± 2.0
3.172GlyPhe: 3.172 ± 1.375
4.23GlyGly: 4.23 ± 1.74
0.705GlyHis: 0.705 ± 0.386
7.402GlyIle: 7.402 ± 4.626
5.287GlyLys: 5.287 ± 2.512
7.402GlyLeu: 7.402 ± 3.232
1.057GlyMet: 1.057 ± 0.578
3.172GlyAsn: 3.172 ± 1.447
2.467GlyPro: 2.467 ± 1.346
1.762GlyGln: 1.762 ± 0.807
4.582GlyArg: 4.582 ± 1.483
7.402GlySer: 7.402 ± 1.657
2.115GlyThr: 2.115 ± 1.082
3.525GlyVal: 3.525 ± 3.966
1.057GlyTrp: 1.057 ± 0.578
0.705GlyTyr: 0.705 ± 0.386
0.0GlyXaa: 0.0 ± 0.0
His
0.705HisAla: 0.705 ± 0.386
2.115HisCys: 2.115 ± 1.14
1.41HisAsp: 1.41 ± 0.771
1.762HisGlu: 1.762 ± 1.437
2.467HisPhe: 2.467 ± 0.942
1.057HisGly: 1.057 ± 0.848
1.057HisHis: 1.057 ± 0.578
2.115HisIle: 2.115 ± 1.157
2.115HisLys: 2.115 ± 1.157
1.41HisLeu: 1.41 ± 0.654
0.0HisMet: 0.0 ± 0.0
2.115HisAsn: 2.115 ± 0.857
1.057HisPro: 1.057 ± 0.848
0.705HisGln: 0.705 ± 1.816
0.705HisArg: 0.705 ± 0.907
3.172HisSer: 3.172 ± 1.203
0.705HisThr: 0.705 ± 0.386
1.41HisVal: 1.41 ± 1.575
0.352HisTrp: 0.352 ± 1.015
0.705HisTyr: 0.705 ± 0.386
0.0HisXaa: 0.0 ± 0.0
Ile
4.23IleAla: 4.23 ± 4.224
1.762IleCys: 1.762 ± 0.803
2.467IleAsp: 2.467 ± 1.711
5.287IleGlu: 5.287 ± 1.709
2.82IlePhe: 2.82 ± 1.052
4.23IleGly: 4.23 ± 2.164
2.82IleHis: 2.82 ± 2.188
1.762IleIle: 1.762 ± 1.681
4.935IleLys: 4.935 ± 1.212
5.64IleLeu: 5.64 ± 3.076
2.115IleMet: 2.115 ± 0.857
3.525IleAsn: 3.525 ± 1.323
2.467IlePro: 2.467 ± 0.595
0.352IleGln: 0.352 ± 1.234
3.877IleArg: 3.877 ± 1.372
7.05IleSer: 7.05 ± 1.25
3.525IleThr: 3.525 ± 1.613
1.762IleVal: 1.762 ± 1.663
0.352IleTrp: 0.352 ± 0.193
1.41IleTyr: 1.41 ± 0.771
0.0IleXaa: 0.0 ± 0.0
Lys
5.64LysAla: 5.64 ± 1.11
0.352LysCys: 0.352 ± 0.193
2.82LysAsp: 2.82 ± 1.543
3.172LysGlu: 3.172 ± 1.914
5.992LysPhe: 5.992 ± 1.755
3.172LysGly: 3.172 ± 1.321
1.057LysHis: 1.057 ± 0.684
3.172LysIle: 3.172 ± 1.159
3.877LysLys: 3.877 ± 1.494
5.287LysLeu: 5.287 ± 1.268
1.057LysMet: 1.057 ± 0.578
1.41LysAsn: 1.41 ± 0.771
4.23LysPro: 4.23 ± 1.683
1.762LysGln: 1.762 ± 1.181
2.82LysArg: 2.82 ± 3.053
5.992LysSer: 5.992 ± 2.829
2.467LysThr: 2.467 ± 1.246
5.287LysVal: 5.287 ± 2.408
0.0LysTrp: 0.0 ± 0.0
0.352LysTyr: 0.352 ± 1.015
0.0LysXaa: 0.0 ± 0.0
Leu
7.755LeuAla: 7.755 ± 1.797
2.467LeuCys: 2.467 ± 1.35
4.935LeuAsp: 4.935 ± 2.383
6.697LeuGlu: 6.697 ± 2.37
5.992LeuPhe: 5.992 ± 2.083
8.107LeuGly: 8.107 ± 2.53
1.41LeuHis: 1.41 ± 0.805
8.107LeuIle: 8.107 ± 3.124
7.402LeuLys: 7.402 ± 0.941
6.345LeuLeu: 6.345 ± 2.758
1.41LeuMet: 1.41 ± 0.805
3.877LeuAsn: 3.877 ± 1.494
3.877LeuPro: 3.877 ± 0.652
1.762LeuGln: 1.762 ± 0.964
6.697LeuArg: 6.697 ± 1.91
9.87LeuSer: 9.87 ± 1.377
5.287LeuThr: 5.287 ± 1.491
7.755LeuVal: 7.755 ± 2.461
0.0LeuTrp: 0.0 ± 0.0
1.41LeuTyr: 1.41 ± 0.771
0.0LeuXaa: 0.0 ± 0.0
Met
2.115MetAla: 2.115 ± 1.368
0.352MetCys: 0.352 ± 0.193
1.41MetAsp: 1.41 ± 0.654
0.705MetGlu: 0.705 ± 0.386
0.705MetPhe: 0.705 ± 0.386
1.057MetGly: 1.057 ± 0.578
0.705MetHis: 0.705 ± 0.386
1.41MetIle: 1.41 ± 0.771
1.41MetLys: 1.41 ± 0.771
1.41MetLeu: 1.41 ± 0.805
1.057MetMet: 1.057 ± 0.578
1.057MetAsn: 1.057 ± 0.83
1.762MetPro: 1.762 ± 0.807
1.41MetGln: 1.41 ± 0.793
2.115MetArg: 2.115 ± 0.756
0.0MetSer: 0.0 ± 0.0
1.057MetThr: 1.057 ± 0.578
0.352MetVal: 0.352 ± 0.878
0.352MetTrp: 0.352 ± 0.193
1.41MetTyr: 1.41 ± 1.326
0.0MetXaa: 0.0 ± 0.0
Asn
2.115AsnAla: 2.115 ± 1.368
2.82AsnCys: 2.82 ± 2.114
0.705AsnAsp: 0.705 ± 0.386
4.23AsnGlu: 4.23 ± 1.963
2.82AsnPhe: 2.82 ± 0.891
2.82AsnGly: 2.82 ± 1.543
2.467AsnHis: 2.467 ± 1.227
3.172AsnIle: 3.172 ± 2.453
1.057AsnLys: 1.057 ± 0.578
5.992AsnLeu: 5.992 ± 0.961
1.762AsnMet: 1.762 ± 0.964
1.41AsnAsn: 1.41 ± 0.993
2.467AsnPro: 2.467 ± 1.011
0.352AsnGln: 0.352 ± 0.193
1.762AsnArg: 1.762 ± 0.68
4.23AsnSer: 4.23 ± 1.079
1.41AsnThr: 1.41 ± 0.793
2.115AsnVal: 2.115 ± 1.521
1.41AsnTrp: 1.41 ± 1.3
1.41AsnTyr: 1.41 ± 0.771
0.0AsnXaa: 0.0 ± 0.0
Pro
1.762ProAla: 1.762 ± 1.181
2.115ProCys: 2.115 ± 1.027
2.467ProAsp: 2.467 ± 0.947
2.82ProGlu: 2.82 ± 0.54
2.467ProPhe: 2.467 ± 2.071
3.525ProGly: 3.525 ± 2.355
0.705ProHis: 0.705 ± 0.93
1.057ProIle: 1.057 ± 0.578
1.762ProLys: 1.762 ± 1.601
3.525ProLeu: 3.525 ± 1.613
0.705ProMet: 0.705 ± 0.386
2.82ProAsn: 2.82 ± 1.077
1.41ProPro: 1.41 ± 1.526
1.762ProGln: 1.762 ± 0.964
2.467ProArg: 2.467 ± 1.35
2.467ProSer: 2.467 ± 1.346
2.115ProThr: 2.115 ± 1.521
1.057ProVal: 1.057 ± 0.83
1.057ProTrp: 1.057 ± 0.578
2.115ProTyr: 2.115 ± 2.002
0.0ProXaa: 0.0 ± 0.0
Gln
1.41GlnAla: 1.41 ± 0.654
0.705GlnCys: 0.705 ± 0.386
1.057GlnAsp: 1.057 ± 0.83
0.352GlnGlu: 0.352 ± 0.193
2.115GlnPhe: 2.115 ± 1.082
1.41GlnGly: 1.41 ± 1.086
1.057GlnHis: 1.057 ± 0.578
2.467GlnIle: 2.467 ± 1.088
1.057GlnLys: 1.057 ± 0.684
3.172GlnLeu: 3.172 ± 1.735
0.705GlnMet: 0.705 ± 1.12
0.352GlnAsn: 0.352 ± 1.015
1.41GlnPro: 1.41 ± 1.3
1.41GlnGln: 1.41 ± 1.86
1.057GlnArg: 1.057 ± 0.684
3.525GlnSer: 3.525 ± 1.613
0.352GlnThr: 0.352 ± 0.193
1.762GlnVal: 1.762 ± 0.68
0.352GlnTrp: 0.352 ± 0.193
0.352GlnTyr: 0.352 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
4.23ArgAla: 4.23 ± 2.874
1.057ArgCys: 1.057 ± 0.578
2.115ArgAsp: 2.115 ± 1.027
4.23ArgGlu: 4.23 ± 1.4
3.525ArgPhe: 3.525 ± 0.625
2.467ArgGly: 2.467 ± 0.947
2.115ArgHis: 2.115 ± 1.742
1.057ArgIle: 1.057 ± 0.684
2.115ArgLys: 2.115 ± 1.082
5.64ArgLeu: 5.64 ± 2.398
0.352ArgMet: 0.352 ± 0.193
2.467ArgAsn: 2.467 ± 0.938
1.762ArgPro: 1.762 ± 0.964
2.115ArgGln: 2.115 ± 1.082
3.525ArgArg: 3.525 ± 1.323
6.345ArgSer: 6.345 ± 2.268
0.705ArgThr: 0.705 ± 0.386
3.172ArgVal: 3.172 ± 1.735
0.705ArgTrp: 0.705 ± 0.386
2.82ArgTyr: 2.82 ± 0.891
0.0ArgXaa: 0.0 ± 0.0
Ser
3.877SerAla: 3.877 ± 0.743
4.23SerCys: 4.23 ± 1.659
4.935SerAsp: 4.935 ± 1.212
4.23SerGlu: 4.23 ± 2.033
8.46SerPhe: 8.46 ± 2.23
6.697SerGly: 6.697 ± 3.999
2.467SerHis: 2.467 ± 1.35
6.697SerIle: 6.697 ± 1.742
7.755SerLys: 7.755 ± 2.689
5.287SerLeu: 5.287 ± 1.393
1.762SerMet: 1.762 ± 0.964
4.935SerAsn: 4.935 ± 2.932
3.172SerPro: 3.172 ± 2.841
3.877SerGln: 3.877 ± 1.425
4.582SerArg: 4.582 ± 1.535
8.812SerSer: 8.812 ± 2.338
4.935SerThr: 4.935 ± 0.917
4.935SerVal: 4.935 ± 2.196
0.705SerTrp: 0.705 ± 0.386
2.467SerTyr: 2.467 ± 1.35
0.0SerXaa: 0.0 ± 0.0
Thr
3.172ThrAla: 3.172 ± 0.551
0.705ThrCys: 0.705 ± 2.03
1.762ThrAsp: 1.762 ± 0.68
1.41ThrGlu: 1.41 ± 0.771
4.582ThrPhe: 4.582 ± 1.855
5.64ThrGly: 5.64 ± 1.11
1.41ThrHis: 1.41 ± 0.654
1.762ThrIle: 1.762 ± 0.807
1.762ThrLys: 1.762 ± 1.181
3.525ThrLeu: 3.525 ± 2.453
0.352ThrMet: 0.352 ± 0.193
0.705ThrAsn: 0.705 ± 0.386
2.115ThrPro: 2.115 ± 1.027
0.352ThrGln: 0.352 ± 0.193
1.057ThrArg: 1.057 ± 0.83
2.467ThrSer: 2.467 ± 1.35
1.057ThrThr: 1.057 ± 0.684
2.115ThrVal: 2.115 ± 2.531
0.0ThrTrp: 0.0 ± 0.0
1.762ThrTyr: 1.762 ± 0.807
0.0ThrXaa: 0.0 ± 0.0
Val
3.525ValAla: 3.525 ± 1.007
0.352ValCys: 0.352 ± 0.193
3.525ValAsp: 3.525 ± 0.976
3.877ValGlu: 3.877 ± 1.346
1.762ValPhe: 1.762 ± 0.803
3.877ValGly: 3.877 ± 2.202
0.705ValHis: 0.705 ± 0.386
6.345ValIle: 6.345 ± 1.182
1.762ValLys: 1.762 ± 1.401
8.107ValLeu: 8.107 ± 3.333
1.057ValMet: 1.057 ± 0.684
2.82ValAsn: 2.82 ± 1.587
2.467ValPro: 2.467 ± 2.963
2.82ValGln: 2.82 ± 1.231
3.877ValArg: 3.877 ± 1.515
5.992ValSer: 5.992 ± 2.083
3.172ValThr: 3.172 ± 2.054
3.172ValVal: 3.172 ± 1.375
0.705ValTrp: 0.705 ± 1.584
0.705ValTyr: 0.705 ± 0.386
0.0ValXaa: 0.0 ± 0.0
Trp
1.057TrpAla: 1.057 ± 1.436
0.352TrpCys: 0.352 ± 0.193
0.352TrpAsp: 0.352 ± 0.193
0.352TrpGlu: 0.352 ± 0.193
1.41TrpPhe: 1.41 ± 0.793
0.0TrpGly: 0.0 ± 0.0
0.352TrpHis: 0.352 ± 0.193
0.0TrpIle: 0.0 ± 0.0
0.352TrpLys: 0.352 ± 0.193
1.762TrpLeu: 1.762 ± 0.964
0.0TrpMet: 0.0 ± 0.0
1.057TrpAsn: 1.057 ± 1.436
0.705TrpPro: 0.705 ± 0.386
0.352TrpGln: 0.352 ± 0.193
0.0TrpArg: 0.0 ± 0.0
0.352TrpSer: 0.352 ± 0.193
0.0TrpThr: 0.0 ± 0.0
1.057TrpVal: 1.057 ± 0.578
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.762TyrAla: 1.762 ± 0.68
0.705TyrCys: 0.705 ± 1.816
0.705TyrAsp: 0.705 ± 0.386
2.467TyrGlu: 2.467 ± 1.35
1.41TyrPhe: 1.41 ± 0.771
1.057TyrGly: 1.057 ± 0.578
0.705TyrHis: 0.705 ± 1.155
1.762TyrIle: 1.762 ± 1.728
1.41TyrLys: 1.41 ± 0.771
2.82TyrLeu: 2.82 ± 1.587
0.352TyrMet: 0.352 ± 0.193
1.762TyrAsn: 1.762 ± 0.964
0.352TyrPro: 0.352 ± 0.193
1.057TyrGln: 1.057 ± 0.83
1.762TyrArg: 1.762 ± 0.807
2.115TyrSer: 2.115 ± 1.027
0.0TyrThr: 0.0 ± 0.0
0.352TyrVal: 0.352 ± 0.193
0.705TyrTrp: 0.705 ± 0.386
0.705TyrTyr: 0.705 ± 0.763
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2838 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski