Amino acid dipepetide frequency for Grapevine Pinot gris virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.654AlaAla: 1.654 ± 0.56
1.241AlaCys: 1.241 ± 0.595
2.481AlaAsp: 2.481 ± 0.811
4.549AlaGlu: 4.549 ± 2.209
6.203AlaPhe: 6.203 ± 1.62
4.136AlaGly: 4.136 ± 1.148
0.827AlaHis: 0.827 ± 0.397
2.895AlaIle: 2.895 ± 1.378
4.963AlaLys: 4.963 ± 0.745
2.895AlaLeu: 2.895 ± 1.516
0.0AlaMet: 0.0 ± 0.0
2.895AlaAsn: 2.895 ± 1.143
1.654AlaPro: 1.654 ± 0.928
1.241AlaGln: 1.241 ± 0.599
1.241AlaArg: 1.241 ± 0.595
3.309AlaSer: 3.309 ± 0.97
1.654AlaThr: 1.654 ± 1.121
3.309AlaVal: 3.309 ± 0.281
0.0AlaTrp: 0.0 ± 0.0
0.827AlaTyr: 0.827 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
1.241CysAla: 1.241 ± 1.119
0.0CysCys: 0.0 ± 0.0
1.654CysAsp: 1.654 ± 0.56
1.241CysGlu: 1.241 ± 0.595
1.241CysPhe: 1.241 ± 0.595
1.241CysGly: 1.241 ± 0.595
0.827CysHis: 0.827 ± 0.397
1.654CysIle: 1.654 ± 0.793
1.654CysLys: 1.654 ± 0.793
1.241CysLeu: 1.241 ± 0.406
0.0CysMet: 0.0 ± 0.0
2.068CysAsn: 2.068 ± 0.991
1.241CysPro: 1.241 ± 0.406
0.0CysGln: 0.0 ± 0.0
1.241CysArg: 1.241 ± 0.595
1.654CysSer: 1.654 ± 0.306
0.827CysThr: 0.827 ± 0.397
0.414CysVal: 0.414 ± 0.198
0.0CysTrp: 0.0 ± 0.0
1.241CysTyr: 1.241 ± 0.599
0.0CysXaa: 0.0 ± 0.0
Asp
1.241AspAla: 1.241 ± 0.599
2.068AspCys: 2.068 ± 0.991
3.722AspAsp: 3.722 ± 1.784
4.963AspGlu: 4.963 ± 2.379
4.136AspPhe: 4.136 ± 0.4
2.895AspGly: 2.895 ± 0.594
1.241AspHis: 1.241 ± 0.599
2.481AspIle: 2.481 ± 0.559
3.722AspLys: 3.722 ± 1.124
7.031AspLeu: 7.031 ± 0.332
0.827AspMet: 0.827 ± 0.397
2.481AspAsn: 2.481 ± 0.811
4.136AspPro: 4.136 ± 1.091
0.827AspGln: 0.827 ± 0.56
2.481AspArg: 2.481 ± 1.681
4.136AspSer: 4.136 ± 2.802
0.827AspThr: 0.827 ± 0.56
6.203AspVal: 6.203 ± 1.363
2.068AspTrp: 2.068 ± 0.991
3.309AspTyr: 3.309 ± 0.772
0.0AspXaa: 0.0 ± 0.0
Glu
1.654GluAla: 1.654 ± 0.56
1.654GluCys: 1.654 ± 0.793
4.963GluAsp: 4.963 ± 1.535
3.309GluGlu: 3.309 ± 1.586
4.963GluPhe: 4.963 ± 0.866
4.549GluGly: 4.549 ± 1.78
0.827GluHis: 0.827 ± 0.397
5.79GluIle: 5.79 ± 1.187
5.376GluLys: 5.376 ± 1.854
4.549GluLeu: 4.549 ± 0.88
2.895GluMet: 2.895 ± 0.594
4.136GluAsn: 4.136 ± 1.091
3.722GluPro: 3.722 ± 0.958
2.481GluGln: 2.481 ± 0.433
3.722GluArg: 3.722 ± 2.076
9.512GluSer: 9.512 ± 1.701
0.827GluThr: 0.827 ± 0.695
5.376GluVal: 5.376 ± 1.473
0.0GluTrp: 0.0 ± 0.0
0.827GluTyr: 0.827 ± 0.695
0.0GluXaa: 0.0 ± 0.0
Phe
3.722PheAla: 3.722 ± 1.131
2.481PheCys: 2.481 ± 1.19
4.549PheAsp: 4.549 ± 0.564
5.376PheGlu: 5.376 ± 1.02
5.376PhePhe: 5.376 ± 2.577
3.309PheGly: 3.309 ± 1.36
2.481PheHis: 2.481 ± 0.559
2.895PheIle: 2.895 ± 0.594
5.79PheLys: 5.79 ± 1.187
7.031PheLeu: 7.031 ± 2.515
1.654PheMet: 1.654 ± 0.793
3.722PheAsn: 3.722 ± 2.053
1.654PhePro: 1.654 ± 0.793
1.241PheGln: 1.241 ± 0.595
2.895PheArg: 2.895 ± 1.168
8.685PheSer: 8.685 ± 1.608
2.481PheThr: 2.481 ± 0.433
2.068PheVal: 2.068 ± 0.958
0.827PheTrp: 0.827 ± 0.397
3.309PheTyr: 3.309 ± 0.957
0.0PheXaa: 0.0 ± 0.0
Gly
2.481GlyAla: 2.481 ± 1.681
1.654GlyCys: 1.654 ± 0.306
2.895GlyAsp: 2.895 ± 0.691
2.895GlyGlu: 2.895 ± 1.378
3.309GlyPhe: 3.309 ± 1.586
0.827GlyGly: 0.827 ± 0.397
1.654GlyHis: 1.654 ± 0.793
3.722GlyIle: 3.722 ± 0.592
3.309GlyLys: 3.309 ± 0.772
5.79GlyLeu: 5.79 ± 1.772
0.414GlyMet: 0.414 ± 0.198
2.068GlyAsn: 2.068 ± 0.74
1.241GlyPro: 1.241 ± 0.406
0.414GlyGln: 0.414 ± 0.198
3.722GlyArg: 3.722 ± 1.217
7.031GlySer: 7.031 ± 1.188
3.722GlyThr: 3.722 ± 0.772
5.376GlyVal: 5.376 ± 1.02
2.068GlyTrp: 2.068 ± 0.991
2.068GlyTyr: 2.068 ± 0.991
0.0GlyXaa: 0.0 ± 0.0
His
0.414HisAla: 0.414 ± 0.198
0.827HisCys: 0.827 ± 0.397
0.827HisAsp: 0.827 ± 0.397
1.654HisGlu: 1.654 ± 0.793
3.309HisPhe: 3.309 ± 0.612
2.068HisGly: 2.068 ± 0.588
1.241HisHis: 1.241 ± 0.595
1.654HisIle: 1.654 ± 0.56
0.414HisLys: 0.414 ± 0.198
2.068HisLeu: 2.068 ± 0.991
0.0HisMet: 0.0 ± 0.0
0.414HisAsn: 0.414 ± 0.198
0.827HisPro: 0.827 ± 0.695
0.827HisGln: 0.827 ± 0.397
2.068HisArg: 2.068 ± 0.318
1.654HisSer: 1.654 ± 0.793
0.827HisThr: 0.827 ± 1.658
0.414HisVal: 0.414 ± 0.829
0.414HisTrp: 0.414 ± 0.198
1.241HisTyr: 1.241 ± 0.595
0.0HisXaa: 0.0 ± 0.0
Ile
3.309IleAla: 3.309 ± 2.662
0.414IleCys: 0.414 ± 0.198
4.136IleAsp: 4.136 ± 1.482
4.549IleGlu: 4.549 ± 2.181
2.481IlePhe: 2.481 ± 0.676
1.654IleGly: 1.654 ± 1.121
1.654IleHis: 1.654 ± 0.306
1.654IleIle: 1.654 ± 0.793
3.722IleLys: 3.722 ± 1.124
6.203IleLeu: 6.203 ± 2.122
0.827IleMet: 0.827 ± 0.397
2.895IleAsn: 2.895 ± 0.691
2.895IlePro: 2.895 ± 0.594
2.068IleGln: 2.068 ± 0.588
4.549IleArg: 4.549 ± 1.252
5.376IleSer: 5.376 ± 0.887
2.481IleThr: 2.481 ± 0.559
1.241IleVal: 1.241 ± 2.209
0.414IleTrp: 0.414 ± 0.198
2.481IleTyr: 2.481 ± 1.541
0.0IleXaa: 0.0 ± 0.0
Lys
3.722LysAla: 3.722 ± 1.124
0.827LysCys: 0.827 ± 0.695
3.309LysAsp: 3.309 ± 0.281
4.136LysGlu: 4.136 ± 1.48
4.963LysPhe: 4.963 ± 1.666
6.203LysGly: 6.203 ± 2.028
2.068LysHis: 2.068 ± 0.588
3.722LysIle: 3.722 ± 0.283
5.376LysLys: 5.376 ± 1.854
8.685LysLeu: 8.685 ± 1.629
2.895LysMet: 2.895 ± 0.594
2.895LysAsn: 2.895 ± 1.388
2.068LysPro: 2.068 ± 0.991
0.827LysGln: 0.827 ± 1.313
4.963LysArg: 4.963 ± 0.866
7.444LysSer: 7.444 ± 2.758
2.895LysThr: 2.895 ± 0.804
5.376LysVal: 5.376 ± 1.02
0.827LysTrp: 0.827 ± 0.397
2.068LysTyr: 2.068 ± 0.991
0.0LysXaa: 0.0 ± 0.0
Leu
5.79LeuAla: 5.79 ± 2.044
2.068LeuCys: 2.068 ± 0.588
8.271LeuAsp: 8.271 ± 2.298
4.549LeuGlu: 4.549 ± 1.341
7.444LeuPhe: 7.444 ± 0.961
4.963LeuGly: 4.963 ± 1.666
1.654LeuHis: 1.654 ± 0.928
3.309LeuIle: 3.309 ± 0.957
10.753LeuLys: 10.753 ± 1.675
8.685LeuLeu: 8.685 ± 0.637
3.309LeuMet: 3.309 ± 0.97
6.617LeuAsn: 6.617 ± 2.036
3.309LeuPro: 3.309 ± 0.281
3.309LeuGln: 3.309 ± 1.36
4.549LeuArg: 4.549 ± 1.481
7.031LeuSer: 7.031 ± 2.023
4.549LeuThr: 4.549 ± 1.252
4.963LeuVal: 4.963 ± 0.831
0.414LeuTrp: 0.414 ± 0.198
1.654LeuTyr: 1.654 ± 0.793
0.0LeuXaa: 0.0 ± 0.0
Met
1.654MetAla: 1.654 ± 1.121
0.414MetCys: 0.414 ± 0.198
1.241MetAsp: 1.241 ± 0.595
1.654MetGlu: 1.654 ± 0.306
0.414MetPhe: 0.414 ± 0.198
0.414MetGly: 0.414 ± 0.198
0.414MetHis: 0.414 ± 0.829
1.241MetIle: 1.241 ± 0.595
1.654MetLys: 1.654 ± 0.306
0.414MetLeu: 0.414 ± 0.198
0.414MetMet: 0.414 ± 0.538
1.241MetAsn: 1.241 ± 0.406
0.0MetPro: 0.0 ± 0.0
1.241MetGln: 1.241 ± 0.595
1.241MetArg: 1.241 ± 0.595
3.722MetSer: 3.722 ± 0.283
2.068MetThr: 2.068 ± 0.588
2.481MetVal: 2.481 ± 2.262
0.414MetTrp: 0.414 ± 0.198
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.068AsnAla: 2.068 ± 0.74
1.241AsnCys: 1.241 ± 0.595
2.481AsnAsp: 2.481 ± 2.262
4.963AsnGlu: 4.963 ± 0.179
4.963AsnPhe: 4.963 ± 0.179
4.549AsnGly: 4.549 ± 0.734
0.827AsnHis: 0.827 ± 0.397
2.895AsnIle: 2.895 ± 2.046
2.481AsnLys: 2.481 ± 1.541
6.617AsnLeu: 6.617 ± 1.068
1.241AsnMet: 1.241 ± 1.119
2.068AsnAsn: 2.068 ± 1.712
1.241AsnPro: 1.241 ± 0.599
2.481AsnGln: 2.481 ± 1.199
2.068AsnArg: 2.068 ± 0.588
4.963AsnSer: 4.963 ± 1.622
1.241AsnThr: 1.241 ± 0.595
2.895AsnVal: 2.895 ± 0.594
0.414AsnTrp: 0.414 ± 0.198
2.481AsnTyr: 2.481 ± 1.681
0.0AsnXaa: 0.0 ± 0.0
Pro
1.241ProAla: 1.241 ± 0.595
0.0ProCys: 0.0 ± 0.0
1.654ProAsp: 1.654 ± 0.793
4.136ProGlu: 4.136 ± 1.299
2.068ProPhe: 2.068 ± 0.318
0.827ProGly: 0.827 ± 0.56
0.0ProHis: 0.0 ± 0.0
3.722ProIle: 3.722 ± 0.592
1.654ProLys: 1.654 ± 0.56
3.722ProLeu: 3.722 ± 0.592
0.827ProMet: 0.827 ± 1.473
2.481ProAsn: 2.481 ± 1.366
0.414ProPro: 0.414 ± 0.198
0.414ProGln: 0.414 ± 0.198
1.241ProArg: 1.241 ± 0.406
3.722ProSer: 3.722 ± 1.676
1.654ProThr: 1.654 ± 0.56
1.654ProVal: 1.654 ± 0.928
0.827ProTrp: 0.827 ± 0.397
0.827ProTyr: 0.827 ± 0.397
0.0ProXaa: 0.0 ± 0.0
Gln
3.722GlnAla: 3.722 ± 0.592
0.827GlnCys: 0.827 ± 0.397
1.241GlnAsp: 1.241 ± 0.406
2.068GlnGlu: 2.068 ± 1.564
0.827GlnPhe: 0.827 ± 0.397
0.827GlnGly: 0.827 ± 0.397
0.0GlnHis: 0.0 ± 0.0
1.241GlnIle: 1.241 ± 0.595
2.481GlnLys: 2.481 ± 1.199
2.895GlnLeu: 2.895 ± 1.168
1.241GlnMet: 1.241 ± 0.558
1.241GlnAsn: 1.241 ± 0.599
0.414GlnPro: 0.414 ± 0.736
1.241GlnGln: 1.241 ± 0.595
2.068GlnArg: 2.068 ± 0.991
0.827GlnSer: 0.827 ± 0.397
1.654GlnThr: 1.654 ± 0.306
2.481GlnVal: 2.481 ± 1.541
0.0GlnTrp: 0.0 ± 0.0
0.827GlnTyr: 0.827 ± 0.397
0.0GlnXaa: 0.0 ± 0.0
Arg
2.481ArgAla: 2.481 ± 0.433
0.414ArgCys: 0.414 ± 0.198
3.309ArgAsp: 3.309 ± 0.772
3.309ArgGlu: 3.309 ± 0.957
2.895ArgPhe: 2.895 ± 1.388
3.309ArgGly: 3.309 ± 0.772
0.827ArgHis: 0.827 ± 0.397
3.722ArgIle: 3.722 ± 1.217
2.895ArgLys: 2.895 ± 0.691
5.79ArgLeu: 5.79 ± 0.791
1.241ArgMet: 1.241 ± 0.37
2.895ArgAsn: 2.895 ± 0.691
1.241ArgPro: 1.241 ± 0.406
1.241ArgGln: 1.241 ± 1.517
1.241ArgArg: 1.241 ± 1.961
2.481ArgSer: 2.481 ± 0.559
2.068ArgThr: 2.068 ± 0.318
3.309ArgVal: 3.309 ± 0.281
0.414ArgTrp: 0.414 ± 0.736
4.136ArgTyr: 4.136 ± 1.091
0.0ArgXaa: 0.0 ± 0.0
Ser
4.549SerAla: 4.549 ± 1.289
0.827SerCys: 0.827 ± 0.56
6.617SerAsp: 6.617 ± 1.224
8.685SerGlu: 8.685 ± 1.863
5.376SerPhe: 5.376 ± 0.911
7.031SerGly: 7.031 ± 3.546
2.895SerHis: 2.895 ± 0.804
4.136SerIle: 4.136 ± 2.4
4.963SerLys: 4.963 ± 0.866
8.685SerLeu: 8.685 ± 0.949
2.068SerMet: 2.068 ± 0.74
7.031SerAsn: 7.031 ± 1.162
2.068SerPro: 2.068 ± 0.318
2.895SerGln: 2.895 ± 0.691
3.722SerArg: 3.722 ± 0.958
7.031SerSer: 7.031 ± 2.574
2.068SerThr: 2.068 ± 1.564
5.79SerVal: 5.79 ± 0.223
1.654SerTrp: 1.654 ± 0.306
3.309SerTyr: 3.309 ± 0.612
0.0SerXaa: 0.0 ± 0.0
Thr
1.654ThrAla: 1.654 ± 0.928
0.414ThrCys: 0.414 ± 0.736
0.414ThrAsp: 0.414 ± 0.198
2.068ThrGlu: 2.068 ± 0.74
5.376ThrPhe: 5.376 ± 1.854
2.068ThrGly: 2.068 ± 0.318
1.241ThrHis: 1.241 ± 0.595
2.068ThrIle: 2.068 ± 0.74
2.481ThrLys: 2.481 ± 0.433
4.136ThrLeu: 4.136 ± 1.148
1.241ThrMet: 1.241 ± 0.599
2.895ThrAsn: 2.895 ± 2.361
1.241ThrPro: 1.241 ± 1.119
0.0ThrGln: 0.0 ± 0.0
0.827ThrArg: 0.827 ± 0.397
3.309ThrSer: 3.309 ± 0.97
0.827ThrThr: 0.827 ± 1.313
1.654ThrVal: 1.654 ± 0.56
0.0ThrTrp: 0.0 ± 0.0
1.654ThrTyr: 1.654 ± 0.306
0.0ThrXaa: 0.0 ± 0.0
Val
3.309ValAla: 3.309 ± 0.281
1.654ValCys: 1.654 ± 0.928
3.722ValAsp: 3.722 ± 0.592
4.549ValGlu: 4.549 ± 1.297
2.895ValPhe: 2.895 ± 0.594
3.309ValGly: 3.309 ± 0.772
1.654ValHis: 1.654 ± 0.793
2.481ValIle: 2.481 ± 0.676
7.858ValLys: 7.858 ± 2.21
7.031ValLeu: 7.031 ± 1.204
0.414ValMet: 0.414 ± 0.198
1.654ValAsn: 1.654 ± 0.56
2.481ValPro: 2.481 ± 0.811
2.481ValGln: 2.481 ± 0.433
2.068ValArg: 2.068 ± 2.458
4.963ValSer: 4.963 ± 1.535
1.654ValThr: 1.654 ± 0.306
2.481ValVal: 2.481 ± 1.199
0.414ValTrp: 0.414 ± 0.198
2.895ValTyr: 2.895 ± 1.168
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.736
0.414TrpCys: 0.414 ± 0.198
1.241TrpAsp: 1.241 ± 0.406
0.414TrpGlu: 0.414 ± 0.198
0.414TrpPhe: 0.414 ± 0.198
0.414TrpGly: 0.414 ± 0.198
0.414TrpHis: 0.414 ± 0.198
0.414TrpIle: 0.414 ± 0.198
0.827TrpLys: 0.827 ± 0.397
1.241TrpLeu: 1.241 ± 0.595
0.414TrpMet: 0.414 ± 0.198
0.414TrpAsn: 0.414 ± 0.198
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.241TrpArg: 1.241 ± 0.595
1.654TrpSer: 1.654 ± 0.793
0.414TrpThr: 0.414 ± 0.198
0.827TrpVal: 0.827 ± 0.397
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.654TyrAla: 1.654 ± 0.306
1.241TyrCys: 1.241 ± 0.595
2.068TyrAsp: 2.068 ± 0.318
2.068TyrGlu: 2.068 ± 0.588
2.895TyrPhe: 2.895 ± 0.594
2.068TyrGly: 2.068 ± 0.991
0.414TyrHis: 0.414 ± 0.198
3.309TyrIle: 3.309 ± 0.772
2.895TyrLys: 2.895 ± 2.046
2.481TyrLeu: 2.481 ± 0.433
0.414TyrMet: 0.414 ± 0.198
2.068TyrAsn: 2.068 ± 0.958
1.241TyrPro: 1.241 ± 0.599
2.895TyrGln: 2.895 ± 0.594
2.068TyrArg: 2.068 ± 0.74
2.895TyrSer: 2.895 ± 0.396
0.827TyrThr: 0.827 ± 0.397
1.654TyrVal: 1.654 ± 0.306
0.0TyrTrp: 0.0 ± 0.0
0.414TyrTyr: 0.414 ± 0.198
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski