Amino acid dipeptide frequency for Switchgrass mosaic-associated virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
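As an illustration of the idea, the following minimal Python sketch counts overlapping adjacent residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille` is hypothetical, sequences are assumed to be given in one-letter codes, and pairs are assumed not to span two proteins (how Proteome-pI treats protein boundaries is not stated on this page).

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and the result is normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # "MKVA" yields "MK", "KV", "VA"
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (not real viral sequences): 5 pairs in total, each seen once.
freqs = dipeptide_permille(["MKVA", "AAK"])
```

With this toy input each of the five observed dipeptides accounts for one fifth of all pairs; dipeptides that never occur are simply absent from the result and correspond to 0.0 in a full table.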

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be expected at a frequency of 0.25% (2.5‰ in the units used in the table). This is not the case in nature, so for better visibility overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
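The uniform expectation and the over/under comparison described above can be sketched as follows. This is only an assumed reading of the colouring rule: the page does not say whether the threshold is based on 400 or 441 dipeptides, and the function names are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation in per mille: 1000 / alphabet_size**2."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Compare an observed frequency (per mille) with the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "expected"


# Two values taken from the table: SerAla (13.598) and CysGlu (1.046).
ser_ala = classify(13.598)
cys_glu = classify(1.046)
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops to about 2.27‰, so the choice only matters for values between those two thresholds.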

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 9.414‰ corresponds to 0.009414, or 0.9414%.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.414 ± 2.878
AlaCys: 0.0 ± 0.0
AlaAsp: 8.368 ± 3.659
AlaGlu: 3.138 ± 0.414
AlaPhe: 4.184 ± 1.83
AlaGly: 5.23 ± 3.171
AlaHis: 1.046 ± 0.989
AlaIle: 2.092 ± 1.521
AlaLys: 2.092 ± 1.977
AlaLeu: 2.092 ± 0.915
AlaMet: 1.046 ± 0.989
AlaAsn: 6.276 ± 2.214
AlaPro: 2.092 ± 1.977
AlaGln: 5.23 ± 1.274
AlaArg: 5.23 ± 1.397
AlaSer: 6.276 ± 0.799
AlaThr: 12.552 ± 3.268
AlaVal: 3.138 ± 1.612
AlaTrp: 1.046 ± 1.146
AlaTyr: 1.046 ± 0.807
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.046 ± 0.845
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 1.046 ± 0.845
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 3.138 ± 1.132
CysMet: 0.0 ± 0.0
CysAsn: 1.046 ± 0.989
CysPro: 2.092 ± 0.915
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 2.092 ± 0.915
CysThr: 2.092 ± 0.915
CysVal: 1.046 ± 1.146
CysTrp: 1.046 ± 0.845
CysTyr: 1.046 ± 0.989
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.092 ± 1.977
AspCys: 1.046 ± 1.146
AspAsp: 1.046 ± 0.845
AspGlu: 3.138 ± 1.565
AspPhe: 0.0 ± 0.0
AspGly: 7.322 ± 0.554
AspHis: 0.0 ± 0.0
AspIle: 2.092 ± 0.824
AspLys: 0.0 ± 0.0
AspLeu: 2.092 ± 0.915
AspMet: 0.0 ± 0.0
AspAsn: 0.0 ± 0.0
AspPro: 0.0 ± 0.0
AspGln: 3.138 ± 0.414
AspArg: 0.0 ± 0.0
AspSer: 5.23 ± 2.421
AspThr: 1.046 ± 0.989
AspVal: 2.092 ± 0.915
AspTrp: 9.414 ± 4.207
AspTyr: 7.322 ± 1.083
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.092 ± 0.824
GluCys: 0.0 ± 0.0
GluAsp: 4.184 ± 1.649
GluGlu: 1.046 ± 0.807
GluPhe: 1.046 ± 0.989
GluGly: 2.092 ± 0.824
GluHis: 0.0 ± 0.0
GluIle: 2.092 ± 0.915
GluLys: 1.046 ± 1.146
GluLeu: 2.092 ± 1.521
GluMet: 0.0 ± 0.0
GluAsn: 1.046 ± 0.989
GluPro: 5.23 ± 1.019
GluGln: 1.046 ± 0.845
GluArg: 8.368 ± 3.845
GluSer: 5.23 ± 2.356
GluThr: 3.138 ± 0.414
GluVal: 4.184 ± 0.879
GluTrp: 2.092 ± 0.999
GluTyr: 6.276 ± 2.744
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.184 ± 0.879
PheCys: 1.046 ± 0.845
PheAsp: 4.184 ± 1.83
PheGlu: 3.138 ± 1.565
PhePhe: 2.092 ± 0.824
PheGly: 1.046 ± 1.146
PheHis: 3.138 ± 0.414
PheIle: 1.046 ± 0.989
PheLys: 5.23 ± 1.024
PheLeu: 3.138 ± 0.414
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 7.322 ± 0.554
PheGln: 2.092 ± 0.915
PheArg: 0.0 ± 0.0
PheSer: 3.138 ± 0.414
PheThr: 1.046 ± 1.146
PheVal: 4.184 ± 1.208
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.184 ± 2.079
GlyCys: 0.0 ± 0.0
GlyAsp: 2.092 ± 0.824
GlyGlu: 3.138 ± 0.414
GlyPhe: 1.046 ± 1.146
GlyGly: 8.368 ± 2.422
GlyHis: 0.0 ± 0.0
GlyIle: 4.184 ± 0.879
GlyLys: 1.046 ± 0.845
GlyLeu: 1.046 ± 0.807
GlyMet: 1.046 ± 0.989
GlyAsn: 8.368 ± 3.688
GlyPro: 3.138 ± 3.438
GlyGln: 1.046 ± 0.807
GlyArg: 4.184 ± 0.983
GlySer: 9.414 ± 1.406
GlyThr: 6.276 ± 2.284
GlyVal: 5.23 ± 1.397
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.138 ± 0.414
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.184 ± 1.83
HisCys: 1.046 ± 0.845
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.046 ± 0.989
HisGly: 1.046 ± 0.807
HisHis: 2.092 ± 0.915
HisIle: 1.046 ± 0.989
HisLys: 0.0 ± 0.0
HisLeu: 4.184 ± 1.83
HisMet: 0.0 ± 0.0
HisAsn: 2.092 ± 0.915
HisPro: 5.23 ± 2.356
HisGln: 0.0 ± 0.0
HisArg: 1.046 ± 0.989
HisSer: 0.0 ± 0.0
HisThr: 1.046 ± 0.989
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.046 ± 0.845
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.276 ± 1.003
IleCys: 0.0 ± 0.0
IleAsp: 3.138 ± 1.565
IleGlu: 0.0 ± 0.0
IlePhe: 7.322 ± 0.554
IleGly: 3.138 ± 2.966
IleHis: 2.092 ± 0.915
IleIle: 4.184 ± 2.09
IleLys: 1.046 ± 0.989
IleLeu: 1.046 ± 0.807
IleMet: 1.046 ± 0.933
IleAsn: 1.046 ± 0.989
IlePro: 5.23 ± 1.674
IleGln: 0.0 ± 0.0
IleArg: 1.046 ± 0.989
IleSer: 1.046 ± 0.989
IleThr: 4.184 ± 3.042
IleVal: 3.138 ± 1.762
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 3.138 ± 1.218
LysGlu: 0.0 ± 0.0
LysPhe: 3.138 ± 1.565
LysGly: 2.092 ± 1.521
LysHis: 0.0 ± 0.0
LysIle: 2.092 ± 0.824
LysLys: 3.138 ± 2.535
LysLeu: 4.184 ± 1.83
LysMet: 3.138 ± 1.565
LysAsn: 6.276 ± 0.827
LysPro: 0.0 ± 0.0
LysGln: 0.0 ± 0.0
LysArg: 5.23 ± 4.943
LysSer: 2.092 ± 0.824
LysThr: 2.092 ± 0.915
LysVal: 1.046 ± 0.989
LysTrp: 1.046 ± 0.989
LysTyr: 4.184 ± 0.879
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.046 ± 1.146
LeuCys: 3.138 ± 1.132
LeuAsp: 2.092 ± 0.915
LeuGlu: 2.092 ± 0.999
LeuPhe: 2.092 ± 0.915
LeuGly: 5.23 ± 1.397
LeuHis: 3.138 ± 1.505
LeuIle: 3.138 ± 2.295
LeuLys: 2.092 ± 0.915
LeuLeu: 6.276 ± 0.799
LeuMet: 2.092 ± 0.831
LeuAsn: 1.046 ± 0.989
LeuPro: 1.046 ± 0.845
LeuGln: 4.184 ± 1.208
LeuArg: 3.138 ± 0.414
LeuSer: 5.23 ± 2.356
LeuThr: 8.368 ± 1.389
LeuVal: 8.368 ± 1.52
LeuTrp: 2.092 ± 0.915
LeuTyr: 4.184 ± 1.442
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.138 ± 1.505
MetCys: 0.0 ± 0.0
MetAsp: 2.092 ± 0.915
MetGlu: 1.046 ± 1.146
MetPhe: 1.046 ± 0.989
MetGly: 2.092 ± 0.915
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 1.046 ± 0.807
MetAsn: 0.0 ± 0.0
MetPro: 2.092 ± 1.977
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 2.092 ± 0.915
MetThr: 1.046 ± 0.845
MetVal: 2.092 ± 1.977
MetTrp: 0.0 ± 0.0
MetTyr: 1.046 ± 0.989
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.046 ± 0.989
AsnCys: 1.046 ± 0.845
AsnAsp: 2.092 ± 1.521
AsnGlu: 5.23 ± 2.421
AsnPhe: 2.092 ± 0.915
AsnGly: 5.23 ± 2.169
AsnHis: 2.092 ± 0.915
AsnIle: 2.092 ± 0.824
AsnLys: 0.0 ± 0.0
AsnLeu: 3.138 ± 0.414
AsnMet: 1.046 ± 0.989
AsnAsn: 0.0 ± 0.0
AsnPro: 8.368 ± 1.816
AsnGln: 3.138 ± 0.414
AsnArg: 1.046 ± 0.989
AsnSer: 3.138 ± 2.295
AsnThr: 0.0 ± 0.0
AsnVal: 4.184 ± 1.83
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.138 ± 0.414
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.276 ± 3.504
ProCys: 1.046 ± 0.989
ProAsp: 3.138 ± 1.565
ProGlu: 8.368 ± 2.445
ProPhe: 4.184 ± 1.537
ProGly: 4.184 ± 0.799
ProHis: 3.138 ± 1.505
ProIle: 4.184 ± 0.799
ProLys: 5.23 ± 1.674
ProLeu: 1.046 ± 0.989
ProMet: 2.092 ± 0.915
ProAsn: 4.184 ± 1.83
ProPro: 2.092 ± 1.521
ProGln: 2.092 ± 2.292
ProArg: 4.184 ± 1.208
ProSer: 4.184 ± 1.851
ProThr: 2.092 ± 0.999
ProVal: 2.092 ± 0.915
ProTrp: 1.046 ± 0.989
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.138 ± 1.565
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 3.138 ± 1.524
GlnPhe: 2.092 ± 1.977
GlnGly: 1.046 ± 0.807
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 3.138 ± 1.565
GlnLeu: 5.23 ± 1.71
GlnMet: 0.0 ± 0.901
GlnAsn: 1.046 ± 0.989
GlnPro: 3.138 ± 0.414
GlnGln: 1.046 ± 0.807
GlnArg: 4.184 ± 1.851
GlnSer: 0.0 ± 0.0
GlnThr: 1.046 ± 0.845
GlnVal: 2.092 ± 0.999
GlnTrp: 0.0 ± 0.0
GlnTyr: 4.184 ± 1.83
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.23 ± 2.824
ArgCys: 4.184 ± 1.83
ArgAsp: 6.276 ± 0.827
ArgGlu: 3.138 ± 0.414
ArgPhe: 2.092 ± 1.977
ArgGly: 6.276 ± 2.058
ArgHis: 3.138 ± 0.414
ArgIle: 4.184 ± 1.83
ArgLys: 3.138 ± 2.966
ArgLeu: 3.138 ± 2.966
ArgMet: 3.138 ± 1.505
ArgAsn: 2.092 ± 0.915
ArgPro: 3.138 ± 1.816
ArgGln: 2.092 ± 0.915
ArgArg: 5.23 ± 1.397
ArgSer: 8.368 ± 1.816
ArgThr: 4.184 ± 0.799
ArgVal: 1.046 ± 0.989
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.138 ± 0.414
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 13.598 ± 3.773
SerCys: 0.0 ± 0.0
SerAsp: 3.138 ± 1.505
SerGlu: 0.0 ± 0.0
SerPhe: 3.138 ± 1.565
SerGly: 3.138 ± 2.505
SerHis: 1.046 ± 0.989
SerIle: 4.184 ± 0.799
SerLys: 5.23 ± 1.019
SerLeu: 7.322 ± 1.806
SerMet: 0.0 ± 0.0
SerAsn: 5.23 ± 1.71
SerPro: 4.184 ± 1.83
SerGln: 2.092 ± 0.824
SerArg: 11.506 ± 1.032
SerSer: 5.23 ± 2.356
SerThr: 4.184 ± 0.983
SerVal: 2.092 ± 0.915
SerTrp: 0.0 ± 0.0
SerTyr: 2.092 ± 1.222
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.414 ± 2.044
ThrCys: 0.0 ± 0.0
ThrAsp: 1.046 ± 0.989
ThrGlu: 5.23 ± 4.128
ThrPhe: 4.184 ± 1.83
ThrGly: 4.184 ± 1.208
ThrHis: 0.0 ± 0.0
ThrIle: 5.23 ± 1.932
ThrLys: 2.092 ± 0.915
ThrLeu: 6.276 ± 0.799
ThrMet: 0.0 ± 0.0
ThrAsn: 2.092 ± 1.977
ThrPro: 4.184 ± 0.799
ThrGln: 2.092 ± 0.915
ThrArg: 6.276 ± 0.827
ThrSer: 5.23 ± 2.356
ThrThr: 3.138 ± 1.132
ThrVal: 2.092 ± 1.521
ThrTrp: 3.138 ± 2.295
ThrTyr: 4.184 ± 1.208
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 0.0 ± 0.0
ValGlu: 4.184 ± 0.983
ValPhe: 3.138 ± 0.414
ValGly: 4.184 ± 1.537
ValHis: 1.046 ± 0.845
ValIle: 2.092 ± 1.977
ValLys: 3.138 ± 1.132
ValLeu: 4.184 ± 0.799
ValMet: 0.0 ± 0.0
ValAsn: 3.138 ± 1.345
ValPro: 1.046 ± 0.845
ValGln: 6.276 ± 0.827
ValArg: 7.322 ± 1.892
ValSer: 4.184 ± 2.079
ValThr: 4.184 ± 0.799
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 2.092 ± 1.977
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 6.276 ± 2.744
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.092 ± 0.915
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 3.138 ± 1.612
TrpLeu: 2.092 ± 0.915
TrpMet: 2.092 ± 1.977
TrpAsn: 0.0 ± 0.0
TrpPro: 1.046 ± 0.989
TrpGln: 0.0 ± 0.0
TrpArg: 2.092 ± 2.292
TrpSer: 0.0 ± 0.0
TrpThr: 2.092 ± 0.915
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 3.138 ± 0.414
TyrAsp: 1.046 ± 0.989
TyrGlu: 3.138 ± 1.565
TyrPhe: 2.092 ± 0.824
TyrGly: 1.046 ± 0.845
TyrHis: 2.092 ± 0.915
TyrIle: 1.046 ± 0.845
TyrLys: 2.092 ± 1.977
TyrLeu: 8.368 ± 0.512
TyrMet: 0.0 ± 0.0
TyrAsn: 3.138 ± 1.565
TyrPro: 4.184 ± 0.799
TyrGln: 0.0 ± 0.0
TyrArg: 3.138 ± 0.414
TyrSer: 5.23 ± 2.356
TyrThr: 6.276 ± 3.147
TyrVal: 2.092 ± 0.915
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.046 ± 1.146
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (957 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
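One plausible reading of the note above is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported. The choice of the standard deviation as the error measure, the fixed seed, and the function names are assumptions; the page does not specify the exact estimator.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per-mille frequencies of overlapping residue pairs (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.pstdev(values)


# Toy sequences, not the four proteins of this proteome.
toy = ["MKVAAK", "AAKKV", "MVVA", "KAAM"]
err = bootstrap_error(toy, "AA")
```

Under this scheme a dipeptide that is absent from every protein has an error of exactly 0.0, and with only a handful of proteins the estimates can be large relative to the frequencies themselves.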

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski