Amino acid dipeptide frequency for Beihai shrimp virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
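As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping windows inside each protein and never across protein boundaries, and that any residue outside the 20 standard ones is mapped to Xaa (written X here). The function name and the toy sequences are made up for this example; this is not the code used by the database.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard residues

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the viral proteins).
toy = ["MKVLAAGA", "MAAKU"]
freqs = dipeptide_permille(toy)
print(round(freqs["AA"], 1))  # 181.8 (2 of 11 windows)
```

Pooling the counts over all proteins before normalising means that longer proteins contribute more windows; averaging per-protein frequencies would be a different, equally plausible choice.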

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
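The arithmetic behind the expected value can be sketched as follows. The page does not state the exact rule used for colouring, so the plain comparison against the uniform expectation below is an assumption, and the function names are hypothetical.

```python
N_STANDARD = 20 * 20   # 400 dipeptides from the 20 standard residues
N_WITH_XAA = 21 * 21   # 441 when Xaa is counted as a 21st residue

def expected_permille(n_dipeptides=N_STANDARD):
    """Frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / n_dipeptides

def classify(observed_permille, n_dipeptides=N_STANDARD):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(n_dipeptides)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

print(expected_permille())  # 2.5, i.e. 0.25%
print(classify(5.486))      # AlaAla from the table: over
print(classify(0.323))      # CysCys from the table: under
```

With Xaa included, the uniform expectation drops to 1000 / 441, roughly 2.27‰.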

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
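A short sketch of the conversion, with a rough sanity check against the table. The check assumes that the 3100 amino acids in 2 proteins reported on this page yield 3100 - 2 = 3098 overlapping dipeptide windows; that counting scheme is an assumption, not something the page states.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

def approx_count(value_permille, n_windows):
    """Approximate number of occurrences behind a per mille value."""
    return round(permille_to_fraction(value_permille) * n_windows)

# Assumption: one window fewer than residues in each of the 2 proteins.
N_WINDOWS = 3100 - 2

print(approx_count(5.486, N_WINDOWS))  # AlaAla: 17
print(approx_count(0.323, N_WINDOWS))  # CysCys: 1
```

Under that assumption, the smallest non-zero value in the table (0.323‰) corresponds to a single occurrence, which appears consistent with the other values being multiples of it.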

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.486 ± 1.214
AlaCys: 0.323 ± 0.169
AlaAsp: 4.84 ± 0.875
AlaGlu: 4.84 ± 0.235
AlaPhe: 3.227 ± 0.584
AlaGly: 3.872 ± 0.188
AlaHis: 0.645 ± 0.216
AlaIle: 3.55 ± 0.358
AlaLys: 4.84 ± 1.346
AlaLeu: 6.776 ± 0.781
AlaMet: 2.581 ± 0.8
AlaAsn: 1.291 ± 0.433
AlaPro: 2.904 ± 0.141
AlaGln: 3.55 ± 0.358
AlaArg: 2.581 ± 0.866
AlaSer: 7.099 ± 0.16
AlaThr: 7.099 ± 2.381
AlaVal: 3.55 ± 0.753
AlaTrp: 0.968 ± 0.602
AlaTyr: 1.613 ± 0.292
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.645 ± 0.772
CysCys: 0.323 ± 0.386
CysAsp: 0.645 ± 0.339
CysGlu: 1.613 ± 0.847
CysPhe: 1.291 ± 0.433
CysGly: 0.968 ± 0.047
CysHis: 0.645 ± 0.216
CysIle: 0.645 ± 0.339
CysLys: 0.968 ± 0.508
CysLeu: 1.613 ± 0.292
CysMet: 0.0 ± 0.0
CysAsn: 0.645 ± 0.339
CysPro: 1.936 ± 0.649
CysGln: 0.323 ± 0.169
CysArg: 0.323 ± 0.386
CysSer: 0.323 ± 0.169
CysThr: 0.968 ± 0.508
CysVal: 0.0 ± 0.0
CysTrp: 0.323 ± 0.169
CysTyr: 0.968 ± 0.047
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.195 ± 1.092
AspCys: 0.968 ± 0.508
AspAsp: 3.55 ± 0.198
AspGlu: 4.195 ± 2.203
AspPhe: 4.195 ± 1.092
AspGly: 3.227 ± 0.028
AspHis: 1.291 ± 0.433
AspIle: 2.259 ± 1.591
AspLys: 1.291 ± 0.678
AspLeu: 4.518 ± 0.706
AspMet: 1.936 ± 0.649
AspAsn: 0.968 ± 0.047
AspPro: 3.872 ± 0.367
AspGln: 1.291 ± 0.678
AspArg: 1.613 ± 0.292
AspSer: 4.195 ± 0.019
AspThr: 4.518 ± 0.151
AspVal: 3.227 ± 0.028
AspTrp: 0.968 ± 0.047
AspTyr: 2.259 ± 0.48
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.776 ± 0.885
GluCys: 1.291 ± 0.678
GluAsp: 3.55 ± 1.308
GluGlu: 8.39 ± 2.184
GluPhe: 1.936 ± 0.461
GluGly: 4.195 ± 0.019
GluHis: 1.936 ± 0.094
GluIle: 5.163 ± 1.045
GluLys: 3.872 ± 0.367
GluLeu: 2.581 ± 0.245
GluMet: 1.936 ± 0.094
GluAsn: 3.872 ± 0.923
GluPro: 2.259 ± 1.591
GluGln: 5.808 ± 0.282
GluArg: 1.936 ± 0.094
GluSer: 8.39 ± 1.629
GluThr: 5.486 ± 0.452
GluVal: 7.099 ± 1.506
GluTrp: 2.259 ± 0.075
GluTyr: 2.904 ± 1.252
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.936 ± 0.094
PheCys: 1.291 ± 0.122
PheAsp: 2.904 ± 0.141
PheGlu: 5.808 ± 0.828
PhePhe: 1.613 ± 0.292
PheGly: 1.936 ± 0.094
PheHis: 0.645 ± 0.339
PheIle: 1.291 ± 0.433
PheLys: 3.227 ± 0.527
PheLeu: 1.936 ± 1.017
PheMet: 1.936 ± 0.094
PheAsn: 2.904 ± 0.141
PhePro: 3.872 ± 0.367
PheGln: 1.291 ± 0.122
PheArg: 0.968 ± 0.047
PheSer: 2.581 ± 0.311
PheThr: 3.55 ± 0.753
PheVal: 2.581 ± 0.245
PheTrp: 0.968 ± 0.508
PheTyr: 0.968 ± 0.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.518 ± 0.706
GlyCys: 0.323 ± 0.169
GlyAsp: 2.904 ± 0.141
GlyGlu: 2.904 ± 0.696
GlyPhe: 1.936 ± 0.649
GlyGly: 1.613 ± 0.292
GlyHis: 0.645 ± 0.216
GlyIle: 3.872 ± 0.188
GlyLys: 3.227 ± 0.028
GlyLeu: 4.195 ± 0.574
GlyMet: 1.613 ± 0.264
GlyAsn: 2.581 ± 1.355
GlyPro: 0.968 ± 1.158
GlyGln: 1.936 ± 0.094
GlyArg: 1.613 ± 0.264
GlySer: 3.227 ± 1.082
GlyThr: 2.904 ± 1.807
GlyVal: 3.872 ± 0.923
GlyTrp: 0.645 ± 0.339
GlyTyr: 2.904 ± 0.97
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.291 ± 0.988
HisCys: 0.323 ± 0.169
HisAsp: 0.645 ± 0.339
HisGlu: 0.968 ± 0.047
HisPhe: 1.936 ± 0.461
HisGly: 1.291 ± 0.122
HisHis: 1.291 ± 0.433
HisIle: 0.645 ± 0.339
HisLys: 0.968 ± 0.047
HisLeu: 1.613 ± 0.847
HisMet: 0.968 ± 0.602
HisAsn: 0.323 ± 0.169
HisPro: 1.936 ± 0.094
HisGln: 0.0 ± 0.0
HisArg: 0.323 ± 0.169
HisSer: 1.291 ± 0.433
HisThr: 2.259 ± 0.48
HisVal: 3.227 ± 0.584
HisTrp: 0.645 ± 0.216
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.55 ± 0.358
IleCys: 0.323 ± 0.386
IleAsp: 2.904 ± 0.414
IleGlu: 4.518 ± 0.96
IlePhe: 0.968 ± 1.158
IleGly: 1.936 ± 1.017
IleHis: 1.613 ± 0.292
IleIle: 1.613 ± 0.847
IleLys: 3.55 ± 0.753
IleLeu: 2.904 ± 0.414
IleMet: 1.936 ± 0.094
IleAsn: 5.163 ± 0.49
IlePro: 2.581 ± 0.311
IleGln: 1.613 ± 0.264
IleArg: 2.581 ± 1.355
IleSer: 1.936 ± 1.017
IleThr: 3.872 ± 0.188
IleVal: 5.808 ± 0.828
IleTrp: 0.968 ± 0.508
IleTyr: 1.613 ± 0.292
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.904 ± 0.414
LysCys: 2.259 ± 0.48
LysAsp: 2.259 ± 0.075
LysGlu: 4.84 ± 1.431
LysPhe: 1.613 ± 0.264
LysGly: 2.904 ± 1.252
LysHis: 0.968 ± 0.047
LysIle: 4.518 ± 1.261
LysLys: 3.227 ± 1.139
LysLeu: 5.163 ± 0.621
LysMet: 0.323 ± 0.169
LysAsn: 1.291 ± 0.678
LysPro: 1.613 ± 0.819
LysGln: 1.613 ± 0.292
LysArg: 4.518 ± 0.151
LysSer: 1.936 ± 0.461
LysThr: 6.131 ± 0.998
LysVal: 4.195 ± 0.574
LysTrp: 0.323 ± 0.386
LysTyr: 2.259 ± 0.48
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.163 ± 0.621
LeuCys: 1.291 ± 0.122
LeuAsp: 5.163 ± 1.6
LeuGlu: 4.518 ± 0.706
LeuPhe: 4.195 ± 0.537
LeuGly: 5.486 ± 0.452
LeuHis: 1.613 ± 0.292
LeuIle: 3.872 ± 1.478
LeuLys: 4.195 ± 0.019
LeuLeu: 6.131 ± 2.664
LeuMet: 2.904 ± 1.252
LeuAsn: 1.936 ± 0.094
LeuPro: 4.518 ± 0.96
LeuGln: 1.936 ± 0.461
LeuArg: 4.195 ± 1.092
LeuSer: 5.486 ± 0.452
LeuThr: 2.904 ± 0.141
LeuVal: 5.163 ± 0.621
LeuTrp: 1.613 ± 0.292
LeuTyr: 2.581 ± 0.866
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.968 ± 0.508
MetCys: 0.0 ± 0.0
MetAsp: 1.291 ± 0.122
MetGlu: 2.904 ± 0.97
MetPhe: 1.291 ± 0.122
MetGly: 0.968 ± 0.047
MetHis: 0.0 ± 0.0
MetIle: 1.291 ± 0.678
MetLys: 2.581 ± 0.245
MetLeu: 0.968 ± 0.602
MetMet: 0.968 ± 0.508
MetAsn: 1.613 ± 0.292
MetPro: 1.291 ± 0.433
MetGln: 0.645 ± 0.216
MetArg: 1.613 ± 0.264
MetSer: 3.872 ± 0.188
MetThr: 3.227 ± 1.082
MetVal: 0.645 ± 0.339
MetTrp: 0.323 ± 0.386
MetTyr: 1.291 ± 0.433
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.195 ± 0.019
AsnCys: 0.645 ± 0.339
AsnAsp: 1.613 ± 0.264
AsnGlu: 3.227 ± 1.082
AsnPhe: 1.936 ± 1.205
AsnGly: 2.581 ± 0.8
AsnHis: 1.291 ± 0.678
AsnIle: 3.227 ± 0.584
AsnLys: 2.259 ± 0.075
AsnLeu: 4.518 ± 1.261
AsnMet: 1.291 ± 0.608
AsnAsn: 1.936 ± 0.094
AsnPro: 2.259 ± 0.48
AsnGln: 0.968 ± 0.602
AsnArg: 0.645 ± 0.339
AsnSer: 1.936 ± 1.205
AsnThr: 2.259 ± 0.48
AsnVal: 3.872 ± 0.367
AsnTrp: 0.645 ± 0.216
AsnTyr: 0.645 ± 0.216
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.904 ± 0.97
ProCys: 0.968 ± 0.602
ProAsp: 1.613 ± 0.264
ProGlu: 3.227 ± 0.527
ProPhe: 0.968 ± 0.508
ProGly: 1.613 ± 0.292
ProHis: 0.323 ± 0.169
ProIle: 2.904 ± 0.141
ProLys: 0.645 ± 0.216
ProLeu: 3.55 ± 0.913
ProMet: 1.613 ± 0.292
ProAsn: 1.936 ± 1.205
ProPro: 0.968 ± 0.602
ProGln: 1.936 ± 0.461
ProArg: 3.55 ± 0.198
ProSer: 5.163 ± 2.842
ProThr: 5.808 ± 3.614
ProVal: 4.84 ± 0.235
ProTrp: 0.968 ± 0.047
ProTyr: 1.291 ± 0.122
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.55 ± 0.358
GlnCys: 0.323 ± 0.386
GlnAsp: 1.291 ± 0.122
GlnGlu: 3.55 ± 1.308
GlnPhe: 1.291 ± 0.678
GlnGly: 1.613 ± 0.264
GlnHis: 1.291 ± 0.122
GlnIle: 2.904 ± 0.414
GlnLys: 0.323 ± 0.169
GlnLeu: 5.163 ± 0.621
GlnMet: 0.323 ± 0.169
GlnAsn: 1.291 ± 0.122
GlnPro: 1.291 ± 0.433
GlnGln: 1.936 ± 0.461
GlnArg: 2.581 ± 0.311
GlnSer: 2.581 ± 0.245
GlnThr: 2.904 ± 0.414
GlnVal: 3.872 ± 0.367
GlnTrp: 0.323 ± 0.386
GlnTyr: 0.968 ± 0.508
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.227 ± 1.139
ArgCys: 0.0 ± 0.0
ArgAsp: 2.259 ± 0.075
ArgGlu: 2.259 ± 0.631
ArgPhe: 2.904 ± 1.525
ArgGly: 1.291 ± 0.433
ArgHis: 1.291 ± 0.122
ArgIle: 2.904 ± 0.696
ArgLys: 5.486 ± 0.659
ArgLeu: 4.195 ± 0.574
ArgMet: 1.291 ± 0.678
ArgAsn: 1.936 ± 1.017
ArgPro: 3.55 ± 0.913
ArgGln: 2.259 ± 0.631
ArgArg: 4.195 ± 1.092
ArgSer: 1.291 ± 0.678
ArgThr: 2.581 ± 0.8
ArgVal: 2.581 ± 0.866
ArgTrp: 0.323 ± 0.169
ArgTyr: 2.259 ± 0.075
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.486 ± 1.562
SerCys: 1.291 ± 0.678
SerAsp: 3.872 ± 0.188
SerGlu: 7.099 ± 0.715
SerPhe: 4.84 ± 0.875
SerGly: 3.55 ± 1.468
SerHis: 1.613 ± 0.264
SerIle: 2.581 ± 0.311
SerLys: 4.195 ± 0.537
SerLeu: 4.518 ± 0.706
SerMet: 0.968 ± 0.332
SerAsn: 2.581 ± 0.311
SerPro: 2.581 ± 0.8
SerGln: 1.936 ± 0.094
SerArg: 3.872 ± 2.033
SerSer: 5.808 ± 0.282
SerThr: 6.454 ± 2.72
SerVal: 6.131 ± 2.334
SerTrp: 0.323 ± 0.169
SerTyr: 1.936 ± 0.461
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.776 ± 2.551
ThrCys: 1.936 ± 0.461
ThrAsp: 3.872 ± 0.367
ThrGlu: 5.163 ± 1.732
ThrPhe: 3.227 ± 0.028
ThrGly: 3.872 ± 1.299
ThrHis: 1.936 ± 0.094
ThrIle: 2.259 ± 0.075
ThrLys: 3.227 ± 1.082
ThrLeu: 7.099 ± 2.381
ThrMet: 2.259 ± 0.075
ThrAsn: 4.195 ± 2.795
ThrPro: 5.486 ± 2.118
ThrGln: 3.55 ± 0.198
ThrArg: 4.518 ± 1.261
ThrSer: 5.808 ± 1.393
ThrThr: 7.422 ± 2.767
ThrVal: 3.872 ± 1.478
ThrTrp: 0.968 ± 0.047
ThrTyr: 2.581 ± 0.866
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.131 ± 0.113
ValCys: 0.0 ± 0.0
ValAsp: 5.808 ± 0.838
ValGlu: 6.776 ± 0.329
ValPhe: 3.227 ± 0.028
ValGly: 3.872 ± 0.367
ValHis: 1.613 ± 0.264
ValIle: 3.55 ± 1.308
ValLys: 3.55 ± 0.753
ValLeu: 3.872 ± 0.923
ValMet: 1.613 ± 0.292
ValAsn: 3.227 ± 0.028
ValPro: 2.259 ± 0.631
ValGln: 4.518 ± 1.817
ValArg: 2.904 ± 0.696
ValSer: 5.163 ± 0.066
ValThr: 4.84 ± 1.901
ValVal: 6.131 ± 0.668
ValTrp: 2.904 ± 0.696
ValTyr: 1.936 ± 1.017
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.323 ± 0.169
TrpCys: 0.0 ± 0.0
TrpAsp: 0.968 ± 0.602
TrpGlu: 1.613 ± 0.847
TrpPhe: 0.968 ± 0.602
TrpGly: 0.323 ± 0.169
TrpHis: 0.323 ± 0.169
TrpIle: 0.323 ± 0.169
TrpLys: 1.291 ± 0.988
TrpLeu: 0.323 ± 0.169
TrpMet: 0.323 ± 0.386
TrpAsn: 1.291 ± 0.433
TrpPro: 0.323 ± 0.169
TrpGln: 0.968 ± 0.047
TrpArg: 1.613 ± 0.264
TrpSer: 1.613 ± 0.292
TrpThr: 2.904 ± 0.141
TrpVal: 0.968 ± 0.047
TrpTrp: 0.323 ± 0.386
TrpTyr: 0.645 ± 0.339
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.613 ± 0.292
TyrCys: 1.291 ± 0.433
TyrAsp: 2.581 ± 1.355
TyrGlu: 3.227 ± 0.028
TyrPhe: 0.645 ± 0.216
TyrGly: 1.291 ± 0.122
TyrHis: 0.968 ± 0.047
TyrIle: 2.581 ± 0.311
TyrLys: 1.936 ± 0.461
TyrLeu: 3.227 ± 0.028
TyrMet: 0.645 ± 0.216
TyrAsn: 0.968 ± 0.602
TyrPro: 0.323 ± 0.169
TyrGln: 1.291 ± 0.988
TyrArg: 1.936 ± 1.017
TyrSer: 2.259 ± 0.075
TyrThr: 1.936 ± 0.649
TyrVal: 2.259 ± 0.48
TyrTrp: 0.645 ± 0.216
TyrTyr: 2.259 ± 0.075
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3100 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
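One way such a protein-level bootstrap could be set up is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the population standard deviation as the error, the seed, and the helper names are all assumptions made for illustration; the page does not describe these details.

```python
import random
from collections import Counter
from statistics import pstdev

def dipeptide_permille(sequences):
    """Pooled dipeptide frequencies in per mille (overlapping windows)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    # Assumption: the error is the standard deviation of the replicates.
    return pstdev(estimates)

# Toy input (hypothetical sequences, not the viral proteins).
print(bootstrap_error(["MKVLAAGA", "MAAKW"], "AA"))
```

With only 2 proteins, as on this page, each replicate can take very few distinct compositions, so the resulting error estimates are necessarily coarse.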

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski