Class: Work
Overview
rubocop:disable Metrics/ClassLength
Defined Under Namespace
Classes: InvalidGroupError
Instance Attribute Summary collapse
Class Method Summary
collapse
Instance Method Summary
collapse
-
#activities ⇒ Object
-
#add_message(message, current_user_id) ⇒ Object
-
#add_provenance_note(date, note, current_user_id, change_label = "") ⇒ Object
-
#administered_by?(user) ⇒ Boolean
-
#artifact_uploads ⇒ Array<S3File>
Retrieve the S3 file uploads which are research artifacts proper (not README or other files providing metadata/documentation).
-
#as_json(*args) ⇒ String
Generates the JSON serialized expression of the Work.
-
#change_curator(curator_user_id, current_user) ⇒ Object
-
#changes ⇒ Object
-
#clear_curator(current_user) ⇒ Object
-
#created_by_user ⇒ Object
-
#current_transition ⇒ Object
-
#date_approved ⇒ Object
-
#doi_url ⇒ String
Return the DOI formatted as a URL, so it can be used as a link on display pages.
-
#draft_doi ⇒ Object
-
#editable_by?(user) ⇒ Boolean
Is this work editable by a given user? A work is editable when: * it is being edited by the person who made it * it is being edited by a group admin of the group where is resides * it is being edited by a super admin.
-
#editable_in_current_state?(user) ⇒ Boolean
-
#embargoed? ⇒ Boolean
Determine whether or not the Work is under active embargo Embargoes are not fully released until the day after the embargo date.
-
#file_list ⇒ Object
Returns the list of files for the work with some basic information about each of them.
-
#files_location_cluster? ⇒ Boolean
-
#files_location_other? ⇒ Boolean
-
#files_location_upload? ⇒ Boolean
-
#files_mode ⇒ Object
Returns the bucket name where the files are stored for this work.
-
#files_mode_human ⇒ Object
Returns a human friendly name for the bucket where the files for the work are located.
-
#find_bucket_s3_dir(bucket_name:) ⇒ Aws::S3::Types::HeadObjectOutput
Transmit a HEAD request for the S3 Bucket directory for this Work.
-
#form_attributes ⇒ Object
-
#format_date_for_solr(date) ⇒ String
Format the date for Apache Solr.
-
#full_metadata_as_json(*args) ⇒ Object
these are for approved works and what they return on the JSON document.
-
#has_rights?(rights_id) ⇒ Boolean
rubocop:disable Naming/PredicateName.
-
#log_changes(resource_compare, current_user_id) ⇒ Object
-
#log_file_changes(current_user_id) ⇒ Object
-
#mark_new_notifications_as_read(user_id) ⇒ Object
Marks as read the notifications for the given user_id in this work.
-
#new_notification_count_for_user(user_id) ⇒ Object
-
#past_snapshots ⇒ Object
-
#pdc_discovery_url ⇒ Object
This is the solr id / work show page in PDC Discovery.
-
#post_curation_s3_resources ⇒ Object
Accesses post-curation S3 Bucket Objects.
-
#post_curation_uploads(force_post_curation: false) ⇒ Object
Returns the files in post-curation for the work.
-
#pre_curation_uploads ⇒ Object
Fetches the data from S3 directly bypassing ActiveStorage.
-
#presenter ⇒ Object
-
#readme_uploads ⇒ Array<S3File>
Retrieve the S3 file uploads named "README".
-
#reload(options = nil) ⇒ Object
-
#reload_snapshots(user_id: nil) ⇒ UploadSnapshot
Build or find persisted UploadSnapshot models for this Work.
-
#resource ⇒ Object
-
#resource=(resource) ⇒ Object
-
#s3_client ⇒ Object
-
#s3_files ⇒ Object
-
#s3_object_key ⇒ String
Generates the S3 Object key.
-
#s3_query_service ⇒ S3QueryService
S3QueryService object associated with this Work.
-
#state=(new_state) ⇒ Object
-
#submitted_by?(user) ⇒ Boolean
-
#title ⇒ Object
-
#total_file_size ⇒ Object
rubocop:enable Metrics/MethodLength.
-
#total_file_size_from_list(files) ⇒ Object
Calculates the total file size from a given list of files This is so that we don't fetch the list twice from AWS since it can be expensive when there are thousands of files on the work.
-
#track_change(action, filename) ⇒ Object
-
#unpublished_metadata_json(*_args) ⇒ Object
this method returns only the DOI and the work state, for now, for draft and withdrawn works so that only that information is visible when harvested.
-
#update_curator(curator_user_id, current_user) ⇒ Object
-
#uploads ⇒ Array<S3File>
Retrieve the S3 file uploads associated with the Work.
-
#uploads_attributes ⇒ Object
-
#url ⇒ Object
Instance Attribute Details
#user_entered_doi ⇒ Object
Returns the value of attribute user_entered_doi.
18
19
20
|
# File 'app/models/work.rb', line 18
def user_entered_doi
@user_entered_doi
end
|
Class Method Details
.find_by_ark(ark) ⇒ Object
107
108
109
110
111
|
# File 'app/models/work.rb', line 107
def find_by_ark(ark)
prefix = "ark:/"
ark = "#{prefix}#{ark}" unless ark.blank? || ark.start_with?(prefix)
Work.find_by!("metadata @> ?", JSON.dump(ark:))
end
|
.find_by_doi(doi) ⇒ Object
101
102
103
104
105
|
# File 'app/models/work.rb', line 101
def find_by_doi(doi)
prefix = "10.34770/"
doi = "#{prefix}#{doi}" unless doi.blank? || doi.start_with?(prefix)
Work.find_by!("metadata @> ?", JSON.dump(doi:))
end
|
.list_embargoed ⇒ Object
115
116
117
|
# File 'app/models/work.rb', line 115
def list_embargoed
Work.where("embargo_date >= current_date").where(state: "approved")
end
|
.list_released_embargo ⇒ Object
119
120
121
|
# File 'app/models/work.rb', line 119
def list_released_embargo
Work.where("embargo_date = current_date-1").where(state: "approved")
end
|
.presenter_class ⇒ Object
525
526
527
|
# File 'app/models/work.rb', line 525
def self.presenter_class
WorkPresenter
end
|
Instance Method Details
#add_message(message, current_user_id) ⇒ Object
#add_provenance_note(date, note, current_user_id, change_label = "") ⇒ Object
#administered_by?(user) ⇒ Boolean
96
97
98
|
# File 'app/models/work.rb', line 96
def administered_by?(user)
user.has_role?(:group_admin, group)
end
|
#artifact_uploads ⇒ Array<S3File>
Retrieve the S3 file uploads which are research artifacts proper (not README or other files providing metadata/documentation)
344
345
346
|
# File 'app/models/work.rb', line 344
def artifact_uploads
uploads.reject { |s3_file| s3_file.filename.include?("README") }
end
|
#as_json(*args) ⇒ String
Generates the JSON serialized expression of the Work
451
452
453
454
455
456
457
|
# File 'app/models/work.rb', line 451
def as_json(*args)
if state == "approved" || (args.first && args.first[:force_post_curation])
full_metadata_as_json(*args)
else
unpublished_metadata_json(*args)
end
end
|
#change_curator(curator_user_id, current_user) ⇒ Object
229
230
231
232
233
234
235
|
# File 'app/models/work.rb', line 229
def change_curator(curator_user_id, current_user)
if curator_user_id == "no-one"
clear_curator(current_user)
else
update_curator(curator_user_id, current_user)
end
end
|
#changes ⇒ Object
533
534
535
|
# File 'app/models/work.rb', line 533
def changes
@changes ||= []
end
|
#clear_curator(current_user) ⇒ Object
237
238
239
240
241
242
243
244
|
# File 'app/models/work.rb', line 237
def clear_curator(current_user)
self.curator_user_id = nil
save!
WorkActivity.add_work_activity(id, "Unassigned existing curator", current_user.id, activity_type: WorkActivity::SYSTEM)
end
|
#created_by_user ⇒ Object
195
196
197
198
199
|
# File 'app/models/work.rb', line 195
def created_by_user
User.find(created_by_user_id)
rescue ActiveRecord::RecordNotFound
nil
end
|
#current_transition ⇒ Object
324
325
326
|
# File 'app/models/work.rb', line 324
def current_transition
aasm.current_event.to_s.humanize.delete("!")
end
|
#date_approved ⇒ Object
590
591
592
593
594
|
# File 'app/models/work.rb', line 590
def date_approved
@approved_activity ||= WorkActivity.where("work_id = ? and message='marked as Approved'", id).first
return nil if @approved_activity.nil?
@approved_activity.created_at.to_date.to_s
end
|
#doi_url ⇒ String
Return the DOI formatted as a URL, so it can be used as a link on display pages
190
191
192
193
|
# File 'app/models/work.rb', line 190
def doi_url
return "https://doi.org/#{doi}" unless doi.starts_with?("https://doi.org")
doi
end
|
#draft_doi ⇒ Object
182
183
184
185
186
|
# File 'app/models/work.rb', line 182
def draft_doi
return if resource.doi.present?
resource.doi = datacite_service.draft_doi
save!
end
|
#editable_by?(user) ⇒ Boolean
Is this work editable by a given user?
A work is editable when:
- it is being edited by the person who made it
- it is being edited by a group admin of the group where is resides
- it is being edited by a super admin
80
81
82
|
# File 'app/models/work.rb', line 80
def editable_by?(user)
submitted_by?(user) || administered_by?(user)
end
|
#editable_in_current_state?(user) ⇒ Boolean
84
85
86
87
88
89
90
|
# File 'app/models/work.rb', line 84
def editable_in_current_state?(user)
return editable_by?(user) if draft?
administered_by?(user)
end
|
#embargoed? ⇒ Boolean
Determine whether or not the Work is under active embargo
Embargoes are not fully released until the day after the embargo date
555
556
557
558
559
560
|
# File 'app/models/work.rb', line 555
def embargoed?
return false if embargo_date.blank?
current_date = Time.zone.now.to_date
current_date <= embargo_date
end
|
#file_list ⇒ Object
Returns the list of files for the work with some basic information about each of them.
This method is much faster than uploads because it does not return the actual S3File
objects to the client, instead it returns just a few selected data elements.
rubocop:disable Metrics/MethodLength
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
|
# File 'app/models/work.rb', line 352
def file_list
start = Time.zone.now
s3_files = approved? ? post_curation_uploads : pre_curation_uploads
files_info = s3_files.map do |s3_file|
{
"safe_id": s3_file.safe_id,
"filename": s3_file.filename,
"filename_display": s3_file.filename_display,
"last_modified": s3_file.last_modified,
"last_modified_display": s3_file.last_modified_display,
"size": s3_file.size,
"display_size": s3_file.display_size,
"url": s3_file.url,
"is_folder": s3_file.is_folder
}
end
log_performance(start, "file_list called for #{id}")
files_info
end
|
#files_location_cluster? ⇒ Boolean
221
222
223
|
# File 'app/models/work.rb', line 221
def files_location_cluster?
files_location == "file_cluster"
end
|
#files_location_other? ⇒ Boolean
225
226
227
|
# File 'app/models/work.rb', line 225
def files_location_other?
files_location == "file_other"
end
|
#files_location_upload? ⇒ Boolean
217
218
219
|
# File 'app/models/work.rb', line 217
def files_location_upload?
files_location.blank? || files_location == "file_upload"
end
|
#files_mode ⇒ Object
Returns the bucket name where the files are stored for this work.
#files_mode_human ⇒ Object
Returns a human friendly name for the bucket where the files for the work are located.
Notice that we don't use the values from PULS3Client because those are not human friendly
(e.g. the lack dashes between words)
565
566
567
568
569
570
571
572
573
574
575
|
# File 'app/models/work.rb', line 565
def files_mode_human
if approved?
if embargoed?
"embargo"
else
"post-curation"
end
else
"pre-curation"
end
end
|
#find_bucket_s3_dir(bucket_name:) ⇒ Aws::S3::Types::HeadObjectOutput
Transmit a HEAD request for the S3 Bucket directory for this Work
434
435
436
437
438
439
440
441
442
443
444
|
# File 'app/models/work.rb', line 434
def find_bucket_s3_dir(bucket_name:)
s3_client.head_object({
bucket: bucket_name,
key: s3_object_key
})
true
rescue Aws::S3::Errors::NotFound
nil
end
|
176
177
178
179
180
|
# File 'app/models/work.rb', line 176
def form_attributes
{
uploads: uploads_attributes
}
end
|
Format the date for Apache Solr
485
486
487
|
# File 'app/models/work.rb', line 485
def format_date_for_solr(date)
date.strftime("%Y-%m-%dT%H:%M:%SZ")
end
|
these are for approved works and what they return on the JSON document
460
461
462
463
464
465
466
467
468
469
470
471
|
# File 'app/models/work.rb', line 460
def full_metadata_as_json(*args)
files = files_as_json(*args)
{
"resource" => resource.as_json.except("datacite_serialization"),
"files" => files,
"group" => group.as_json.except("id"),
"embargo_date" => embargo_date_as_json,
"created_at" => format_date_for_solr(created_at),
"updated_at" => format_date_for_solr(updated_at),
"date_approved" => date_approved
}
end
|
#has_rights?(rights_id) ⇒ Boolean
rubocop:disable Naming/PredicateName
542
543
544
|
# File 'app/models/work.rb', line 542
def has_rights?(rights_id)
resource.rights_many.index { |rights| rights.identifier == rights_id } != nil
end
|
#log_changes(resource_compare, current_user_id) ⇒ Object
278
279
280
281
|
# File 'app/models/work.rb', line 278
def log_changes(resource_compare, current_user_id)
return if resource_compare.identical?
WorkActivity.add_work_activity(id, resource_compare.differences.to_json, current_user_id, activity_type: WorkActivity::CHANGES)
end
|
#log_file_changes(current_user_id) ⇒ Object
#mark_new_notifications_as_read(user_id) ⇒ Object
Marks as read the notifications for the given user_id in this work.
In practice, the user_id is the id of the current user and therefore this method marks the current's user
notifications as read.
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
|
# File 'app/models/work.rb', line 302
def mark_new_notifications_as_read(user_id)
now_utc = Time.now.utc
WorkActivityNotification.joins(:work_activity)
.where(user_id:)
.where(work_activity: { work_id: id })
.in_batches(of: 1000)
.update_all(read_at: now_utc)
end
|
#new_notification_count_for_user(user_id) ⇒ Object
292
293
294
295
296
297
|
# File 'app/models/work.rb', line 292
def new_notification_count_for_user(user_id)
WorkActivityNotification.joins(:work_activity)
.where(user_id:, read_at: nil)
.where(work_activity: { work_id: id })
.count
end
|
#past_snapshots ⇒ Object
498
499
500
|
# File 'app/models/work.rb', line 498
def past_snapshots
UploadSnapshot.where(work: self)
end
|
#pdc_discovery_url ⇒ Object
This is the solr id / work show page in PDC Discovery
548
549
550
|
# File 'app/models/work.rb', line 548
def pdc_discovery_url
"https://datacommons.princeton.edu/discovery/catalog/doi-#{doi.tr('/', '-').tr('.', '-')}"
end
|
#post_curation_s3_resources ⇒ Object
Accesses post-curation S3 Bucket Objects
394
395
396
397
398
399
400
|
# File 'app/models/work.rb', line 394
def post_curation_s3_resources
if approved?
s3_resources
else
[]
end
end
|
#post_curation_uploads(force_post_curation: false) ⇒ Object
Returns the files in post-curation for the work
403
404
405
406
407
408
409
410
411
412
|
# File 'app/models/work.rb', line 403
def post_curation_uploads(force_post_curation: false)
if force_post_curation
post_curation_s3_query_service = S3QueryService.new(self, PULS3Client::POSTCURATION)
post_curation_s3_query_service.data_profile.fetch(:objects, [])
else
post_curation_s3_resources
end
end
|
#pre_curation_uploads ⇒ Object
Fetches the data from S3 directly bypassing ActiveStorage
389
390
391
|
# File 'app/models/work.rb', line 389
def pre_curation_uploads
s3_query_service.client_s3_files.sort_by(&:filename)
end
|
#presenter ⇒ Object
529
530
531
|
# File 'app/models/work.rb', line 529
def presenter
self.class.presenter_class.new(work: self)
end
|
#readme_uploads ⇒ Array<S3File>
Retrieve the S3 file uploads named "README"
338
339
340
|
# File 'app/models/work.rb', line 338
def readme_uploads
uploads.select { |s3_file| s3_file.filename.include?("README") }
end
|
#reload(options = nil) ⇒ Object
Overload ActiveRecord.reload method
https://apidock.com/rails/ActiveRecord/Base/reload
NOTE: Usually after_save is a better place to put this kind of code:
after_save do |work|
work.resource = nil
end
but that does not work in this case because the block points to a different
memory object for work than the we want we want to reload.
152
153
154
155
156
157
|
# File 'app/models/work.rb', line 152
def reload(options = nil)
super
@resource = nil
self
end
|
#reload_snapshots(user_id: nil) ⇒ UploadSnapshot
Build or find persisted UploadSnapshot models for this Work
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
|
# File 'app/models/work.rb', line 505
def reload_snapshots(user_id: nil)
work_changes = []
s3_files = pre_curation_uploads
s3_filenames = s3_files.map(&:filename)
upload_snapshot = latest_snapshot
upload_snapshot.snapshot_deletions(work_changes, s3_filenames)
upload_snapshot.snapshot_modifications(work_changes, s3_files)
unless work_changes.empty?
new_snapshot = UploadSnapshot.new(work: self, url: s3_query_service.prefix)
new_snapshot.store_files(s3_files)
new_snapshot.save!
WorkActivity.add_work_activity(id, work_changes.to_json, user_id, activity_type: WorkActivity::FILE_CHANGES)
end
end
|
#resource=(resource) ⇒ Object
201
202
203
204
205
|
# File 'app/models/work.rb', line 201
def resource=(resource)
@resource = resource
self.metadata = JSON.parse(resource.to_json)
end
|
#s3_client ⇒ Object
418
419
420
|
# File 'app/models/work.rb', line 418
def s3_client
s3_query_service.client
end
|
#s3_files ⇒ Object
414
415
416
|
# File 'app/models/work.rb', line 414
def s3_files
pre_curation_uploads
end
|
#s3_object_key ⇒ String
Generates the S3 Object key
427
428
429
|
# File 'app/models/work.rb', line 427
def s3_object_key
"#{doi}/#{id}"
end
|
S3QueryService object associated with this Work
494
495
496
|
# File 'app/models/work.rb', line 494
def s3_query_service
@s3_query_service ||= S3QueryService.new(self, files_mode)
end
|
#state=(new_state) ⇒ Object
65
66
67
68
69
70
|
# File 'app/models/work.rb', line 65
def state=(new_state)
new_state_sym = new_state.to_sym
valid_states = self.class.aasm.states.map(&:name)
raise(StandardError, "Invalid state '#{new_state}'") unless valid_states.include?(new_state_sym)
aasm_write_state_without_persistence(new_state_sym)
end
|
#submitted_by?(user) ⇒ Boolean
92
93
94
|
# File 'app/models/work.rb', line 92
def submitted_by?(user)
created_by_user_id == user.id
end
|
#title ⇒ Object
159
160
161
|
# File 'app/models/work.rb', line 159
def title
resource.main_title
end
|
#total_file_size ⇒ Object
rubocop:enable Metrics/MethodLength
373
374
375
376
377
378
379
|
# File 'app/models/work.rb', line 373
def total_file_size
total_size = 0
file_list.each do |file|
total_size += file[:size]
end
total_size
end
|
#total_file_size_from_list(files) ⇒ Object
Calculates the total file size from a given list of files
This is so that we don't fetch the list twice from AWS since it can be expensive when
there are thousands of files on the work.
384
385
386
|
# File 'app/models/work.rb', line 384
def total_file_size_from_list(files)
files.sum { |file| file[:size] }
end
|
#track_change(action, filename) ⇒ Object
537
538
539
|
# File 'app/models/work.rb', line 537
def track_change(action, filename)
changes << { action:, filename: }
end
|
this method returns only the DOI and the work state, for now, for draft and withdrawn works
so that only that information is visible when harvested.
475
476
477
478
479
480
|
# File 'app/models/work.rb', line 475
def unpublished_metadata_json(*_args)
{
"resource" => { "doi" => resource.doi },
"state" => state
}
end
|
#update_curator(curator_user_id, current_user) ⇒ Object
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
|
# File 'app/models/work.rb', line 246
def update_curator(curator_user_id, current_user)
self.curator_user_id = curator_user_id
save!
new_curator = User.find(curator_user_id)
work_url = "[#{title}](#{Rails.application.routes.url_helpers.work_url(self)})"
if work_url.include?("/describe/describe/")
Rails.logger.error("URL #{work_url} included /describe/describe/ and was fixed. See https://github.com/pulibrary/pdc_describe/issues/1783")
work_url = work_url.gsub("/describe/describe/", "/describe/")
end
message = if curator_user_id.to_i == current_user.id
"Self-assigned @#{current_user.uid} as curator for work #{work_url}"
else
"Set curator to @#{new_curator.uid} for work #{work_url}"
end
WorkActivity.add_work_activity(id, message, current_user.id, activity_type: WorkActivity::SYSTEM)
end
|
#uploads ⇒ Array<S3File>
Retrieve the S3 file uploads associated with the Work
330
331
332
333
334
|
# File 'app/models/work.rb', line 330
def uploads
return post_curation_uploads if approved?
pre_curation_uploads
end
|
#uploads_attributes ⇒ Object
163
164
165
166
167
168
169
170
171
172
173
174
|
# File 'app/models/work.rb', line 163
def uploads_attributes
return [] if approved? uploads.map do |upload|
{
id: upload.id,
key: upload.key,
filename: upload.filename.to_s,
created_at: upload.created_at,
url: upload.url
}
end
end
|
#url ⇒ Object
211
212
213
214
215
|
# File 'app/models/work.rb', line 211
def url
return unless persisted?
@url ||= url_for(self)
end
|